Skip to content

jgeng99/San-Diego-Accident-Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

30 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

San Diego Accident Analysis

Authors: Yacun Wang, Jianming Geng, Huy Trinh
June 16th, 2023

Problem Statement

Improve profitability of an insurance company by understanding patterns and factors contributing to accidents in the city of San Diego.

Requirements

Code Environment

Throughout the analysis we're using Python in Jupyter Notebooks to run data processing, insertion, and analysis. Specific Python packages listed in the notebooks could be downloaded via pip install <package>.

Databases

We use two databases to store our data:

  • PostgreSQL Logo (Relational Database): Version 13+
  • Neo4j Logo (Graph Database):
    • Base Kernel: Version 5.7.0
    • Awesome Procedures on Cypher (APOC) Library: Version 5.7.0
    • Graph Data Science (GDS) Library: Version 2.3.7

Data

Raw Data Sources

We are using San Diego Data Portal for our raw data, including information on the accidents, Get-It-Done Reports, and roads:

Running data preprocessing requires Python GIS packages and an ArcGIS Online Account with ESRI credits, so we don't recommend running the preprocessing as we have prepared processed data.

Processed Data

All processed data are created and stored under the Shared Google Drive. Download these data and store them in the data directory, as well as the "neo4j-docker - GDS"/db/import directory.

Analysis Outline

As suggested by the data sources and their relationships, all analysis questions are separated into 4 parts:

  1. Accident Information
  2. Accident Vehicle Information
  3. Accident Road Information
  4. Accident Information in Relation to Get-It-Done Reports

Main Files Outline

San-Diego-Accident-Analysis/
├── data/                            <- all processed data files
│   ├── accidents.csv
│   ├── accidents_info.csv
│   ├── roads.csv
│   ├── reports.csv
│   ├── accidents_on_road.csv
│   └── reports_on_road.csv
├── src/                             <- all code
│   ├── preprocessing.ipynb
│   ├── data-loading.ipynb
│   └── data-analysis.ipynb
└── README.md
Logo

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published