banner

Data and Processing


Police Department Incident Reports: Historical 2003 to May 2018

- File Type: CSV
- File Size: 30.6 MB
- Source: Data SF
- License: ODC Public Domain Dedication and Licence (PDDL)
- Rows: 149,636
- Columns: 13
- Description: This dataset includes police incident reports filed by officers and by individuals through self-service online reporting for non-emergency cases. (From the description on the website)

Contents used:
- Category: Type of incident reported and recorded by the police.
- PdDistrict: The police district that the incident occurred in.


Police Department Incident Reports: 2018 to Present

- File Type: CSV
- File size: 53.3 MB
- Source Data SF
- License: ODC Public Domain Dedication and Licence (PDDL)
- Rows: 154,470
- Columns: 26
- Description: This dataset includes police incident reports filed by officers and by individuals through self-service online reporting for non-emergency cases. (From the description on the website)

Contents used:
- Incident Category: Type of incident reported and recorded to the police.
- Police District: The police district that the incident occurred in.


Current Police Districts

- File Type: GeoJSON
- File size: 309 KB
- Source Data SF
- License: ODC Public Domain Dedication and Licence (PDDL)
- Description: Map of San Francisco represented by SFPD Districts after July 19th, 2015.

Initial Processing

I filtered the data with Tableau conforming to my goals, which is to look at offensive incidents in the police reports and getting the total number of offensive incident records. I then created a few of my own CSV files with Atom for the filtered data to implement more efficiently.