Down Shift

verify-tagChicago Crime

crimebeginnerdata analyticsxgboostbinary classification

5

已售 0
10.15MB

数据标识:D17171508241356228

发布时间:2024/05/31

数据描述

This dataset has gained popularity over time and is widely known. While Kaggle courses teach how to use Google BigQuery to extract a sample from it, this notebook provides a HOW-TO guide to access the dataset directly within your own notebook. Instead of uploading the entire dataset here, which is quite large, I offer several alternatives to work with a smaller portion of it. My main focus was to demonstrate various techniques to make the dataset more manageable on your own laptop, ensuring smoother operations. Additionally, I've included some interesting insights on basic descriptive statistics and even a modeling example, which can be further explored based on your preferences. I intend to revisit and refine it in the near future to enhance its rigor. Meanwhile, I welcome any suggestions to improve the notebook!

Here are the columns that I have chosen to include (after carefully eliminating a few others):

  • Date: This column represents the timestamp of the incident. From this column, I have extracted the Month, Day, and Hour information. We can also add additional time-based columns such as Week and Day of the Week, among others.
  • Block: This column provides a partially redacted address where the incident occurred, indicating the same block as the actual address.
  • IUCR: The acronym stands for Illinois Uniform Crime Reporting. This code is directly linked to the Primary Type and Description. You can find more information about it in this link.
  • Primary Type: This column describes the primary category of the IUCR code mentioned above.
  • Description: This column provides a secondary description of the IUCR code, serving as a subcategory of the primary description.
  • Location Description: Here, you can find the description of the location where the incident took place.
  • Arrest: This column indicates whether an arrest was made in relation to the incident.
  • Domestic: It shows whether the incident was domestic-related, as defined by the Illinois Domestic Violence Act.
  • Beat: The beat refers to the smallest police geographic area, with each beat having a dedicated territory. You can find more information about it in this link.
  • District: This column represents the police district where the incident occurred.
  • Ward: It refers to the number that labels the City Council district where the incident took place.
  • Community Areas: This column indicates the community area where the incident occurred. Chicago has a total of 77 community areas.
  • FBI Code: The crime classification outlined in the FBI's National Incident-Based Reporting System (NIBRS).
  • X-Coordinate, Y-Coordinate, Latitude, Longitude, Location: These columns provide information about the geographical coordinates of the incident location, including latitude and longitude. The "Location" column contains just the latitude and longitude coordinates.
  • Year, Updated On: These columns represent the year of the incident and the date on which the dataset was last updated.

Feel free to explore the notebook and provide any suggestions for improvement. Your feedback is highly appreciated!

验证报告

以下为卖家选择提供的数据验证报告:

data icon
Chicago Crime
5
已售 0
10.15MB
申请报告