Skip to content

This project involves performing exploratory data analysis (EDA) on a large dataset containing over 802,000+ crime records from Los Angeles. The dataset includes 28 columns, and the analysis focuses on understanding crime patterns, victim demographics, crime types, and more.

Notifications You must be signed in to change notification settings

dharmendradiwaker/Exploratory-Data-Analysis-of-Los-Angeles-Crime-Data

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

5 Commits
Β 
Β 
Β 
Β 

Repository files navigation

Exploratory Data Analysis of Los Angeles Crime Data πŸš”πŸ“Š

Overview

This project involves performing exploratory data analysis (EDA) on a large dataset containing over 802,000+ crime records from Los Angeles. The dataset includes 28 columns, and the analysis focuses on understanding crime patterns, victim demographics, crime types, and more. The insights are visualized using various tools, including Pandas, Seaborn, Plotly, and Geo Heatmap.

Key Findings πŸ”

  • Over 50% of crime victims are aged between 21-40 years. πŸ‘ΆπŸ‘¨β€πŸ¦±
  • More than 50% of victims are male. 🚹
  • Over 20% of crimes were committed using deadly weapons. πŸ”ͺ🧨

Dataset πŸ“

The dataset contains more than 802,000 records, each representing a unique crime. It includes 28 columns such as:

  • Crime Type πŸ”
  • Victim Demographics πŸ§‘β€πŸ¦³
  • Crime Location πŸ“
  • Time of Crime ⏰
  • Weapon Used πŸ”«

Requirements πŸ“‹

  • Python (version 3.6 or higher recommended)
  • Required Python libraries:
    • Pandas (for data manipulation) πŸ“Š
    • Seaborn (for data visualization) 🎨
    • Plotly (for interactive visualizations) πŸ“‰
    • GeoPandas (for geographical data visualizations) 🌍
    • Matplotlib (for additional plotting support) πŸ–ΌοΈ

You can install these dependencies using pip:

pip install pandas seaborn plotly geopandas matplotlib

Analysis and Visualizations πŸ“Š

  • Data Cleaning: Missing values handled, and data was preprocessed for analysis.
  • Visualization Techniques:
    • Scatter Plots: To understand correlations and trends.
    • Pie Charts: For visualizing categorical distributions, like the gender of victims.
    • Geo Heatmap: To visualize crime hotspots across Los Angeles on a map.
    • Histograms: To understand the distribution of victims' ages and crime types.

Setup βš™οΈ

  1. Clone this repository to your local machine:

    git clone https://github.com/dharmendradiwaker/Exploratory-Data-Analysis-of-Los-Angeles-Crime-Data-.git
  2. Navigate to the project folder:

    cd exploratory-data-analysis-los-angeles-crime
  3. Install the required libraries:

    pip install -r requirements.txt
  4. Run the Jupyter Notebook or Python script:

    jupyter notebook Crime_EDA.ipynb

How to Use πŸ§‘β€πŸ’»

  1. Load the dataset from the provided CSV file in the data/ folder.
  2. Follow the notebook to perform EDA and visualize the results.
  3. Explore the plots and heatmaps to gain insights into crime trends and victim demographics.

Conclusion πŸ“Œ

The analysis provides valuable insights into crime patterns in Los Angeles, especially concerning the age and gender of victims, weapon usage, and the geographic distribution of crimes. This project can help inform policy and improve crime prevention strategies.

Contributors πŸ™‹β€β™‚οΈ

  • @Dharmendradiwaker12

About

This project involves performing exploratory data analysis (EDA) on a large dataset containing over 802,000+ crime records from Los Angeles. The dataset includes 28 columns, and the analysis focuses on understanding crime patterns, victim demographics, crime types, and more.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published