---
title: "NYC subway crime"
output:
  html_document:
    includes:
      in_header: ga_script.html
    toc: true
    toc_float: true
    code_folding: hide
editor_options:
  chunk_output_type: console
---
<link rel="stylesheet" href="academicons.css"/>
<link rel="stylesheet" href="styles.css" type="text/css">
#### Zheyan Liu, Chaoqi Wu, Baode Gao, Youyuan Kong, Zexu Yuan
#### Columbia University, Mailman School of Public Health
<br>
## Motivation
The New York City subway, one of the main forms of public transportation for New Yorkers, is highly convenient for residents, but it also exposes passengers to potential danger: busier stations attract criminals for certain kinds of crime such as pickpocketing, grand larceny, and assault. A closed train compartment can also facilitate crime and make it harder for victims to escape.
<br>
<br>
On November 21, 2021, around 12:00 AM, at 34th Street-Penn Station in Manhattan, Alkeem Loney, a 32-year-old man, was stabbed in the neck in an unprovoked attack and was later pronounced dead, according to the NYPD. The deadly incident was the latest in a spate of underground violence that comes as the MTA tries to get commuters back on mass transit. The crime raised considerable public concern about safety at subway stations, which affects almost everyone who lives, works, or studies in New York City.
As students living in New York City, most of us take the subway to campus early in the morning and back to our apartments at night on weekdays, and to meet friends on weekends. Staying safe at subway stations therefore matters to us personally. We hope to help people find comparatively safe and reliable routes when taking the subway.
## Data
### Subway Crime
The original subway crime data has two parts. [The first one](https://data.cityofnewyork.us/Public-Safety/NYPD-Complaint-Data-Current-Year-To-Date-/5uac-w243) (NYPD Complaint Data Current, Year To Date) contains all valid felony, misdemeanor, and violation crimes reported to the New York City Police Department ([NYPD](https://www1.nyc.gov/site/nypd/index.page)). The [second one](https://data.cityofnewyork.us/Public-Safety/NYPD-Complaint-Data-Historic/qgea-i56i) (NYPD Complaint Data Historic) includes similar crimes. We join these two data frames and analyze only crimes that occurred in the NYC subway.
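
As a rough illustration of the combine-and-filter step described above, the sketch below stacks two complaint tables, drops complaints that appear in both, and keeps only subway incidents. It is not the project's actual code: the column names `CMPLNT_NUM` and `PREM_TYP_DESC`, the assumption that subway incidents can be recognized by the word "SUBWAY" in the premises description, and the helper name `combine_subway_complaints` are all assumptions made for this example, and the rows are invented.

```python
def combine_subway_complaints(current_rows, historic_rows):
    """Stack two complaint tables, drop duplicate complaint IDs, keep subway rows."""
    seen = set()
    subway_rows = []
    for row in list(historic_rows) + list(current_rows):
        complaint_id = row["CMPLNT_NUM"]  # assumed unique complaint identifier
        if complaint_id in seen:
            continue  # the same complaint is present in both extracts
        seen.add(complaint_id)
        # Assumed convention: subway incidents are tagged in the premises column.
        if "SUBWAY" in (row.get("PREM_TYP_DESC") or "").upper():
            subway_rows.append(row)
    return subway_rows


# Invented rows for illustration only (not real complaint records).
historic = [
    {"CMPLNT_NUM": "A1", "PREM_TYP_DESC": "TRANSIT - NYC SUBWAY", "OFNS_DESC": "GRAND LARCENY"},
    {"CMPLNT_NUM": "A2", "PREM_TYP_DESC": "STREET", "OFNS_DESC": "ROBBERY"},
]
current = [
    {"CMPLNT_NUM": "A1", "PREM_TYP_DESC": "TRANSIT - NYC SUBWAY", "OFNS_DESC": "GRAND LARCENY"},
    {"CMPLNT_NUM": "B7", "PREM_TYP_DESC": "TRANSIT - NYC SUBWAY", "OFNS_DESC": "FELONY ASSAULT"},
    {"CMPLNT_NUM": "B8", "PREM_TYP_DESC": None, "OFNS_DESC": "PETIT LARCENY"},
]

subway = combine_subway_complaints(current, historic)
print([row["CMPLNT_NUM"] for row in subway])
```

Deduplicating on the complaint identifier matters only if the year-to-date and historic extracts overlap; if they cover disjoint periods, the check is harmless.
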
### Subway Passenger
The original [subway passenger data](http://web.mta.info/developers/turnstile.html) comes from the [MTA](https://new.mta.info/) (Metropolitan Transportation Authority). It contains total entries and exits at each station in 4-hour intervals from 2010 to the present.
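
To make the 4-hour structure concrete, here is a minimal sketch of rolling interval counts up to daily totals per station. The field names (`station`, `timestamp`, `entries`, `exits`), the timestamp format, and the helper `daily_station_totals` are hypothetical, and the numbers are invented. The sketch assumes each record already holds the count for its own interval; if the raw counters are cumulative, consecutive readings would need to be differenced first.

```python
from collections import defaultdict
from datetime import datetime


def daily_station_totals(records):
    """Sum per-interval entries and exits into one total per (station, date)."""
    totals = defaultdict(lambda: {"entries": 0, "exits": 0})
    for rec in records:
        # Assumed timestamp format; adjust to whatever the cleaned data uses.
        day = datetime.strptime(rec["timestamp"], "%Y-%m-%d %H:%M").date()
        key = (rec["station"], day.isoformat())
        totals[key]["entries"] += rec["entries"]
        totals[key]["exits"] += rec["exits"]
    return dict(totals)


# Invented 4-hour records for illustration only.
records = [
    {"station": "34 ST-PENN STA", "timestamp": "2021-11-20 20:00", "entries": 900, "exits": 700},
    {"station": "34 ST-PENN STA", "timestamp": "2021-11-21 00:00", "entries": 300, "exits": 250},
    {"station": "34 ST-PENN STA", "timestamp": "2021-11-21 04:00", "entries": 120, "exits": 80},
    {"station": "116 ST-COLUMBIA", "timestamp": "2021-11-21 04:00", "entries": 40, "exits": 35},
]

totals = daily_station_totals(records)
for (station, day), counts in sorted(totals.items()):
    print(station, day, counts["entries"], counts["exits"])
```
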
## What you can find on this website
* Exploratory data analysis of subway crime by [time](events_and_time.html) (year, month, hour)
* Exploratory data analysis of subway crime by [location](events_and_location.html) (coordinate-based K-means cluster, subway line, station)
* [Analysis of subway passenger flow](PassengerEDA.html) and its relationship with location, including how we use coordinate-based K-means in this analysis
* An [interactive Shiny dashboard](https://stephenyuan.shinyapps.io/CrimeMapShiny/) about subway crime, where people can look up the crime rate at each location and the distribution of each crime type
* A Shiny app for [subway passenger flow animation and info lookup on each line](https://chaoqiwu0324.shinyapps.io/subway_passenger/)
* A [GNN](https://en.wikipedia.org/wiki/Graph_neural_network) model for predicting the existence and severity of subway crime
* A [No-crime Navigation Shiny app](https://zheyanliu.shinyapps.io/NYC_subway_findroute) based on the [Google Maps API](https://developers.google.com/maps) (provides the subway route), the GNN model (predicts crime existence), and historical passenger data (gives the crowdedness of a route)
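
The coordinate-based K-means clustering mentioned in the list above can be sketched as follows. This is a plain Lloyd's algorithm written for illustration, not the project's implementation: the function name `kmeans`, the deterministic first-k initialization, and the approximate coordinates are assumptions, and Euclidean distance on raw latitude/longitude is only a rough stand-in for true distance over a city-sized area.

```python
import math


def kmeans(points, k, iterations=20):
    """Plain Lloyd's algorithm on (lat, lon) pairs; returns (centroids, labels)."""
    # Deterministic start for this sketch: use the first k points as centroids.
    centroids = [points[i] for i in range(k)]
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: each point joins its nearest centroid.
        labels = [
            min(range(k), key=lambda c: math.dist(p, centroids[c]))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        new_centroids = []
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                new_centroids.append(tuple(sum(v) / len(members) for v in zip(*members)))
            else:
                new_centroids.append(centroids[c])  # keep an empty cluster in place
        if new_centroids == centroids:
            break  # assignments can no longer change
        centroids = new_centroids
    return centroids, labels


# Approximate, illustrative coordinates: three points near Midtown and
# three near Morningside Heights, interleaved.
points = [
    (40.750, -73.990), (40.808, -73.964),
    (40.752, -73.993), (40.810, -73.960),
    (40.755, -73.987), (40.805, -73.966),
]

centroids, labels = kmeans(points, k=2)
print(labels)
```

The resulting cluster label can then serve as a coarse location feature alongside subway line and station.
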
## Team members
<img src = "images/Zheyan2.jpg" style = "width:26%; float:left; padding-right:25px; padding-bottom:25px">
**Zheyan Liu**
[MS in Biostatistics at Columbia University](http://www.mailman.columbia.edu/academic-departments/biostatistics)
Email: [[email protected]](mailto:[email protected])
[Website](https://zheyanliu99.github.io/), [Linkedin](https://www.linkedin.com/in/zheyanliu999/), [Github](https://github.com/zheyanliu99?tab=repositories)
*Contributions*
* 1 Cleaned and imputed subway passenger data; built the project website and organized the final report
* 2 Conducted EDA on subway passenger data and location; engineered features from coordinates with K-means
* 3 Built the subway navigation app based on the Google Maps APIs and adapted the GNN for crime prediction on each route
<br>
<br>
<br>
<img src = "images/Chaoqi.jfif" style = "width:26%; float:left; padding-right:25px; padding-bottom:25px">
**Chaoqi Wu**
[MS in Biostatistics at Columbia University](http://www.mailman.columbia.edu/academic-departments/biostatistics)
Email: [[email protected]](mailto:[email protected])
[Website](https://chaoqiw0324.github.io/Chaoqi-Wu.github.io/), [Linkedin](https://www.linkedin.com/in/%E8%B6%85%E7%A5%BA-%E5%90%B4-52426a222/), [Github](https://github.com/chaoqiw0324)
*Contributions*
* 1 Cleaned the subway passenger data and clarified the relationship between stations and lines
* 2 Conducted EDA on subway passenger data and analyzed the relationship among passengers, lines, and time
* 3 Built the subway passenger app, which provides a search function based on line and time
<br>
<img src = "images/Baode.jpg" style = "width:26%; float:left; padding-right:25px; padding-bottom:25px">
**Baode Gao**
[MS in Biostatistics at Columbia University](http://www.mailman.columbia.edu/academic-departments/biostatistics)
Email: [[email protected]](mailto:[email protected])
[Website](https://buddgao.github.io/Baode_Gao.github.io/), [Linkedin](https://www.linkedin.com/in/baode-gao-389249176/), [Github](https://github.com/BuddGao?tab=repositories)
*Contributions*
* 1 Cleaned, grouped, and transformed the data into the form the GNN accepts
* 2 Applied a graph autoencoder to our data, evaluated it, and adapted it for crime prediction on each route
* 3 Contributed to the model report on the website
<br>
<img src = "images/Youyuan.jpg" style = "width:26%; float:left; padding-right:25px; padding-bottom:25px">
**Youyuan Kong**
[MS in Biostatistics at Columbia University](http://www.mailman.columbia.edu/academic-departments/biostatistics)
Email: [[email protected]](mailto:[email protected])
[Website](https://keviant.github.io/p8105_hw4_yk2960.github.io/#Dashboard),
[Linkedin](https://www.linkedin.com/in/youyuan-kong-a983b91b2/),
[Github](https://github.com/Keviant?tab=repositories)
*Contributions*
* 1 Cleaned and imputed crime-by-location data; created a crime rate Shiny dashboard; participated in writing the report
* 2 Conducted EDA on subway passenger data and location
<br>
<br>
<br>
<br>
<img src = "images/pic.jpg" style = "width:26%; float:left; padding-right:25px; padding-bottom:25px">
**Zexu Yuan**
[MPH in Biostatistics at Columbia University](http://www.mailman.columbia.edu/academic-departments/biostatistics)
Email: [[email protected]](mailto:[email protected])
[Website](https://zexuyuan.github.io/Stephen_Yuan.github.io/),
[Linkedin](https://www.linkedin.com/in/zexuyuan/),
[Github](https://github.com/ZexuYuan)
*Contributions*
* 1 Conducted exploratory data analysis of the relationship between the occurrence of crime events and time
* 2 Contributed to building the Shiny dashboard for the crime events map