Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
48 commits
Select commit Hold shift + click to select a range
4604ffd
Create proposal.md
jacobkingery Feb 26, 2016
5f5ce87
added dataset
Feb 26, 2016
cd187cd
started geocoding
Mar 1, 2016
f92e8dc
started exploration
Mar 1, 2016
3a588a6
added gitignore
Mar 1, 2016
45cabcc
fixed lat/lon plotting
Mar 1, 2016
b0079aa
changed to accept multiple geocode responses, but only use if close t…
Mar 1, 2016
f1fa0c1
started new geocoding
Mar 1, 2016
0544b79
added more geocodes
Mar 2, 2016
9cc588a
ran more geocodes
Mar 3, 2016
8799097
merging
MacIElson Mar 3, 2016
0d89de3
upload csv with all alts and longs filled in, started with Nominatim …
MacIElson Mar 3, 2016
d2bf92d
add bing map csv
MacIElson Mar 3, 2016
6fd72b6
upload csv with google's geolocate
MacIElson Mar 3, 2016
97dd1d3
owrked on exploartion notebook
MacIElson Mar 3, 2016
d8ccf5c
add blank mid project checkin document
MacIElson Mar 4, 2016
ced415a
Update mid_project_checkin.md
MacIElson Mar 4, 2016
0070b2a
changed scatter plot to hexbin plot
Mar 4, 2016
2f282b0
commit before merge
MacIElson Mar 4, 2016
5133de6
added files we worked on in class
MacIElson Mar 7, 2016
e3d5f0d
Merge branch 'master' of https://github.com/MaciCrowell/DataScience16CTW
MacIElson Mar 7, 2016
9df6f48
added baseline models
Mar 7, 2016
329b563
changed to train_test_split to stratify correctly
Mar 7, 2016
466032d
added initial linear regression model
Mar 7, 2016
b4253ea
added function to do cross validation
MacIElson Mar 8, 2016
5bf70ac
fixed merge conflict
MacIElson Mar 8, 2016
5f596a7
tweaked linear regression features
Mar 8, 2016
68d029b
tried k-nn model
Mar 9, 2016
ff0db86
merging
MacIElson Mar 9, 2016
41dcdec
Merge branch 'master' of https://github.com/MaciCrowell/DataScience16CTW
MacIElson Mar 9, 2016
697b00c
forced hospital seperation
MacIElson Mar 9, 2016
3b85505
did k-NN parameter sweeps
Mar 10, 2016
e371b21
Create FinalOutputandReflection.md
MacIElson Mar 11, 2016
e3b8b4b
Update FinalOutputandReflection.md
MacIElson Mar 11, 2016
77bfcb7
added grid search for random forest and documented geolocate2.py
MacIElson Mar 11, 2016
35f235a
Merge branch 'master' of https://github.com/MaciCrowell/DataScience16CTW
MacIElson Mar 11, 2016
1a6d4ee
Update FinalOutputandReflection.md
MacIElson Mar 11, 2016
ef2f845
started writeup
Mar 11, 2016
cd74499
Merge branch 'master' of github.com:MaciCrowell/DataScience16CTW
Mar 11, 2016
803f5b9
Update FinalOutputandReflection.md
MacIElson Mar 11, 2016
f780538
ran exploration notebook final time
Mar 11, 2016
b13319f
Merge branch 'master' of github.com:MaciCrowell/DataScience16CTW
Mar 11, 2016
e081f62
added model to writeup
MacIElson Mar 11, 2016
283c3c0
Merge branch 'master' of https://github.com/MaciCrowell/DataScience16CTW
MacIElson Mar 11, 2016
3cb840e
added more to writeup
Mar 11, 2016
8799291
Merge branch 'master' of github.com:MaciCrowell/DataScience16CTW
Mar 11, 2016
b26a4e1
Update FinalOutputandReflection.md
jacobkingery Mar 11, 2016
4ddd062
ran writeup notebook
Mar 11, 2016
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
3 changes: 3 additions & 0 deletions .gitignore
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
.ipynb_checkpoints/
*.pyc
*.zip
31 changes: 31 additions & 0 deletions FinalOutputandReflection.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,31 @@
#### Assessment evidence and interpretation:

Our initial assessment plan: "It should be assessed by how well we conducted our exploration and model creation, as well as how well we achieve our learning goals."

In order for you to assess us in this manner we think it would be helpful for you to look at the following files/notebooks:

data_exploration.ipynb - this is were we explored the data

geolocate.py (initial attempt with open sources) and geolocate2.py (final attempt using google API) - this was our code
for geocoding the addresses of hospitals into latitude and longitude

loadAndClean.py - this is where we prepared our data for modeling

model_exploration.ipynb - our exploration of different models

knn_gridsearch.ipynb - k-Nearest Neighbors grid search

RandomForest_gridsearch.ipynb - Random Forest grid search

writeup.ipynb - our final writeup


#### Changing the world:

We do not think that the output of our project will change the world (other than in a butterfly effect fashion). We weren't able to produce a very acurate model, and none of the visualizations we developed were intened to be anything other than exploration tools for ourselves. We hope that seeing our project might inspire someone else to do something impactful but that's about all the impact we foresee this having.

#### Learning goals:

Our stated learning goals were: "We want to learn more about the healthcare system ourselves as well as gain experience in taking a data set, understanding it, and making it useful."

Overall we feel we achieved our learning goals. We did learn more about how Medicare works from a billing/organizational perspective. There is still a huge amount we don't know, but we think we made progress in becoming more informed. We also feel like we absolutely gained more experience in taking a data set, transfomring the data and trying to make it useful (even if we're not sure how useful our final output is).
153,601 changes: 153,601 additions & 0 deletions IPPS_2013.csv

Large diffs are not rendered by default.

Loading