Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
173 commits
Select commit Hold shift + click to select a range
e5e7639
check in the notebooks for cuDF example
yidong72 Dec 14, 2018
c25a0dc
modified: cuml/knn_demo.ipynb
genesisfactor Mar 22, 2019
4c33538
removed benchmarks notebook due to matlibplot dependancy. Will put i…
genesisfactor Mar 22, 2019
38b7409
Merge pull request #101 from taureandyernv/06fixes
taureandyernv Mar 22, 2019
5d35ed0
Removed Blog notebook as it was transferred to notebooks extended
genesisfactor Mar 22, 2019
5835541
Merge pull request #102 from taureandyernv/06fixes
taureandyernv Mar 22, 2019
808eb5f
Removed extra test code in KNN
genesisfactor Mar 22, 2019
e74e7d9
fixed path
genesisfactor Mar 22, 2019
277b4dd
Merge pull request #103 from taureandyernv/06fixes
taureandyernv Mar 22, 2019
8eb0e17
Updating README.md files to match current notebooks.
rlratzel Mar 29, 2019
54a3ca6
Updates to README.md to match current notebooks.
rlratzel Mar 29, 2019
d96617d
Additional updates to provide full set of runtime numbers and to fix …
rlratzel Mar 29, 2019
4cbfc02
Merge pull request #115 from rlratzel/master
raydouglass Mar 29, 2019
5a9ca6d
Merge pull request #114 from rapidsai/branch-0.6-release
raydouglass Mar 29, 2019
a0741ba
Update ridge_regression_demo.ipynb
Apr 2, 2019
181b79e
Update ridge_regression_demo.ipynb
Apr 2, 2019
3538377
Update ridge_regression_demo.ipynb
Apr 2, 2019
25aaacd
Update linear_regression_demo.ipynb
Apr 11, 2019
b1816ec
Update ridge_regression_demo.ipynb
Apr 11, 2019
3e3a8cb
Update dbscan_demo.ipynb
Apr 12, 2019
cb03682
Update dbscan_demo.ipynb
Apr 15, 2019
651f6e5
Update knn_demo.ipynb
Apr 16, 2019
2636219
Update knn_demo.ipynb
Apr 16, 2019
23161fa
Update pca_demo.ipynb
Apr 16, 2019
6a623f8
Update tsvd_demo.ipynb
Apr 16, 2019
b3fa7bd
Update tsvd_demo.ipynb
Apr 16, 2019
c55284d
Update umap_demo.ipynb
Apr 16, 2019
05e8d02
Update umap_demo.ipynb
Apr 16, 2019
a4d960e
Update umap_demo.ipynb
Apr 16, 2019
e3300d2
Update tsvd_demo.ipynb
Apr 16, 2019
68b0e29
Update umap_demo.ipynb
Apr 17, 2019
d0215fa
Update umap_demo.ipynb
Apr 17, 2019
6a4b478
Update pca_demo.ipynb
Apr 17, 2019
458a9d4
Update umap_demo.ipynb
Apr 17, 2019
d6e3f56
Update umap_demo.ipynb
Apr 17, 2019
546c430
Adding Vertex Similarity notebook and related folders
BradReesWork Apr 26, 2019
c084060
Create a notebook to demonstrate vertex renumbering
ChuckHastings Apr 26, 2019
78104f7
PageRank and Weighted Jaccard Notebooks
BradReesWork Apr 30, 2019
38caf70
Louvain notebook
BradReesWork Apr 30, 2019
2122ff2
Update dbscan_demo.ipynb
Apr 30, 2019
b72ef42
Update dbscan_demo.ipynb
Apr 30, 2019
aaa471f
Update dbscan_demo.ipynb
Apr 30, 2019
63cf7b9
added new image for pagerank, updated syntax in Louvain
BradReesWork May 1, 2019
1b091dc
Added BFS
BradReesWork May 1, 2019
6031687
updated
BradReesWork May 1, 2019
697a8c4
Added SSSP
BradReesWork May 1, 2019
d18fa88
Create coordinate_descent.ipynb
May 1, 2019
3ccd88b
Update coordinate_descent.ipynb
May 1, 2019
c2c8024
Update coordinate_descent.ipynb
May 1, 2019
f557dcf
Update coordinate_descent.ipynb
May 1, 2019
632e212
Update coordinate_descent.ipynb
May 1, 2019
71b4e86
Merge pull request #127 from ChuckHastings/fea_renumbering_vertices
taureandyernv May 3, 2019
2038340
Updated PageRank
BradReesWork May 3, 2019
136ef92
Spectral Cluster, both Balanced Cut and Modularity
BradReesWork May 3, 2019
40e6c36
Added Triangle Counting
BradReesWork May 3, 2019
6a461c1
removed some whitespace and empty cells
taureandyernv May 3, 2019
9237c95
fixed num_verticies() error
taureandyernv May 3, 2019
3fd8d5e
added veribage on sort warning
taureandyernv May 3, 2019
cd127a3
Merge pull request #126 from BradReesWork/branch-0.7
taureandyernv May 3, 2019
753ca83
added verbiage on the reaining notebooks
taureandyernv May 3, 2019
a5df0e4
Update knn_demo.ipynb
May 4, 2019
fba0ee1
Update ridge_regression_demo.ipynb
May 4, 2019
503f3fc
Update sgd_demo.ipynb
May 4, 2019
945142f
Update sgd_demo.ipynb
May 4, 2019
8d63983
added umap and decomposition notebooks
taureandyernv May 4, 2019
5ccc3e6
Merge pull request #130 from Salonijain27/coordinate_descent
taureandyernv May 6, 2019
161a72a
Update link to fannie mae mortgage data
joshwyatt May 6, 2019
04546b2
Merge pull request #131 from joshwyatt/fix-mortgage-data-links
taureandyernv May 6, 2019
ce2c7fa
0.7 notebook clean up
taureandyernv May 7, 2019
5163b4c
Merge remote-tracking branch 'upstream/master' into branch-0.7
taureandyernv May 7, 2019
3152ee4
Merge pull request #7 from rapidsai/branch-0.8
May 7, 2019
8721e7f
Update README.md
May 7, 2019
1838040
Update README.md
May 7, 2019
85538d6
Merge pull request #8 from rapidsai/branch-0.7
May 7, 2019
71c9853
Update README.md
May 7, 2019
8adc7a6
Update README.md
May 7, 2019
c923423
Update README.md
May 7, 2019
3fbfad1
Update README.md
May 7, 2019
3585c01
Update README.md
May 7, 2019
0785733
Update README.md
May 7, 2019
c2da80f
Update README.md
May 7, 2019
d00c821
added updated cugraph notebooks
taureandyernv May 7, 2019
6196ad9
Merge pull request #135 from Salonijain27/updated-readme
taureandyernv May 7, 2019
13fd44c
Merge branch 'branch-0.7' of https://github.com/rapidsai/notebooks in…
taureandyernv May 7, 2019
856fe18
updated the readme with the rest of the cugraph notebooks, and small …
taureandyernv May 8, 2019
48e14b5
Merge pull request #136 from taureandyernv/readmeupdate
taureandyernv May 8, 2019
9d29b42
Merge pull request #137 from taureandyernv/readmeupdate
taureandyernv May 8, 2019
d8a7dd4
Update README.md
taureandyernv May 8, 2019
fd8e6b9
Merge pull request #138 from taureandyernv/readmeupdate
taureandyernv May 8, 2019
d21ed5f
Merge pull request #139 from taureandyernv/readmeupdate
taureandyernv May 8, 2019
55c279b
added UMAP supervised notebook. Edited UMAP graphed notebook. Change…
taureandyernv May 8, 2019
b607aca
updated umap supervised
taureandyernv May 8, 2019
3147bfb
Merge pull request #140 from taureandyernv/umapupdates
taureandyernv May 8, 2019
4ff8539
Merge pull request #141 from taureandyernv/umapupdates
taureandyernv May 8, 2019
2e69d12
Switching verbose to False for UMAP() API call.
rlratzel May 9, 2019
b9f0e12
Merge pull request #143 from rlratzel/branch-0.7
cjnolet May 9, 2019
b68e76f
Added a Changelog
BradReesWork May 9, 2019
51e7942
PR 144 - ChnageLog
BradReesWork May 9, 2019
86345fe
cuGraph README
BradReesWork May 10, 2019
37005fa
Updated to fix typos
BradReesWork May 10, 2019
0662d7f
Merge pull request #144 from BradReesWork/branch-0.7
taureandyernv May 10, 2019
b64f970
Merge pull request #9 from rapidsai/branch-0.7
May 10, 2019
b590d23
Update README.md
May 10, 2019
cca9d8c
Merge pull request #146 from Salonijain27/updated-readme
taureandyernv May 10, 2019
1e7dd35
Update sgd_demo.ipynb
May 16, 2019
1982209
Update sgd_demo.ipynb
May 16, 2019
58387f8
Update sgd_demo.ipynb
May 16, 2019
c54718a
Merge pull request #157 from Salonijain27/bug-fix-sgd
taureandyernv May 21, 2019
a2c9e85
Links to notebooks in README.md
divyegala May 21, 2019
e4b2893
Merge pull request #158 from divyegala/notebook-links
taureandyernv May 21, 2019
1230653
Remove unused import from sgd_demo
divyegala May 22, 2019
1957704
Removed unused Timer class
divyegala May 22, 2019
27ad673
Merge pull request #159 from divyegala/issue-fixes
taureandyernv May 28, 2019
adf89da
Random Forest Demo
divyegala Jun 18, 2019
0a402d4
Update RF Demo
divyegala Jun 18, 2019
f624b8f
Update RF Demo
divyegala Jun 18, 2019
5b6bfd8
Update RF Demo
divyegala Jun 18, 2019
c07eb43
Notebook description and option to download dataset
divyegala Jun 19, 2019
ac41346
Review changes
divyegala Jun 20, 2019
5a4e809
Review changes
divyegala Jun 20, 2019
b875c78
Merge pull request #169 from divyegala/rf-demo
Jun 20, 2019
2947f8b
Update coordinate_descent_demo.ipynb
Jun 21, 2019
f7eaf88
Merge pull request #10 from rapidsai/branch-0.8
Jun 21, 2019
2adf319
Update coordinate_descent_demo.ipynb
Jun 21, 2019
cc88236
Update coordinate_descent_demo.ipynb
Jun 21, 2019
85d3930
Update coordinate_descent_demo.ipynb
Jun 21, 2019
f711890
Update dbscan_demo.ipynb
Jun 21, 2019
df16228
Update dbscan_demo.ipynb
Jun 21, 2019
80603d5
Update coordinate_descent_demo.ipynb
Jun 21, 2019
a1e8574
Update knn_demo.ipynb
Jun 21, 2019
4d86ed8
Update pca_demo.ipynb
Jun 21, 2019
785a841
Update ridge_regression_demo.ipynb
Jun 21, 2019
e1638e0
Update ridge_regression_demo.ipynb
Jun 21, 2019
64b8985
Update sgd_demo.ipynb
Jun 21, 2019
8635a98
Update sgd_demo.ipynb
Jun 21, 2019
e22e8fd
Update tsvd_demo.ipynb
Jun 21, 2019
404f265
Update umap_demo.ipynb
Jun 21, 2019
f284293
Update README.md
Jun 21, 2019
d3bb439
Merge pull request #172 from Salonijain27/edit-docs
taureandyernv Jun 21, 2019
fb1a86c
Merge pull request #11 from rapidsai/branch-0.8
Jun 24, 2019
9568484
Update linear_regression_demo.ipynb
Jun 24, 2019
28e390b
Update linear_regression_demo.ipynb
Jun 24, 2019
a2585a8
Moving RF Demo to notebooks-extended
divyegala Jun 24, 2019
afcad30
Merge branch 'branch-0.8' of https://github.com/rapidsai/notebooks in…
divyegala Jun 24, 2019
194754e
Update ridge_regression_demo.ipynb
Jun 24, 2019
21defdb
Update sgd_demo.ipynb
Jun 24, 2019
0963cf3
Merge pull request #174 from divyegala/rf-demo
taureandyernv Jun 24, 2019
3ae4384
Merge pull request #175 from Salonijain27/hot-fix
taureandyernv Jun 24, 2019
16fc666
Merge remote-tracking branch 'upstream/branch-0.8' into feature-cudf-…
doyend Jun 24, 2019
9531f48
change the array size 1e8
doyend Jun 24, 2019
71ece47
Merge pull request #18 from yidong72/feature-cudf-notebook
taureandyernv Jun 24, 2019
1e1a0b6
changed the text
yidong72 Jun 24, 2019
e8a95f2
Merge pull request #176 from yidong72/feature-cudf-notebook
taureandyernv Jun 24, 2019
ea74a6b
Update README.md
taureandyernv Jun 24, 2019
a797ce8
Merge pull request #177 from rapidsai/taureandyernv-08-readme
taureandyernv Jun 24, 2019
3c9c31a
add kmeans_demo
Jun 25, 2019
c6a084a
Update CHANGELOG.md
raydouglass Jun 25, 2019
6301f66
Update CHANGELOG.md
raydouglass Jun 25, 2019
77bee7e
Update CHANGELOG.md
raydouglass Jun 25, 2019
d39395a
Updated to release 0.8 API
BradReesWork Jun 25, 2019
97e8ec2
pr changes to kmeans_demo
Jun 25, 2019
1833bf7
Merge branch 'branch-0.8' of https://github.com/rapidsai/notebooks in…
Jun 25, 2019
0375fba
changes to kmeans_demo
Jun 25, 2019
31ba16c
Update CHANGELOG.md
taureandyernv Jun 25, 2019
ae5ef4f
kmeans_demo changes
Jun 25, 2019
6955141
add kmeans_demo
Jun 25, 2019
cf379a8
kmeans_demo pr changes
Jun 25, 2019
3d67148
Merge branch 'branch-0.8' of https://github.com/jareducherek/notebook…
Jun 25, 2019
ba51e90
Update CHANGELOG.md
taureandyernv Jun 26, 2019
4779144
Merge pull request #178 from jareducherek/branch-0.8
taureandyernv Jun 26, 2019
9f531e3
Merge pull request #179 from BradReesWork/branch-0.8
taureandyernv Jun 26, 2019
52216d5
Merge pull request #180 from rapidsai/taureandyernv-changelog-08
taureandyernv Jun 26, 2019
6ee0ed1
Update CHANGELOG.md
raydouglass Jun 26, 2019
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
29 changes: 29 additions & 0 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,29 @@
# Notebooks 0.8.0 (27 June 2019)

## New Features
- Each notebook is linked to in README.md
- PR #178 Adding KMeans notebook
- PR #18 Added cuDF example notebooks

## Improvements
- Updated SSSP notebook
- Regression notebooks show of `train_test_split()` functon

## Bug Fixes


# Notebooks 0.7.0 (10 May 2019)

## New Features
- PR #133 Adding cuGraph notebooks

## Improvements
- PR #144 Added top level CHANGELOG. Added a README to cugraph

## Bug Fixes


# Notebooks 0.6.0 (22 Mar 2019)



69 changes: 67 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,5 +1,70 @@
# RAPIDS Notebooks and Utilities

* `cuml`: contains four example notebooks showing the usage of different machine learning algorithms included in cuML: `knn`, `dbscan`, `pca` and `tsvd`. It also includes a small subset of the Mortgage Dataset used in the notebooks.
* `mortgage`: contains the notebook which runs ETL + ML on the Mortgage Dataset derived from [Fannie Mae’s Single-Family Loan Performance Data](http://www.fanniemae.com/portal/funding-the-market/data/loan-performance-data.html) ... download the mortgage dataset for use with the notebook [here](https://rapidsai.github.io/demos/datasets/mortgage-data)
## XGBoost Notebook
| Folder | Notebook Title | Description |
|-----------|------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| XGBoost | [XGBoost Demo](xgboost/XGBoost_Demo.ipynb) | This notebook shows the acceleration one can gain by using GPUs with XGBoost in RAPIDS. |
## CuML Notebooks
The cuML notebooks showcase how to use the machine learning algorithms implemented in cuML along with the advantages of using cuML over scikit-learn. These notebooks compare the time required and the performance of the algorithms. Below are a list of such algorithms:

| Folder | Notebook Title | Description |
|-----------|------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| cuML | [dbscan_demo](cuml/dbscan_demo.ipynb) | This notebook showcases density-based spatial clustering of applications with noise (dbscan) algorithm using the `fit` and `predict` functions |
| cuML | [knn_demo](cuml/knn_demo.ipynb) | This notebook showcases k-nearest neighbors (knn) algorithm using the `fit` and `kneighbors` functions |
| cuML | [Linear Regression Demo](cuml/linear_regression_demo.ipynb) | This notebook includes code example for linear regression algorithm and it showcases the `fit` and `predict` functions. |
| cuML | [Ridge Regression Demo](cuml/ridge_regression_demo.ipynb) | This notebook includes code examples of ridge regression and it showcases the `fit` and `predict` functions. |
| cuML | [Coordinate Descent](cuml/coordinate_descent_demo.ipynb) | This notebook includes code examples of lasso and elastic net models. These models are placed together so a comparison between the two can also be made in addition to their sklearn equivalent. |
| cuML | [pca_demo](cuml/pca_demo.ipynb) | This notebook showcases principal component analysis (PCA) algorithm where the model can be used for prediction (using `fit_transform`) as well as converting the transformed data into the original dataset (using `inverse_transform`). |
| cuML | [tsvd_demo](cuml/tsvd_demo.ipynb ) | This notebook showcases truncated singular value decomposition (tsvd) algorithm which like PCA performs both prediction and transformation of the converted dataset into the original data using `fit_transform` and `inverse_transform` functions respectively |
| cuML | [sgd_demo](cuml/sgd_demo.ipynb) | The stochastic gradient descent algorithm is demostrated in the notebook using `fit` and `predict` functions |
| cuML | [umap_demo](cuml/umap_demo.ipynb) | The uniform manifold approximation & projection algorithm is compared with the original author's equivalent non-GPU \Python implementation using `fit` and `transform` functions |
| cuML | [umap_demo_graphed](cuml/umap_demo_graphed.ipynb) | Demonstration of cuML uniform manifold approximation & projection algorithm's supervised approach against mortgage dataset and comparison of results against the original author's equivalent non-GPU \Python implementation. |
| cuML | [umap_demo_supervised](cuml/umap_supervised_demo.ipynb) | Demostration of UMAP supervised training. Uses a set of labels to perform supervised dimensionality reduction. UMAP can also be trained on datasets with incomplete labels, by using a label of "-1" for unlabeled samples. |
| cuML | [random forest](cuml/rf_demo.ipynb) | This notebook includes code examples of Random Forest and it showcases the `fit` and `predict` functions. |

## CuDF Notebooks
| Folder | Notebook Title | Description |
|-----------|------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| cuDF | [notebooks_Apply_Operations_in_cuDF](cudf/notebooks_Apply_Operations_in_cuDF.ipynb) | This notebook showcases two special methods where cuDF goes beyond the Pandas library: apply_rows and apply_chunk functions. They utilized the Numba library to accelerate the data transformation via GPU in parallel. |
| cuDF | [notebooks_numba_cuDF_integration](cudf/notebooks_numba_cuDF_integration.ipynb) | This notebook showcases how to use Numba CUDA to accelerate cuDF data transformation and how to step by step accelerate it using CUDA programming tricks |

## CuGraph Notebooks
| Folder | Notebook Title | Description |
|-----------|------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| cuGraph | [Louvain](cugraph/Louvain.ipynb) | Demonstration of using cuGraph to identify clusters in a test graph using the Louvain algorithm |
| cuGraph | [Vertex-Similarity](cugraph/Vertex-Similarity.ipynb) | Demonstration of using cuGraph to compute vertex similarity using both the Jaccard Similarity and the Overlap Coefficient. |
| cuGraph | [Weighted-Jaccard](cugraph/Weighted-Jaccard.ipynb) | Demonstration of using cuGraph to compute the Weighted Jaccard Similarity metric on our training dataset. |
| cuGraph | [Renumber](cugraph/Renumber.ipynb) | Demonstrate of using the renumbering features to assigned new vertex IDs to the test graph. This is useful for when the data sets is non-contiguous or not integer values |
| cuGraph | [BFS](cugraph/BFS.ipynb) | Demonstration of using cuGraph to computer the Bredth First Search space from a given vertex to all other in our training graph |
| cuGraph | [SSSP](cugraph/SSSP.ipynb) | Demonstration of using cuGraph to computer the The Shortest Path from a given vertex to all other in our training graph |
| cuGraph | [Spectral-Clustering](cugraph/Spectral-Clustering.ipynb) | Demonstration of using cuGraph to identify clusters in a test graph using Spectral Clustering using both the (A) Balance Cut and (B) the Modularity Maximization quality metrics |
| cuGraph | [Pagerank](cugraph/Pagerank.ipynb) | Demonstration of using both NetworkX and cuGraph to compute the PageRank of each vertex in our test dataset |
| cuGraph | [Triangle Counting](cugraph/Triangle-Counting.ipynb) | Demonstration of using both NetworkX and cuGraph to compute the the number of Triangles in our test dataset |

## Tutorial with an End to End workflow

| Folder | Notebook Title | Description |
|-----------|------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Tutorials | [DBSCAN_demo_full](tutorials/DBSCAN_Demo_Full.ipynb) | Demonstration of how to use DBSCAN - a popular clustering algorithm - and how to use the GPU accelerated implementation of this algorithm in RAPIDS. |

## Utils Scripts
| Folder | Script Title | Description |
|-----------|------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Utils | start-jupyter.sh | starts a JupyterLab environment for interacting with, and running, notebooks |
| Utils | stop-jupyter.sh | identifies all process IDs associated with Jupyter and kills them |
| Utils | dask-cluster.py | launches a configured Dask cluster (a set of nodes) for use within a notebook |
| Utils | dask-setup.sh | a low-level script for constructing a set of Dask workers on a single node |
| Utils | split-data-mortgage.sh | splits mortgage data files into smaller parts, and saves them for use with the mortgage notebook |

## Documentation (WIP)
| Folder | Document Title | Description |
|-----------|------------------------|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Docs | ngc-readme | |
| Docs | dockerhub-readme | |

## Additional Information
* The `cuml` folder also includes a small subset of the Mortgage Dataset used in the notebooks and the full image set from the Fashion MNIST dataset.

* `utils`: contains a set of useful scripts for interacting with RAPIDS

* For additional, community driven notebooks, which will include our blogs, tutorials, workflows, and more intricate examples, please see the [Notebooks Extended Repo](https://github.com/rapidsai/notebooks-extended)
Loading