Machine Learning From Scratch

The most common machine learning models and algorithms, programmed from first principles. The aim is to implement them to the same standard as widely used libraries such as scikit-learn, with clean, efficient code, a clear modular design, and rigorous adherence to the underlying mathematics.

Model / Algorithm	Type	Completed?	Notes
Linear Regression	Regression	✅	TBR
Logistic Regression	Classification	✅	TBR
K-Nearest Neighbours	Classification	✅	TBR
K-Means	Clustering	✅	TBR
Stochastic Gradient Descent	Optimisation	✅	TBR
Naive Bayes Classifiers	Classification	Categorical ✅, Multinomial ❌, Gaussian ❌	TBR
Decision Trees	Classification/Regression	Classifier ✅, Regressor ✅, CCP Pruning ✅, Feature Importance ✅	Speed and efficiency need further optimisation before production-level use.
Random Forests	Classification/Regression	Bootstrap aggregated ✅, Rotation forest ❌, Extremely Randomised Trees (ERT) ❌	TBR
Support Vector Machine	Classification/Regression	Hard-margin SVC ✅, Soft-margin SVC ✅, Kernel Trick, OvO ❌, OvR ✅	Currently in progress.
Principal Component Analysis	Dimensionality Reduction	✅	First-principles derivation of the eigenvalue equation needs to be added.
DBSCAN	Clustering	✅	Pseudo-code needs to be added for queuing algorithm used for cluster growth.
Gaussian Mixture Models	Clustering (Probabilistic)	⏳🚧	In progress (put on hold). Need to formally explore Maximum Likelihood Estimation (MLE) theory.
Linear Discriminant Analysis	Dimensionality Reduction	❌	Not started.
Gradient Boosting	Classification/Regression	⏳🚧	Theory section written, but the model has not been implemented.
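
The DBSCAN note in the table mentions a queuing algorithm for cluster growth. As a rough illustration of that idea (not the repository's implementation), the sketch below grows each cluster with a FIFO queue of neighbours. The function names `region_query` and `dbscan`, the label constants, and the brute-force Euclidean neighbour search are all assumptions made for this example.

```python
from collections import deque

import numpy as np

NOISE = -1       # label for points that belong to no cluster
UNVISITED = -2   # internal marker, never returned for a processed point


def region_query(X, i, eps):
    """Indices of all points within eps of point i (including i itself)."""
    return np.flatnonzero(np.linalg.norm(X - X[i], axis=1) <= eps)


def dbscan(X, eps, min_samples):
    """Hypothetical minimal DBSCAN: clusters are grown with a queue."""
    X = np.asarray(X, dtype=float)
    labels = np.full(len(X), UNVISITED)
    cluster = 0
    for i in range(len(X)):
        if labels[i] != UNVISITED:
            continue
        neighbours = region_query(X, i, eps)
        if len(neighbours) < min_samples:
            labels[i] = NOISE  # may later be relabelled as a border point
            continue
        # Point i is a core point: start a new cluster and grow it.
        labels[i] = cluster
        queue = deque(neighbours)
        while queue:
            j = queue.popleft()
            if labels[j] == NOISE:
                labels[j] = cluster  # border point: joins, but is not expanded
                continue
            if labels[j] != UNVISITED:
                continue  # already assigned to a cluster
            labels[j] = cluster
            j_neighbours = region_query(X, j, eps)
            if len(j_neighbours) >= min_samples:
                queue.extend(j_neighbours)  # j is also a core point: keep growing
        cluster += 1
    return labels
```

The queue replaces recursion: every core point reached from the seed appends its own neighbourhood, so the cluster keeps expanding until no unvisited density-reachable points remain.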

Currently working on: Support Vector Machines

Implement the kernel trick for SVCs.
Implement a soft-margin SVC using the dual formulation.
Implement a soft-margin SVC using SGD and Hinge loss.
Combine multiple SVCs (OvO and OvR) to handle classification with more than two classes (C > 2).
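
Two of the items above, the soft-margin SVC trained with SGD on the hinge loss and the one-vs-rest (OvR) combination, can be sketched together. This is a minimal illustration under stated assumptions, not the repository's code: the class names `LinearSVCSGD` and `OneVsRestSVC`, the hyperparameters `lam`, `lr` and `epochs`, and their default values are all hypothetical.

```python
import numpy as np


class LinearSVCSGD:
    """Linear soft-margin SVC fitted by SGD on the regularised hinge loss.

    Per-sample objective: lam/2 * ||w||^2 + max(0, 1 - y * (w.x + b)),
    with labels y in {-1, +1}.
    """

    def __init__(self, lam=0.01, lr=0.01, epochs=100, seed=0):
        self.lam, self.lr, self.epochs, self.seed = lam, lr, epochs, seed

    def fit(self, X, y):
        X, y = np.asarray(X, dtype=float), np.asarray(y, dtype=float)
        rng = np.random.default_rng(self.seed)
        self.w = np.zeros(X.shape[1])
        self.b = 0.0
        for _ in range(self.epochs):
            for i in rng.permutation(len(X)):
                violated = y[i] * (X[i] @ self.w + self.b) < 1
                # Sub-gradient: the regulariser always contributes; the hinge
                # term contributes only when the margin is violated.
                grad_w = self.lam * self.w - (y[i] * X[i] if violated else 0.0)
                grad_b = -y[i] if violated else 0.0
                self.w -= self.lr * grad_w
                self.b -= self.lr * grad_b
        return self

    def decision_function(self, X):
        return np.asarray(X, dtype=float) @ self.w + self.b

    def predict(self, X):
        return np.where(self.decision_function(X) >= 0, 1, -1)


class OneVsRestSVC:
    """One binary SVC per class; predict the class with the largest score."""

    def fit(self, X, y):
        y = np.asarray(y)
        self.classes_ = np.unique(y)
        # Relabel to +1 for the current class and -1 for all others.
        self.models_ = [
            LinearSVCSGD().fit(X, np.where(y == c, 1, -1)) for c in self.classes_
        ]
        return self

    def predict(self, X):
        scores = np.column_stack([m.decision_function(X) for m in self.models_])
        return self.classes_[np.argmax(scores, axis=1)]
```

OvR trains C classifiers, one per class, whereas OvO would train C(C-1)/2 pairwise classifiers and combine them by voting. The kernel trick and the dual formulation are not covered by this sketch.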

Folders and files
models
utils
ConfusionMatrix.ipynb
Cross-Validation.ipynb
DBSCAN.ipynb
DecisionTreeClassifier - CCP.ipynb
DecisionTreeClassifier - Feature Importance.ipynb
DecisionTreeClassifier.ipynb
DecisionTreeRegressor.ipynb
Gaussian Mixture Models.ipynb
Gradient Boosting.ipynb
Gradient Descent II.ipynb
Gradient Descent.ipynb
K-Nearest Neighbours.ipynb
K-means.ipynb
LICENSE.txt
Linear Regression..ipynb
Logistic Regression.ipynb
Maximum Likelihood Estimation.ipynb
Multiclass SVC.ipynb
Naive Bayes Classifier.ipynb
PCA.ipynb
README.md
Random Forests.ipynb
RandomForestClassifier.ipynb
RandomForestRegressor.ipynb
SVC (Hard-margin).ipynb
SVC (Soft-margin).ipynb
SVC - More Kernels.ipynb
SVC - The Kernel Trick.ipynb
decisiontree_icon.png
randomforest_icon.png
svc_icon.png

prithvi-ramrucha/Machine-Learning-From-Scratch
