GitHub

Home Credit Default Risk Classification

The goal of this project is to predict whether an applicant will be able to pay back their loan. The data is provided by this Kaggle competition.

We use the following types of classification: Logistic, LDA, and SVC.

Our process is as follows:

Split the data into training, validation, and testing sets.
Fit and evaluate a bunch of models on training data.
Take the model of each type (Logistic, LDA, and SVC) that performed best on training and evaluate on validation data.
Take the model that performed best on validation data and report metrics as it performs on test set.

The code organization is as follows:

Name		Name	Last commit message	Last commit date
Latest commit History 94 Commits
images		images
notebooks		notebooks
proposal		proposal
report		report
scripts		scripts
.gitignore		.gitignore
DATA 403 Project 2.Rproj		DATA 403 Project 2.Rproj
README.md		README.md