Supervised Learning: Donor Classification
Summary:
In this project, I help CharityML maximize the likelihood of receiving donations through constructing a model that predicts if a person receives income exceeding 50k/yr; a level known to indicate being a good candidate for donations.
Project Organization
- Exploratory Data Analysis
- Data Engineering
- Metrics
- Machine Learning Models
- Summary
Models
- Naive Bayes
- Logistic Regression
- Random Forest
- AdaBoost
- Gradient Boost
- Extreme Gradient Boost
- K-Nearest Neighbors
Performance
- Accuracy: 87.26%
- F-0.5 Score: 76.05% (high precision model)
Technologies
Python inside of an IPython Notebook and published with Reveal.js
Employed libraries:
- Pandas
==1.0.1 - Numpy
==1.18.1 - Scikit-learn
==0.22.1 - Scipy
==1.4.1 - Statsmodels
==0.11.0 - Plotly
==4.6.0 - Seaborn
==0.10.0 - Matplotlib
==3.1.3 - dython
==0.5.0.post2 - Custom Module: visualization.py
- Custom Module: modeling.py
- Custom Module: visuals.py