ML: Project Template - dudycooly/1235 GitHub Wiki

Python Project Template

1. Prepare Problem

a) Load libraries

b) Load dataset

2. Summarize Data

a) Descriptive statistics

b) Data visualizations

3. Prepare Data

a) Data Cleaning

b) Feature Selection

c) Data Transforms

4. Evaluate Algorithms

a) Split-out validation dataset

b) Test options and evaluation metric

c) Spot Check Algorithms

d) Compare Algorithms

5. Improve Accuracy

a) Algorithm Tuning

b) Ensembles

6. Finalize Model

a) Predictions on validation dataset

b) Create standalone model on entire training dataset

c) Save model for later use