ML Project Checklist - liniribeiro/machine_learning GitHub Wiki

  • Analize the problem;
  • Get data;
  • Identify and visualize the data to get usefull infos;
  • Prepare the data to be used by the ML algorithm;
  • Select and train a model;
  • Improve model;
  • Present the solution;
  • Make available in production, monitoring and maintain.

Analyzing the problem

  • Whats the business objective?
  • Whats the actual solution?
  • We are going to use supervised, not supervised or reinforcement learning?
    • Every time that we have labeled data, we can use the supervised learning
  • its going to be a task of regression or classification? or another?
    • Every time that you are asked for predict a value, its a regression task.
  • We are going to use training in batches or online?
    • If we don't have a large amount of data and we don't have the necessity of using newer data, we use the training in batches.
    • If the data are very bug, we can use the MapReduce technique or use online training.