Meeting 3: November 13th 2018 - QMIND-Team/Sabermetrics GitHub Wiki

Agenda

  • Discuss Git Workflow

  • Updates on progress of tasks assigned last meeting

  • Define the overall architecture of the project

  • Discuss player evaluation and projection

    • How many years of data should we use
    • How does "Baseball era" affect our stats
    • What makes a good stat
    • How to define the best pitcher
      • Why wins are not an effective metric
      • Strikeout rate
      • Out Rate
      • Runs against
      • ERA
      • FIP and xFIP
      • SIERA
      • wOBA and xwOBA
    • Stats to compare pitchers
      • Physical characteristics
      • Handedness
      • Pitch type
      • Starter vs Reliever
      • Degradation over innings
      • Velocity
      • Pitch position data
      • Spin rate
      • Home park
      • League
    • Pitching data resources
    • Linear regression techniques
  • Tasks for this week

    • Data collection (query what data we have to work with)
      • team_pitching
      • b_war pitching
      • pitching_stats, b_ref, range
      • statcast_pitching
    • Data preparation
      • Count the null values for each stat in the 2016 and 2017 seasons
        • Note any similarities or differences
      • Try running a linear regression on a 2016 stat vs a 2017 stat
      • Try eliminating null values and then running the same regression
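
The data preparation tasks above could be sketched roughly as follows. This is a minimal illustration and not the team's actual pipeline: the player names, the stat names (`ERA`, `K9`), and all the numbers are made-up placeholders, and the helper functions `count_nulls`, `paired_values`, and `fit_line` are hypothetical names introduced only for this example. It uses plain Python with `None` standing in for a null value; with real data the same steps would more likely run on a dataframe.

```python
# Toy rows standing in for two seasons of pitcher data. Every value here
# is invented for illustration; None marks a missing (null) stat.
season_2016 = {
    "pitcher_a": {"ERA": 3.0, "K9": 9.0},
    "pitcher_b": {"ERA": 4.0, "K9": None},
    "pitcher_c": {"ERA": None, "K9": 7.0},
    "pitcher_d": {"ERA": 5.0, "K9": 6.0},
}
season_2017 = {
    "pitcher_a": {"ERA": 3.5, "K9": 9.5},
    "pitcher_b": {"ERA": 4.5, "K9": 8.0},
    "pitcher_c": {"ERA": 4.0, "K9": None},
    "pitcher_d": {"ERA": 5.5, "K9": 6.5},
}


def count_nulls(season):
    """Count how many players have a null value for each stat."""
    counts = {}
    for stats in season.values():
        for name, value in stats.items():
            counts[name] = counts.get(name, 0) + (1 if value is None else 0)
    return counts


def paired_values(stat, first, second):
    """Return (x, y) pairs for players whose stat is non-null in both seasons."""
    pairs = []
    for player, stats in first.items():
        x = stats.get(stat)
        y = second.get(player, {}).get(stat)
        if x is not None and y is not None:
            pairs.append((x, y))
    return pairs


def fit_line(pairs):
    """Ordinary least squares fit of y = slope * x + intercept."""
    n = len(pairs)
    mean_x = sum(x for x, _ in pairs) / n
    mean_y = sum(y for _, y in pairs) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in pairs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in pairs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Step 1: null counts per stat, per season, to compare side by side.
print("2016 nulls:", count_nulls(season_2016))
print("2017 nulls:", count_nulls(season_2017))

# Steps 2 and 3: regress the 2017 value of a stat on its 2016 value,
# keeping only players with no nulls in either season.
slope, intercept = fit_line(paired_values("ERA", season_2016, season_2017))
print("ERA 2017 ~ %.2f * ERA 2016 + %.2f" % (slope, intercept))
```

Because plain Python cannot fit a line through `None`, this sketch only covers the "nulls eliminated" version of the regression. Comparing against a run that keeps the nulls depends on how the chosen tool treats missing values (some drop them silently, some raise an error), which is worth noting when doing the exercise.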