Meeting 3: November 13th 2018 - QMIND-Team/Sabermetrics GitHub Wiki
Agenda
-
Discuss Git Workflow
-
Updates on progress of task assigned last meeting
-
Define the overall architecture of the project
-
Discuss player evaluation and projection
- How many years of data should we use
- How does "Baseball era" affect our stats
- What makes a good stat
- How to define the best pitcher
- Why wins is not an effective metric
- Strikeout rate
- Out Rate
- Runs against
- ERA
- FIP and xFIP
- SIERA
- wOBA and xwOBA
- Stats to compare pitchers
- Physical characteristics
- Handedness
- Pitch type
- Starter vs Reliever
- Degradation over innings
- Velocity
- Pitch position data
- Spin rate
- Home park
- League
- Pitching data resources
- Linear regression techniques
-
Tasks for this week
- Data collection (query what data we have to work with)
- team_pitching
- b_war pitching
- pitching_stats, b_ref, range
- statcast_pitching
- Data preparation
- Find how many null values for each stat for the 2016 and 2017 seasons
- Note any similarities or differences
- Try running a linear regression on a 2016 stat vs a 2017 stat
- Try eliminating null values and then running the same regression
- Find how many null values for each stat for the 2016 and 2017 seasons
- Data collection (query what data we have to work with)