6.419x - roadfoodr/mitx-sds-resources GitHub Wiki

6.419x Data Analysis: Statistical Modeling and Computation in Applications

General:

Module 1. Review: Statistics, Correlation, Regression, Gradient Descent

Observational Studies and Experiments

Hypothesis Testing

Likelihood Ratio Test and Multiple Hypothesis Testing

Correlation and Least Squares Regression

Gradient Descent

Recitation 1: Average Treatment Effect versus Average Treatment Effect for the Treated

Module 2: Genomics and High-Dimensional Data

Visualization of High-Dimensional Data

Methods of Classification on High-Dimensional Data

Clustering with High-Dimensional Data

Recitation: Demonstration of Data Visualization, Clustering, and Classification

Module 3: Network Analysis

Graph Basics

Graph Centrality Measures

Spectral Clustering

Graphical models

Jupyter Notebook: From Data to Networks, Using Python Networkx

Module 4: Time Series

Introduction to Time Series: Trend, Seasonality, Stationarity, Autocovariance

Time Series: Statistical Models

Introduction to Time Series Analysis 3

Module 5: Environmental Data and Gaussian Processes

Environmental Data and Gaussian Processes

Spatial Prediction

Sensing and Analyzing global patterns of dependence

Recitation: Working with Gaussians, and introduction to Gaussian Process