SISS Slides, code and other material - EMbeDS-education/ComputingDataAnalysisModeling20242025 GitHub Wiki
Class 1: Introduction
- Introduction to Applied Statistical Modelling: General information about the course, final exam, and introduction on the theoretical statistical background
Class 1-2:
-
Exercise on R:
- Linear Regression Toolkit
- Walmart Database for assignment - Walmart columns description
- Assignment : Commented R file to perform complete Linear Regression Analysis on the Walmart database. | Final Script
Class 3-4:
-
Exercise on R:
- GLM Toolkit - 1 | R-Studio file
- Spotify Database for assignment - Spotify Database description: Data were scaled and balanced to provide better performance in the regression model. | Original Spotify db
- Assignment : Commented R file to perform complete Logit, Probit, Ordinal Logit, and Multinomial Logit Analysis on different databases.
Class 5:
-
Exercise on R:
- GLM Toolkit - 1: Reference and assignment on Ordinal Logit are the same of GLM Toolkit 1 (Class 3-4)
Class 6:
-
Exercise on R:
- GLM Toolkit - 2 | Script: GLM Toolkit - 2
- NYC Bicycle Database for assignment | Documentation
- Assignment : Commented R file to perform complete GLM Analysis (Poisson and overdispersed Poisson) on different databases.
Class 7:
-
Exercise on R:
Final Exam
We suggest using the database at the link below for your final project. It is an ISTAT database concerning the lifestyle and health habits of a sample of Italian households. The database covers the period from 2015 to 2021.
Toolkit Database
- At this link, you will find all the databases covered in the different toolkits, along with the corresponding documentation. The directory also contains other databases that you can use to practice your skills or prepare your project for the final exam.