Finding datasets - lmmx/devnotes GitHub Wiki
Pete Skomoroch used to have a list, which was apparently scraped by InfoChimps/CKAN (but when I try to use CKAN I just get ads for diet pills and what seems to be other spam).
Lists of dataset repos
Searching for PS's old site can help surface other dataset repos, e.g.:
- Websites for Datasets, a page on the course Introduction To Big Data Analysis (STAT 29000, Fall 2018) at Purdue
Dataset repos
- Google Datasets
  - e.g. "alpha channel images" was how I found SUN2012 via
  - Some of these will link to Kaggle