Curated list of datasets..
- General
- Data Science
- Time Series
- Natural Language Processing
- Information Retrieval
- Speech Recognition
- Computer Vision
- Misc
-
UCI ML Repository: https://archive.ics.uci.edu/ml/index.html
-
Reddit Page for Datasets:
- https://www.reddit.com/r/datasets/
- https://twitter.com/reddit_datasets (Twitter Account)
-
R Datasets
-
University of Edinburugh - School of Informatics - Data Mining datsets
-
19 Free Public Data Sets For Your First Data Science Project - https://www.springboard.com/blog/free-public-data-sets-data-science-project/
- AirBnB: http://insideairbnb.com
- Time Series Data Library - Data Market https://datamarket.com/data/list/?q=provider:tsdl
-
The Stanford Question Answering Dataset: https://stanford-qa.com/
-
Trump Speeches: https://github.com/ryanmcdermott/trump-speeches
-
LAMBADA Dataset: http://clic.cimec.unitn.it/lambada/
-
Data for Everyone - Crowdflower https://www.crowdflower.com/data-for-everyone/
-
Datasets from MILA Lab - University of Montreal, Canada:
-
Open Data Stack Exchange
-
Academic Torrents - http://academictorrents.com/browse.php?cat=6
Fork and create a pull request