Open data resources for Capstone Projects and your Data Science Journey.
Open Data Sets & Portals:
- Papers w/ Code ML Datasets
- Google Datasets
- Data is Plural
- India Open Data Gov
- Canada Open Data Gov
- US Open Data Gov
- Quandl Financial Data
- UCI Machine Learning Datasets
- Gapminder
- FiveThirtyEight
- DataPortals
- world
- Wikipedia ML Listings
- Cool Datasets on Twitter
- Public Data Science Datasets
- OpenML
- Github: Awesome Public Datasets
- Kaggle Datasets
- https://data.ny.gov/
- Open Data Monitor
- AWS Datasets
- Common Crawl
- Socrata
- S. Census
- European Union Open Data Portal
- UN Data
- CIA Data
- HealthData
- California Data
- Google Public Data
- Flowing Data
- More Data Sets
- NLP Data sets
- Microsoft Research Open Data
- Github Code Search
- Public Data
- Open Source Sports
- Machine Learning Datasets
- CERN Open Data Portal
- Mexico Open Government Data
- NLP Datasets
APIs:
- Public APIs
- Programmable Web
- Zillow
- Wikipedia
- Google Scholar
- AlphaVantage
- More APIs
- Python APIs
- Python Wrapper APIs
- Twitter Scraper
License
To the extent possible under law, David Yakobovitch has licensed this work under Creative Commons, 4.0-NC-ND. This license is the most restrictive of Creative Commons six main licenses, only allowing others to download your works and share them with others as long as they credit the author, but they can’t change them in any way or use them commercially.