Datasets

A curated list of useful datasets for research and analysis. Use the filters and search to explore, or click column headers to sort.

Name Category Type Source Format
Yahoo Finance
Historical and real-time stock prices, financial statements, and market data.
FinanceNon-AcademicYahooAPI, CSV
FRED Economic Data
Over 800,000 economic time series from various sources including GDP, employment, inflation.
EconomicsAcademicFederal Reserve Bank of St. LouisAPI, CSV, Excel
Quandl
Financial, economic, and alternative data covering stocks, futures, options, and more.
FinanceNon-AcademicNasdaqAPI, CSV
Kenneth French Data Library
Fama-French factors, portfolios, and returns data for asset pricing research.
FinanceAcademicDartmouth CollegeCSV, TXT
CRSP
Comprehensive historical stock data for US equities.
FinanceAcademicUniversity of ChicagoVarious
Compustat
Fundamental financial data for public companies worldwide.
FinanceAcademicS&P GlobalVarious
UCI Machine Learning Repository
Classic machine learning datasets for classification, regression, and clustering.
Machine LearningAcademicUC IrvineVarious
Kaggle Datasets
Community-contributed datasets spanning various domains and competitions.
Machine LearningNon-AcademicKaggleVarious
Hugging Face Datasets
Large collection of NLP and ML datasets with easy-to-use Python API.
NLPNon-AcademicHugging FaceVarious
ImageNet
Large-scale image database for visual object recognition research.
Computer VisionAcademicStanford/PrincetonImages
EIA Open Data
US energy production, consumption, prices, and forecasts.
EnergyNon-AcademicUS Energy Information AdministrationAPI, CSV
ENTSO-E Transparency Platform
European electricity market data including generation, load, and prices.
EnergyNon-AcademicENTSO-EAPI, CSV
World Bank Open Data
Global development indicators covering demographics, economics, health, and education.
EconomicsAcademicWorld BankAPI, CSV, Excel
Our World in Data
Research and data on global problems including health, poverty, and environment.
Social SciencesAcademicUniversity of OxfordCSV, GitHub
IPUMS
Harmonized census and survey data from around the world.
DemographicsAcademicUniversity of MinnesotaVarious
Common Crawl
Petabytes of web crawl data collected over years.
NLPNon-AcademicCommon Crawl FoundationWARC, WET
arXiv Dataset
Metadata and full text of scientific papers from arXiv.
NLPAcademicCornell UniversityJSON
Showing 17 of 17 datasets