ML, AI, and Data Science Free Datasets Download Links 2025

This page links to sources of thousands of free data science datasets that you can download, explore, and share, so you can connect with other researchers and work together to solve problems faster. iLovePhd.com also provides open metadata on 20 million texts, images, videos, and sounds.

Datasets Download Links 2025

Below is an updated list of free, reliable data science datasets available for download in 2025. The datasets span domains such as machine learning, AI, economics, and healthcare. Each entry gives the dataset name, a short description, and a download link.

| Dataset Name | Description | Download Link |
| --- | --- | --- |
| Kaggle Datasets | A vast collection of datasets across various domains, including competitions and user-contributed data. | kaggle.com/datasets |
| UCI Machine Learning Repository | A classic repository offering datasets for machine learning research, including classification, regression, and clustering tasks. | archive.ics.uci.edu/ml |
| Data.gov | The U.S. government’s open data portal, providing datasets on agriculture, climate, education, energy, and more. | data.gov |
| Data.gov.in | India’s open government data platform offers datasets from various Indian government departments and ministries. | data.gov.in |
| World Bank Open Data | Provides global development data, including economic indicators, health statistics, and population metrics. | data.worldbank.org |
| European Data Portal | Aggregates open data from European countries, covering various sectors like economy, health, and environment. | data.europa.eu |
| Common Crawl | A repository of web crawl data that can be used for research in natural language processing, machine learning, and data mining. | commoncrawl.org |
| Zenodo | An open-access repository developed under the European OpenAIRE program and operated by CERN, allowing researchers to share datasets, research papers, and software. | zenodo.org |
| Harvard Public Domain Books Dataset | A collection of nearly one million public-domain books released by Harvard University, suitable for training AI models. | Harvard Dataset |
| WorldMove | A synthetic human mobility dataset covering over 1,600 cities worldwide, useful for urban planning and transportation research. | WorldMove Dataset |
| WeatherBench | A benchmark dataset for data-driven weather forecasting, providing processed data from the ERA5 archive. | WeatherBench |
| PMLB (Penn Machine Learning Benchmarks) | A collection of standardized datasets for evaluating machine learning algorithms, facilitating easy comparison of methods. | PMLB GitHub |
| UK Data Archive | Hosts over 6,000 social science datasets, including large-scale surveys like the Labour Force Survey and Crime Survey for England and Wales. | UK Data Archive |
| FiveThirtyEight Datasets | Datasets used in FiveThirtyEight articles, covering topics like politics, sports, science, and economics. | FiveThirtyEight GitHub |
| AWS Public Datasets | A repository of large public datasets hosted on Amazon Web Services, including satellite imagery, genomic data, and web crawls. | AWS Public Datasets |
| Google Dataset Search | A tool to find datasets stored across the web, facilitating access to datasets in various domains and formats. | datasetsearch.research.google.com |
| YouTube Labeled Video Dataset | A dataset containing labeled YouTube videos, useful for video classification and machine learning tasks. | YouTube-8M Dataset |
| Analytics Vidhya Datasets | Offers datasets for practice and competitions in data science and machine learning. | Analytics Vidhya |
| Quandl | Provides financial, economic, and alternative datasets, suitable for investment and economic research. | quandl.com |
| DrivenData | Hosts data science competitions for social good, providing datasets on various humanitarian topics. | DrivenData |
| MNIST Database | A classic dataset of handwritten digits, widely used for training image processing systems. | MNIST Dataset |
| MovieLens | Provides movie rating datasets, useful for building and evaluating recommendation systems. | MovieLens |
| Jester Dataset | A dataset of joke ratings, used for research in collaborative filtering and recommendation systems. | Jester Dataset |
| Awesome Public Datasets | A curated list of high-quality public datasets categorized by topic and domain. | Awesome Public Datasets |
| Big Data Analytics News – 200+ Free Datasets | A comprehensive guide listing over 200 free datasets across various domains, including AI, NLP, and machine learning. | Big Data Analytics News |

These datasets are valuable resources for data scientists, researchers, and enthusiasts looking to explore and analyze data across different fields. They are freely available and can be used for various projects, including machine learning model training, data analysis, and academic research.
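
Many of the sources above distribute data as CSV files, so a first step after downloading is usually to check which columns and how many rows a file contains. The following is a minimal sketch of that check using only the Python standard library. The function name `summarize_csv` and the sample rows are hypothetical and made up for illustration; they are not taken from any dataset listed here.

```python
import csv
import io


def summarize_csv(text):
    """Return (column_names, row_count) for CSV text that has a header row."""
    reader = csv.DictReader(io.StringIO(text))
    rows = list(reader)  # reading the rows also populates reader.fieldnames
    return list(reader.fieldnames or []), len(rows)


# Made-up sample standing in for the contents of a downloaded file.
sample = "user_id,movie_id,rating\n1,10,4.0\n1,20,3.5\n2,10,5.0\n"

columns, n_rows = summarize_csv(sample)
print("columns:", columns)
print("rows:", n_rows)
```

For a real download, the same function could be given the file contents, for example `summarize_csv(open(path, newline="", encoding="utf-8").read())`, where `path` is wherever you saved the file. Very large files would be better processed row by row than read into memory at once.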