diff --git a/README.rst b/README.rst index b3f5bc0..67c4bda 100644 --- a/README.rst +++ b/README.rst @@ -5,6 +5,8 @@ Awesome Public Datasets :alt: Awesome :target: https://github.com/sindresorhus/awesome +.. |OK_ICON| image:: https://raw.githubusercontent.com/awesomedata/apd2/master/deploy/ok-24.png +.. |FIXME_ICON| image:: https://raw.githubusercontent.com/awesomedata/apd2/master/deploy/fixme-24.png **NOTICE**: This repo is automatically generated by `APD2 `_. Please **DO NOT** modify this file directly. We have provided @@ -18,1188 +20,1188 @@ Most of the data sets listed below are free, however, some are not. Other amazingly awesome lists can be found in `sindresorhus's awesome `_ list. -.. contents:: Table of Contents +.. contents:: **Table of Contents** Agriculture ----------- -* `U.S. Department of Agriculture's Nutrient Database `_ +* `U.S. Department of Agriculture's Nutrient Database `_ |OK_ICON| -* `U.S. Department of Agriculture's PLANTS Database `_ +* `U.S. Department of Agriculture's PLANTS Database `_ |OK_ICON| Biology ------- -* `1000 Genomes `_ +* `1000 Genomes `_ |OK_ICON| -* `American Gut (Microbiome Project) `_ +* `American Gut (Microbiome Project) `_ |OK_ICON| -* `Broad Bioimage Benchmark Collection (BBBC) `_ +* `Broad Bioimage Benchmark Collection (BBBC) `_ |OK_ICON| -* `Broad Cancer Cell Line Encyclopedia (CCLE) `_ +* `Broad Cancer Cell Line Encyclopedia (CCLE) `_ |OK_ICON| -* `Cell Image Library `_ +* `Cell Image Library `_ |OK_ICON| -* `Complete Genomics Public Data `_ +* `Complete Genomics Public Data `_ |OK_ICON| -* `EBI ArrayExpress `_ +* `EBI ArrayExpress `_ |OK_ICON| -* `EBI Protein Data Bank in Europe `_ +* `EBI Protein Data Bank in Europe `_ |OK_ICON| -* `ENCODE project `_ +* `ENCODE project `_ |OK_ICON| -* `Electron Microscopy Pilot Image Archive (EMPIAR) `_ +* `Electron Microscopy Pilot Image Archive (EMPIAR) `_ |OK_ICON| -* `Ensembl Genomes `_ +* `Ensembl Genomes `_ |OK_ICON| -* `Gene Expression Omnibus (GEO) `_ +* `Gene Expression Omnibus (GEO) `_ |OK_ICON| -* `Gene Ontology (GO) `_ +* `Gene Ontology (GO) `_ |OK_ICON| -* `Global Biotic Interactions (GloBI) `_ +* `Global Biotic Interactions (GloBI) `_ |OK_ICON| -* `Harvard Medical School (HMS) LINCS Project `_ +* `Harvard Medical School (HMS) LINCS Project `_ |OK_ICON| -* `Human Genome Diversity Project `_ +* `Human Genome Diversity Project `_ |OK_ICON| -* `Human Microbiome Project (HMP) `_ +* `Human Microbiome Project (HMP) `_ |OK_ICON| -* `ICOS PSP Benchmark `_ +* `ICOS PSP Benchmark `_ |OK_ICON| -* `International HapMap Project `_ +* `International HapMap Project `_ |OK_ICON| -* `Journal of Cell Biology DataViewer `_ +* `Journal of Cell Biology DataViewer `_ |OK_ICON| -* `MIT Cancer Genomics Data `_ +* `MIT Cancer Genomics Data `_ |OK_ICON| -* `NCBI Proteins `_ +* `NCBI Proteins `_ |OK_ICON| -* `NCBI Taxonomy `_ +* `NCBI Taxonomy `_ |OK_ICON| -* `NCI Genomic Data Commons `_ +* `NCI Genomic Data Commons `_ |OK_ICON| -* `NIH Microarray data `_ +* `NIH Microarray data `_ |FIXME_ICON| -* `OpenSNP genotypes data `_ +* `OpenSNP genotypes data `_ |OK_ICON| -* `Pathguid - Protein-Protein Interactions Catalog `_ +* `Pathguid - Protein-Protein Interactions Catalog `_ |OK_ICON| -* `Protein Data Bank `_ +* `Protein Data Bank `_ |OK_ICON| -* `Psychiatric Genomics Consortium `_ +* `Psychiatric Genomics Consortium `_ |OK_ICON| -* `PubChem Project `_ +* `PubChem Project `_ |OK_ICON| -* `PubGene (now Coremine Medical) `_ +* `PubGene (now Coremine Medical) `_ |OK_ICON| -* `Sanger Catalogue of Somatic Mutations in Cancer (COSMIC) `_ +* `Sanger Catalogue of Somatic Mutations in Cancer (COSMIC) `_ |OK_ICON| -* `Sanger Genomics of Drug Sensitivity in Cancer Project (GDSC) `_ +* `Sanger Genomics of Drug Sensitivity in Cancer Project (GDSC) `_ |OK_ICON| -* `Sequence Read Archive(SRA) `_ +* `Sequence Read Archive(SRA) `_ |OK_ICON| -* `Stanford Microarray Data `_ +* `Stanford Microarray Data `_ |FIXME_ICON| -* `Stowers Institute Original Data Repository `_ +* `Stowers Institute Original Data Repository `_ |OK_ICON| -* `Systems Science of Biological Dynamics (SSBD) Database `_ +* `Systems Science of Biological Dynamics (SSBD) Database `_ |OK_ICON| -* `The Cancer Genome Atlas (TCGA), available via Broad GDAC `_ +* `The Cancer Genome Atlas (TCGA), available via Broad GDAC `_ |OK_ICON| -* `The Catalogue of Life `_ +* `The Catalogue of Life `_ |OK_ICON| -* `The Personal Genome Project `_ +* `The Personal Genome Project `_ |OK_ICON| -* `UCSC Public Data `_ +* `UCSC Public Data `_ |OK_ICON| -* `UniGene `_ +* `UniGene `_ |OK_ICON| -* `Universal Protein Resource (UnitProt) `_ +* `Universal Protein Resource (UnitProt) `_ |OK_ICON| Climate+Weather --------------- -* `Actuaries Climate Index `_ +* `Actuaries Climate Index `_ |OK_ICON| -* `Australian Weather `_ +* `Australian Weather `_ |OK_ICON| -* `Aviation Weather Center - Consistent, timely and accurate weather information for the world airspace system `_ +* `Aviation Weather Center - Consistent, timely and accurate weather information for the world airspace system `_ |OK_ICON| -* `Brazilian Weather - Historical data (In Portuguese) `_ +* `Brazilian Weather - Historical data (In Portuguese) `_ |OK_ICON| -* `Canadian Meteorological Centre `_ +* `Canadian Meteorological Centre `_ |OK_ICON| -* `Climate Data from UEA (updated monthly) `_ +* `Climate Data from UEA (updated monthly) `_ |OK_ICON| -* `European Climate Assessment & Dataset `_ +* `European Climate Assessment & Dataset `_ |OK_ICON| -* `Global Climate Data Since 1929 `_ +* `Global Climate Data Since 1929 `_ |OK_ICON| -* `NASA Global Imagery Browse Services `_ +* `NASA Global Imagery Browse Services `_ |OK_ICON| -* `NOAA Bering Sea Climate `_ +* `NOAA Bering Sea Climate `_ |FIXME_ICON| -* `NOAA Climate Datasets `_ +* `NOAA Climate Datasets `_ |OK_ICON| -* `NOAA Realtime Weather Models `_ +* `NOAA Realtime Weather Models `_ |OK_ICON| -* `NOAA SURFRAD Meteorology and Radiation Datasets `_ +* `NOAA SURFRAD Meteorology and Radiation Datasets `_ |OK_ICON| -* `The World Bank Open Data Resources for Climate Change `_ +* `The World Bank Open Data Resources for Climate Change `_ |OK_ICON| -* `UEA Climatic Research Unit `_ +* `UEA Climatic Research Unit `_ |OK_ICON| -* `WU Historical Weather Worldwide `_ +* `WU Historical Weather Worldwide `_ |OK_ICON| -* `WorldClim - Global Climate Data `_ +* `WorldClim - Global Climate Data `_ |OK_ICON| ComplexNetworks --------------- -* `AMiner Citation Network Dataset `_ +* `AMiner Citation Network Dataset `_ |OK_ICON| -* `CrossRef DOI URLs `_ +* `CrossRef DOI URLs `_ |OK_ICON| -* `DBLP Citation dataset `_ +* `DBLP Citation dataset `_ |OK_ICON| -* `DIMACS Road Networks Collection `_ +* `DIMACS Road Networks Collection `_ |OK_ICON| -* `NBER Patent Citations `_ +* `NBER Patent Citations `_ |OK_ICON| -* `NIST complex networks data collection `_ +* `NIST complex networks data collection `_ |OK_ICON| -* `Network Repository with Interactive Exploratory Analysis Tools `_ +* `Network Repository with Interactive Exploratory Analysis Tools `_ |OK_ICON| -* `Protein-protein interaction network `_ +* `Protein-protein interaction network `_ |OK_ICON| -* `PyPI and Maven Dependency Network `_ +* `PyPI and Maven Dependency Network `_ |OK_ICON| -* `Scopus Citation Database `_ +* `Scopus Citation Database `_ |OK_ICON| -* `Small Network Data `_ +* `Small Network Data `_ |OK_ICON| -* `Stanford GraphBase `_ +* `Stanford GraphBase `_ |OK_ICON| -* `Stanford Large Network Dataset Collection `_ +* `Stanford Large Network Dataset Collection `_ |OK_ICON| -* `Stanford Longitudinal Network Data Sources `_ +* `Stanford Longitudinal Network Data Sources `_ |OK_ICON| -* `The Koblenz Network Collection `_ +* `The Koblenz Network Collection `_ |OK_ICON| -* `The Laboratory for Web Algorithmics (UNIMI) `_ +* `The Laboratory for Web Algorithmics (UNIMI) `_ |OK_ICON| -* `The Nexus Network Repository `_ +* `The Nexus Network Repository `_ |FIXME_ICON| -* `UCI Network Data Repository `_ +* `UCI Network Data Repository `_ |OK_ICON| -* `UFL sparse matrix collection `_ +* `UFL sparse matrix collection `_ |OK_ICON| -* `WSU Graph Database `_ +* `WSU Graph Database `_ |OK_ICON| ComputerNetworks ---------------- -* `3.5B Web Pages from CommonCrawl 2012 `_ +* `3.5B Web Pages from CommonCrawl 2012 `_ |OK_ICON| -* `53.5B Web clicks of 100K users in Indiana Univ. `_ +* `53.5B Web clicks of 100K users in Indiana Univ. `_ |OK_ICON| -* `CAIDA Internet Datasets `_ +* `CAIDA Internet Datasets `_ |OK_ICON| -* `CRAWDAD Wireless datasets from Dartmouth Univ. `_ +* `CRAWDAD Wireless datasets from Dartmouth Univ. `_ |FIXME_ICON| -* `ClueWeb09 - 1B web pages `_ +* `ClueWeb09 - 1B web pages `_ |OK_ICON| -* `ClueWeb12 - 733M web pages `_ +* `ClueWeb12 - 733M web pages `_ |OK_ICON| -* `CommonCrawl Web Data over 7 years `_ +* `CommonCrawl Web Data over 7 years `_ |OK_ICON| -* `Criteo click-through data `_ +* `Criteo click-through data `_ |OK_ICON| -* `OONI: Open Observatory of Network Interference - Internet censorship data `_ +* `OONI: Open Observatory of Network Interference - Internet censorship data `_ |OK_ICON| -* `Open Mobile Data by MobiPerf `_ +* `Open Mobile Data by MobiPerf `_ |OK_ICON| -* `Rapid7 Sonar Internet Scans `_ +* `Rapid7 Sonar Internet Scans `_ |OK_ICON| -* `UCSD Network Telescope, IPv4 /8 net `_ +* `UCSD Network Telescope, IPv4 /8 net `_ |OK_ICON| DataChallenges -------------- -* `Bruteforce Database `_ +* `Bruteforce Database `_ |OK_ICON| -* `Challenges in Machine Learning `_ +* `Challenges in Machine Learning `_ |OK_ICON| -* `CrowdANALYTIX dataX `_ +* `CrowdANALYTIX dataX `_ |OK_ICON| -* `D4D Challenge of Orange `_ +* `D4D Challenge of Orange `_ |FIXME_ICON| -* `DrivenData Competitions for Social Good `_ +* `DrivenData Competitions for Social Good `_ |OK_ICON| -* `ICWSM Data Challenge (since 2009) `_ +* `ICWSM Data Challenge (since 2009) `_ |FIXME_ICON| -* `KDD Cup by Tencent 2012 `_ +* `KDD Cup by Tencent 2012 `_ |OK_ICON| -* `Kaggle Competition Data `_ +* `Kaggle Competition Data `_ |OK_ICON| -* `Localytics Data Visualization Challenge `_ +* `Localytics Data Visualization Challenge `_ |OK_ICON| -* `Netflix Prize `_ +* `Netflix Prize `_ |OK_ICON| -* `Space Apps Challenge `_ +* `Space Apps Challenge `_ |OK_ICON| -* `Telecom Italia Big Data Challenge `_ +* `Telecom Italia Big Data Challenge `_ |OK_ICON| -* `TravisTorrent Dataset - MSR'2017 Mining Challenge `_ +* `TravisTorrent Dataset - MSR'2017 Mining Challenge `_ |OK_ICON| -* `Yelp Dataset Challenge `_ +* `Yelp Dataset Challenge `_ |OK_ICON| EarthScience ------------ -* `AQUASTAT - Global water resources and uses `_ +* `AQUASTAT - Global water resources and uses `_ |OK_ICON| -* `BODC - marine data of ~22K vars `_ +* `BODC - marine data of ~22K vars `_ |OK_ICON| -* `EOSDIS - NASA's earth observing system data `_ +* `EOSDIS - NASA's earth observing system data `_ |OK_ICON| -* `Earth Models `_ +* `Earth Models `_ |OK_ICON| -* `Integrated Marine Observing System (IMOS) - roughly 30TB of ocean measurements `_ +* `Integrated Marine Observing System (IMOS) - roughly 30TB of ocean measurements `_ |OK_ICON| -* `Marinexplore - Open Oceanographic Data `_ +* `Marinexplore - Open Oceanographic Data `_ |OK_ICON| -* `Smithsonian Institution Global Volcano and Eruption Database `_ +* `Smithsonian Institution Global Volcano and Eruption Database `_ |OK_ICON| -* `USGS Earthquake Archives `_ +* `USGS Earthquake Archives `_ |OK_ICON| Economics --------- -* `American Economic Association (AEA) `_ +* `American Economic Association (AEA) `_ |OK_ICON| -* `EconData from UMD `_ +* `EconData from UMD `_ |OK_ICON| -* `Economic Freedom of the World Data `_ +* `Economic Freedom of the World Data `_ |FIXME_ICON| -* `Historical MacroEconomc Statistics `_ +* `Historical MacroEconomc Statistics `_ |OK_ICON| -* `International Economics Database `_ +* `International Economics Database `_ |OK_ICON| -* `International Trade Statistics `_ +* `International Trade Statistics `_ |OK_ICON| -* `Internet Product Code Database `_ +* `Internet Product Code Database `_ |OK_ICON| -* `Joint External Debt Data Hub `_ +* `Joint External Debt Data Hub `_ |OK_ICON| -* `Jon Haveman International Trade Data Links `_ +* `Jon Haveman International Trade Data Links `_ |OK_ICON| -* `OpenCorporates Database of Companies in the World `_ +* `OpenCorporates Database of Companies in the World `_ |OK_ICON| -* `Our World in Data `_ +* `Our World in Data `_ |OK_ICON| -* `SciencesPo World Trade Gravity Datasets `_ +* `SciencesPo World Trade Gravity Datasets `_ |OK_ICON| -* `The Atlas of Economic Complexity `_ +* `The Atlas of Economic Complexity `_ |OK_ICON| -* `The Center for International Data `_ +* `The Center for International Data `_ |OK_ICON| -* `The Observatory of Economic Complexity `_ +* `The Observatory of Economic Complexity `_ |OK_ICON| -* `UN Commodity Trade Statistics `_ +* `UN Commodity Trade Statistics `_ |OK_ICON| -* `UN Human Development Reports `_ +* `UN Human Development Reports `_ |OK_ICON| Education --------- -* `College Scorecard Data `_ +* `College Scorecard Data `_ |OK_ICON| -* `Student Data from Free Code Camp `_ +* `Student Data from Free Code Camp `_ |OK_ICON| Energy ------ -* `AMPds `_ +* `AMPds `_ |OK_ICON| -* `BLUEd `_ +* `BLUEd `_ |OK_ICON| -* `COMBED `_ +* `COMBED `_ |OK_ICON| -* `DRED `_ +* `DRED `_ |OK_ICON| -* `ECO `_ +* `ECO `_ |OK_ICON| -* `EIA `_ +* `EIA `_ |OK_ICON| -* `HES - Household Electricity Study, UK `_ +* `HES - Household Electricity Study, UK `_ |OK_ICON| -* `HFED `_ +* `HFED `_ |OK_ICON| -* `PLAID - The Plug Load Appliance Identification Dataset `_ +* `PLAID - The Plug Load Appliance Identification Dataset `_ |FIXME_ICON| -* `REDD `_ +* `REDD `_ |OK_ICON| -* `Tracebase `_ +* `Tracebase `_ |OK_ICON| -* `UK-DALE - UK Domestic Appliance-Level Electricity `_ +* `UK-DALE - UK Domestic Appliance-Level Electricity `_ |OK_ICON| -* `WHITED `_ +* `WHITED `_ |OK_ICON| -* `iAWE `_ +* `iAWE `_ |OK_ICON| Finance ------- -* `CBOE Futures Exchange `_ +* `CBOE Futures Exchange `_ |FIXME_ICON| -* `Google Finance `_ +* `Google Finance `_ |OK_ICON| -* `Google Trends `_ +* `Google Trends `_ |OK_ICON| -* `NASDAQ `_ +* `NASDAQ `_ |OK_ICON| -* `NYSE Market Data `_ +* `NYSE Market Data `_ |OK_ICON| -* `OANDA `_ +* `OANDA `_ |OK_ICON| -* `OSU Financial data `_ +* `OSU Financial data `_ |OK_ICON| -* `Quandl `_ +* `Quandl `_ |OK_ICON| -* `St Louis Federal `_ +* `St Louis Federal `_ |OK_ICON| -* `Yahoo Finance `_ +* `Yahoo Finance `_ |OK_ICON| GIS --- -* `ArcGIS Open Data portal `_ +* `ArcGIS Open Data portal `_ |OK_ICON| -* `Cambridge, MA, US, GIS data on GitHub `_ +* `Cambridge, MA, US, GIS data on GitHub `_ |OK_ICON| -* `Factual Global Location Data `_ +* `Factual Global Location Data `_ |OK_ICON| -* `Geo Spatial Data from ASU `_ +* `Geo Spatial Data from ASU `_ |OK_ICON| -* `Geo Wiki Project - Citizen-driven Environmental Monitoring `_ +* `Geo Wiki Project - Citizen-driven Environmental Monitoring `_ |OK_ICON| -* `GeoFabrik - OSM data extracted to a variety of formats and areas `_ +* `GeoFabrik - OSM data extracted to a variety of formats and areas `_ |OK_ICON| -* `GeoNames Worldwide `_ +* `GeoNames Worldwide `_ |OK_ICON| -* `Global Administrative Areas Database (GADM) `_ +* `Global Administrative Areas Database (GADM) `_ |OK_ICON| -* `Homeland Infrastructure Foundation-Level Data `_ +* `Homeland Infrastructure Foundation-Level Data `_ |OK_ICON| -* `Landsat 8 on AWS `_ +* `Landsat 8 on AWS `_ |OK_ICON| -* `List of all countries in all languages `_ +* `List of all countries in all languages `_ |OK_ICON| -* `National Weather Service GIS Data Portal `_ +* `National Weather Service GIS Data Portal `_ |OK_ICON| -* `Natural Earth - vectors and rasters of the world `_ +* `Natural Earth - vectors and rasters of the world `_ |OK_ICON| -* `OpenAddresses `_ +* `OpenAddresses `_ |OK_ICON| -* `OpenStreetMap (OSM) `_ +* `OpenStreetMap (OSM) `_ |OK_ICON| -* `Pleiades - Gazetteer and graph of ancient places `_ +* `Pleiades - Gazetteer and graph of ancient places `_ |OK_ICON| -* `Reverse Geocoder using OSM data `_ +* `Reverse Geocoder using OSM data `_ |OK_ICON| -* `TIGER/Line - U.S. boundaries and roads `_ +* `TIGER/Line - U.S. boundaries and roads `_ |FIXME_ICON| -* `TZ Timezones shapfiles `_ +* `TZ Timezones shapfiles `_ |OK_ICON| -* `TwoFishes - Foursquare's coarse geocoder `_ +* `TwoFishes - Foursquare's coarse geocoder `_ |OK_ICON| -* `UN Environmental Data `_ +* `UN Environmental Data `_ |OK_ICON| -* `World boundaries from the U.S. Department of State `_ +* `World boundaries from the U.S. Department of State `_ |FIXME_ICON| -* `World countries in multiple formats `_ +* `World countries in multiple formats `_ |OK_ICON| Government ---------- -* `Alberta, Province of Canada `_ +* `Alberta, Province of Canada `_ |OK_ICON| -* `Antwerp, Belgium `_ +* `Antwerp, Belgium `_ |OK_ICON| -* `Argentina (non official) `_ +* `Argentina (non official) `_ |OK_ICON| -* `Argentina `_ +* `Argentina `_ |FIXME_ICON| -* `Austin, TX, US `_ +* `Austin, TX, US `_ |OK_ICON| -* `Australia (abs.gov.au) `_ +* `Australia (abs.gov.au) `_ |OK_ICON| -* `Australia (data.gov.au) `_ +* `Australia (data.gov.au) `_ |OK_ICON| -* `Austria (data.gv.at) `_ +* `Austria (data.gv.at) `_ |OK_ICON| -* `Baton Rouge, LA, US `_ +* `Baton Rouge, LA, US `_ |OK_ICON| -* `Belgium `_ +* `Belgium `_ |OK_ICON| -* `Brazil `_ +* `Brazil `_ |OK_ICON| -* `Buenos Aires, Argentina `_ +* `Buenos Aires, Argentina `_ |OK_ICON| -* `Calgary, AB, Canada `_ +* `Calgary, AB, Canada `_ |FIXME_ICON| -* `Cambridge, MA, US `_ +* `Cambridge, MA, US `_ |OK_ICON| -* `Canada `_ +* `Canada `_ |FIXME_ICON| -* `Chicago `_ +* `Chicago `_ |OK_ICON| -* `Chile `_ +* `Chile `_ |OK_ICON| -* `Dallas Open Data `_ +* `Dallas Open Data `_ |OK_ICON| -* `DataBC - data from the Province of British Columbia `_ +* `DataBC - data from the Province of British Columbia `_ |OK_ICON| -* `Denver Open Data `_ +* `Denver Open Data `_ |OK_ICON| -* `Durham, NC Open Data `_ +* `Durham, NC Open Data `_ |OK_ICON| -* `Edmonton, AB, Canada `_ +* `Edmonton, AB, Canada `_ |OK_ICON| -* `England LGInform `_ +* `England LGInform `_ |OK_ICON| -* `EuroStat `_ +* `EuroStat `_ |OK_ICON| -* `EveryPolitician - Ongoing project collating and sharing data on every politician. `_ +* `EveryPolitician - Ongoing project collating and sharing data on every politician. `_ |OK_ICON| -* `FedStats `_ +* `FedStats `_ |OK_ICON| -* `Finland `_ +* `Finland `_ |OK_ICON| -* `France `_ +* `France `_ |OK_ICON| -* `Fredericton, NB, Canada `_ +* `Fredericton, NB, Canada `_ |OK_ICON| -* `Gatineau, QC, Canada `_ +* `Gatineau, QC, Canada `_ |OK_ICON| -* `Germany `_ +* `Germany `_ |OK_ICON| -* `Ghent, Belgium `_ +* `Ghent, Belgium `_ |FIXME_ICON| -* `Glasgow, Scotland, UK `_ +* `Glasgow, Scotland, UK `_ |FIXME_ICON| -* `Greece `_ +* `Greece `_ |OK_ICON| -* `Guardian world governments `_ +* `Guardian world governments `_ |OK_ICON| -* `Halifax, NS, Canada `_ +* `Halifax, NS, Canada `_ |FIXME_ICON| -* `Helsinki Region, Finland `_ +* `Helsinki Region, Finland `_ |OK_ICON| -* `Hong Kong, China `_ +* `Hong Kong, China `_ |OK_ICON| -* `Houston Open Data `_ +* `Houston Open Data `_ |FIXME_ICON| -* `Indian Government Data `_ +* `Indian Government Data `_ |OK_ICON| -* `Indonesian Data Portal `_ +* `Indonesian Data Portal `_ |OK_ICON| -* `Ireland's Open Data Portal `_ +* `Ireland's Open Data Portal `_ |OK_ICON| -* `Japan `_ +* `Japan `_ |OK_ICON| -* `Laval, QC, Canada `_ +* `Laval, QC, Canada `_ |OK_ICON| -* `Lexington, KY `_ +* `Lexington, KY `_ |OK_ICON| -* `London Datastore, UK `_ +* `London Datastore, UK `_ |OK_ICON| -* `London, ON, Canada `_ +* `London, ON, Canada `_ |OK_ICON| -* `Los Angeles Open Data `_ +* `Los Angeles Open Data `_ |OK_ICON| -* `MassGIS, Massachusetts, U.S. `_ +* `MassGIS, Massachusetts, U.S. `_ |OK_ICON| -* `Metropolitain Transportation Commission (MTC), California, US `_ +* `Metropolitain Transportation Commission (MTC), California, US `_ |OK_ICON| -* `Mexico `_ +* `Mexico `_ |OK_ICON| -* `Missisauga, ON, Canada `_ +* `Missisauga, ON, Canada `_ |OK_ICON| -* `Moldova `_ +* `Moldova `_ |OK_ICON| -* `Moncton, NB, Canada `_ +* `Moncton, NB, Canada `_ |OK_ICON| -* `Montreal, QC, Canada `_ +* `Montreal, QC, Canada `_ |OK_ICON| -* `Mountain View, California, US (GIS) `_ +* `Mountain View, California, US (GIS) `_ |OK_ICON| -* `NYC Open Data `_ +* `NYC Open Data `_ |FIXME_ICON| -* `NYC betanyc `_ +* `NYC betanyc `_ |OK_ICON| -* `Netherlands `_ +* `Netherlands `_ |OK_ICON| -* `New Zealand `_ +* `New Zealand `_ |OK_ICON| -* `OECD `_ +* `OECD `_ |OK_ICON| -* `Oakland, California, US `_ +* `Oakland, California, US `_ |OK_ICON| -* `Oklahoma `_ +* `Oklahoma `_ |OK_ICON| -* `Open Data for Africa `_ +* `Open Data for Africa `_ |OK_ICON| -* `Open Government Data (OGD) Platform India `_ +* `Open Government Data (OGD) Platform India `_ |OK_ICON| -* `OpenDataSoft's list of 1,600 open data `_ +* `OpenDataSoft's list of 1,600 open data `_ |OK_ICON| -* `Oregon `_ +* `Oregon `_ |OK_ICON| -* `Ottawa, ON, Canada `_ +* `Ottawa, ON, Canada `_ |OK_ICON| -* `Palo Alto, California, US `_ +* `Palo Alto, California, US `_ |OK_ICON| -* `Portland, Oregon `_ +* `Portland, Oregon `_ |OK_ICON| -* `Portugal - Pordata organization `_ +* `Portugal - Pordata organization `_ |OK_ICON| -* `Puerto Rico Government `_ +* `Puerto Rico Government `_ |OK_ICON| -* `Quebec City, QC, Canada `_ +* `Quebec City, QC, Canada `_ |OK_ICON| -* `Quebec Province of Canada `_ +* `Quebec Province of Canada `_ |OK_ICON| -* `Regina SK, Canada `_ +* `Regina SK, Canada `_ |OK_ICON| -* `Rio de Janeiro, Brazil `_ +* `Rio de Janeiro, Brazil `_ |FIXME_ICON| -* `Romania `_ +* `Romania `_ |OK_ICON| -* `Russia `_ +* `Russia `_ |OK_ICON| -* `San Francisco Data sets `_ +* `San Francisco Data sets `_ |OK_ICON| -* `San Jose, California, US `_ +* `San Jose, California, US `_ |OK_ICON| -* `San Mateo County, California, US `_ +* `San Mateo County, California, US `_ |OK_ICON| -* `Saskatchewan, Province of Canada `_ +* `Saskatchewan, Province of Canada `_ |OK_ICON| -* `Seattle `_ +* `Seattle `_ |OK_ICON| -* `Singapore Government Data `_ +* `Singapore Government Data `_ |OK_ICON| -* `South Africa Trade Statistics `_ +* `South Africa Trade Statistics `_ |OK_ICON| -* `South Africa `_ +* `South Africa `_ |OK_ICON| -* `State of Utah, US `_ +* `State of Utah, US `_ |OK_ICON| -* `Switzerland `_ +* `Switzerland `_ |OK_ICON| -* `Taiwan g0v `_ +* `Taiwan g0v `_ |OK_ICON| -* `Taiwan `_ +* `Taiwan `_ |OK_ICON| -* `Texas Open Data `_ +* `Texas Open Data `_ |OK_ICON| -* `The World Bank `_ +* `The World Bank `_ |FIXME_ICON| -* `Toronto, ON, Canada `_ +* `Toronto, ON, Canada `_ |OK_ICON| -* `Tunisia `_ +* `Tunisia `_ |OK_ICON| -* `U.K. Government Data `_ +* `U.K. Government Data `_ |OK_ICON| -* `U.S. American Community Survey `_ +* `U.S. American Community Survey `_ |OK_ICON| -* `U.S. CDC Public Health datasets `_ +* `U.S. CDC Public Health datasets `_ |OK_ICON| -* `U.S. Census Bureau `_ +* `U.S. Census Bureau `_ |OK_ICON| -* `U.S. Department of Housing and Urban Development (HUD) `_ +* `U.S. Department of Housing and Urban Development (HUD) `_ |OK_ICON| -* `U.S. Federal Government Agencies `_ +* `U.S. Federal Government Agencies `_ |OK_ICON| -* `U.S. Federal Government Data Catalog `_ +* `U.S. Federal Government Data Catalog `_ |OK_ICON| -* `U.S. Food and Drug Administration (FDA) `_ +* `U.S. Food and Drug Administration (FDA) `_ |OK_ICON| -* `U.S. National Center for Education Statistics (NCES) `_ +* `U.S. National Center for Education Statistics (NCES) `_ |OK_ICON| -* `U.S. Open Government `_ +* `U.S. Open Government `_ |OK_ICON| -* `UK 2011 Census Open Atlas Project `_ +* `UK 2011 Census Open Atlas Project `_ |FIXME_ICON| -* `Uganda Bureau of Statistics `_ +* `Uganda Bureau of Statistics `_ |OK_ICON| -* `United Nations `_ +* `United Nations `_ |OK_ICON| -* `Uruguay `_ +* `Uruguay `_ |OK_ICON| -* `Valley Transportation Authority (VTA), California, US `_ +* `Valley Transportation Authority (VTA), California, US `_ |OK_ICON| -* `Vancouver, BC Open Data Catalog `_ +* `Vancouver, BC Open Data Catalog `_ |OK_ICON| -* `Victoria, BC, Canada `_ +* `Victoria, BC, Canada `_ |FIXME_ICON| -* `Vienna, Austria `_ +* `Vienna, Austria `_ |OK_ICON| Healthcare ---------- -* `EHDP Large Health Data Sets `_ +* `EHDP Large Health Data Sets `_ |OK_ICON| -* `GDC - GDC supports several cancer genome programs for CCG, TCGA, TARGET etc. `_ +* `GDC - GDC supports several cancer genome programs for CCG, TCGA, TARGET etc. `_ |OK_ICON| -* `Gapminder World demographic databases `_ +* `Gapminder World demographic databases `_ |OK_ICON| -* `MeSH, the vocabulary thesaurus used for indexing articles for PubMed `_ +* `MeSH, the vocabulary thesaurus used for indexing articles for PubMed `_ |OK_ICON| -* `Medicare Coverage Database (MCD), U.S. `_ +* `Medicare Coverage Database (MCD), U.S. `_ |OK_ICON| -* `Medicare Data Engine of medicare.gov Data `_ +* `Medicare Data Engine of medicare.gov Data `_ |OK_ICON| -* `Medicare Data File `_ +* `Medicare Data File `_ |OK_ICON| -* `Number of Ebola Cases and Deaths in Affected Countries (2014) `_ +* `Number of Ebola Cases and Deaths in Affected Countries (2014) `_ |FIXME_ICON| -* `Open-ODS (structure of the UK NHS) `_ +* `Open-ODS (structure of the UK NHS) `_ |OK_ICON| -* `OpenPaymentsData, Healthcare financial relationship data `_ +* `OpenPaymentsData, Healthcare financial relationship data `_ |OK_ICON| -* `PhysioBank Databases - A large and growing archive of physiological data. `_ +* `PhysioBank Databases - A large and growing archive of physiological data. `_ |OK_ICON| -* `The Cancer Genome Atlas project (TCGA) `_ +* `The Cancer Genome Atlas project (TCGA) `_ |OK_ICON| -* `World Health Organization Global Health Observatory `_ +* `World Health Organization Global Health Observatory `_ |OK_ICON| ImageProcessing --------------- -* `10k US Adult Faces Database `_ +* `10k US Adult Faces Database `_ |OK_ICON| -* `2GB of Photos of Cats `_ +* `2GB of Photos of Cats `_ |FIXME_ICON| -* `Adience Unfiltered faces for gender and age classification `_ +* `Adience Unfiltered faces for gender and age classification `_ |OK_ICON| -* `Affective Image Classification `_ +* `Affective Image Classification `_ |OK_ICON| -* `Animals with attributes `_ +* `Animals with attributes `_ |OK_ICON| -* `Caltech Pedestrian Detection Benchmark `_ +* `Caltech Pedestrian Detection Benchmark `_ |OK_ICON| -* `Chars74K dataset - Character Recognition in Natural Images (both English and Kannada are available) `_ +* `Chars74K dataset - Character Recognition in Natural Images (both English and Kannada are available) `_ |OK_ICON| -* `Face Recognition Benchmark `_ +* `Face Recognition Benchmark `_ |OK_ICON| -* `Flickr: 32 Class Brand Logos `_ +* `Flickr: 32 Class Brand Logos `_ |OK_ICON| -* `GDXray - X-ray images for X-ray testing and Computer Vision `_ +* `GDXray - X-ray images for X-ray testing and Computer Vision `_ |OK_ICON| -* `ImageNet (in WordNet hierarchy) `_ +* `ImageNet (in WordNet hierarchy) `_ |OK_ICON| -* `Indoor Scene Recognition `_ +* `Indoor Scene Recognition `_ |OK_ICON| -* `International Affective Picture System, UFL `_ +* `International Affective Picture System, UFL `_ |OK_ICON| -* `MNIST database of handwritten digits, near 1 million examples `_ +* `MNIST database of handwritten digits, near 1 million examples `_ |OK_ICON| -* `Massive Visual Memory Stimuli, MIT `_ +* `Massive Visual Memory Stimuli, MIT `_ |OK_ICON| -* `SUN database, MIT `_ +* `SUN database, MIT `_ |OK_ICON| -* `Several Shape-from-Silhouette Datasets `_ +* `Several Shape-from-Silhouette Datasets `_ |FIXME_ICON| -* `Stanford Dogs Dataset `_ +* `Stanford Dogs Dataset `_ |OK_ICON| -* `The Action Similarity Labeling (ASLAN) Challenge `_ +* `The Action Similarity Labeling (ASLAN) Challenge `_ |OK_ICON| -* `The Oxford-IIIT Pet Dataset `_ +* `The Oxford-IIIT Pet Dataset `_ |OK_ICON| -* `Violent-Flows - Crowd Violence / Non-violence Database and benchmark `_ +* `Violent-Flows - Crowd Violence / Non-violence Database and benchmark `_ |OK_ICON| -* `Visual genome `_ +* `Visual genome `_ |OK_ICON| -* `YouTube Faces Database `_ +* `YouTube Faces Database `_ |OK_ICON| MachineLearning --------------- -* `Context-aware data sets from five domains `_ +* `Context-aware data sets from five domains `_ |OK_ICON| -* `Delve Datasets for classification and regression `_ +* `Delve Datasets for classification and regression `_ |OK_ICON| -* `Discogs Monthly Data `_ +* `Discogs Monthly Data `_ |OK_ICON| -* `Free Music Archive `_ +* `Free Music Archive `_ |OK_ICON| -* `IMDb Database `_ +* `IMDb Database `_ |OK_ICON| -* `Keel Repository for classification, regression and time series `_ +* `Keel Repository for classification, regression and time series `_ |OK_ICON| -* `Labeled Faces in the Wild (LFW) `_ +* `Labeled Faces in the Wild (LFW) `_ |OK_ICON| -* `Lending Club Loan Data `_ +* `Lending Club Loan Data `_ |OK_ICON| -* `Machine Learning Data Set Repository `_ +* `Machine Learning Data Set Repository `_ |OK_ICON| -* `Million Song Dataset `_ +* `Million Song Dataset `_ |OK_ICON| -* `More Song Datasets `_ +* `More Song Datasets `_ |OK_ICON| -* `MovieLens Data Sets `_ +* `MovieLens Data Sets `_ |OK_ICON| -* `New Yorker caption contest ratings `_ +* `New Yorker caption contest ratings `_ |OK_ICON| -* `RDataMining - "R and Data Mining" ebook data `_ +* `RDataMining - "R and Data Mining" ebook data `_ |OK_ICON| -* `Registered Meteorites on Earth `_ +* `Registered Meteorites on Earth `_ |OK_ICON| -* `Restaurants Health Score Data in San Francisco `_ +* `Restaurants Health Score Data in San Francisco `_ |FIXME_ICON| -* `UCI Machine Learning Repository `_ +* `UCI Machine Learning Repository `_ |OK_ICON| -* `Yahoo! Ratings and Classification Data `_ +* `Yahoo! Ratings and Classification Data `_ |FIXME_ICON| -* `Youtube 8m `_ +* `Youtube 8m `_ |OK_ICON| -* `eBay Online Auctions (2012) `_ +* `eBay Online Auctions (2012) `_ |OK_ICON| Museums ------- -* `Canada Science and Technology Museums Corporation's Open Data `_ +* `Canada Science and Technology Museums Corporation's Open Data `_ |OK_ICON| -* `Cooper-Hewitt's Collection Database `_ +* `Cooper-Hewitt's Collection Database `_ |OK_ICON| -* `Minneapolis Institute of Arts metadata `_ +* `Minneapolis Institute of Arts metadata `_ |OK_ICON| -* `Natural History Museum (London) Data Portal `_ +* `Natural History Museum (London) Data Portal `_ |OK_ICON| -* `Rijksmuseum Historical Art Collection `_ +* `Rijksmuseum Historical Art Collection `_ |OK_ICON| -* `Tate Collection metadata `_ +* `Tate Collection metadata `_ |OK_ICON| -* `The Getty vocabularies `_ +* `The Getty vocabularies `_ |OK_ICON| NaturalLanguage --------------- -* `Automatic Keyphrase Extraction `_ +* `Automatic Keyphrase Extraction `_ |OK_ICON| -* `Blogger Corpus `_ +* `Blogger Corpus `_ |OK_ICON| -* `CLiPS Stylometry Investigation Corpus `_ +* `CLiPS Stylometry Investigation Corpus `_ |OK_ICON| -* `ClueWeb09 FACC `_ +* `ClueWeb09 FACC `_ |OK_ICON| -* `ClueWeb12 FACC `_ +* `ClueWeb12 FACC `_ |OK_ICON| -* `DBpedia - 4.58M things with 583M facts `_ +* `DBpedia - 4.58M things with 583M facts `_ |OK_ICON| -* `Flickr Personal Taxonomies `_ +* `Flickr Personal Taxonomies `_ |OK_ICON| -* `Freebase of people, places, and things `_ +* `Freebase of people, places, and things `_ |OK_ICON| -* `Google Books Ngrams (2.2TB) `_ +* `Google Books Ngrams (2.2TB) `_ |OK_ICON| -* `Google MC-AFP - Generated based on the public available Gigaword dataset using Paragraph Vectors `_ +* `Google MC-AFP - Generated based on the public available Gigaword dataset using Paragraph Vectors `_ |OK_ICON| -* `Google Web 5gram (1TB, 2006) `_ +* `Google Web 5gram (1TB, 2006) `_ |OK_ICON| -* `Gutenberg eBooks List `_ +* `Gutenberg eBooks List `_ |OK_ICON| -* `Hansards text chunks of Canadian Parliament `_ +* `Hansards text chunks of Canadian Parliament `_ |OK_ICON| -* `Microsoft MAchine Reading COmprehension Dataset (or MS MARCO) `_ +* `Microsoft MAchine Reading COmprehension Dataset (or MS MARCO) `_ |OK_ICON| -* `Machine Comprehension Test (MCTest) of text from Microsoft Research `_ +* `Machine Comprehension Test (MCTest) of text from Microsoft Research `_ |OK_ICON| -* `Machine Translation of European languages `_ +* `Machine Translation of European languages `_ |OK_ICON| -* `Making Sense of Microposts 2013 - Concept Extraction `_ +* `Making Sense of Microposts 2013 - Concept Extraction `_ |FIXME_ICON| -* `Making Sense of Microposts 2016 - Named Entity rEcognition and Linking `_ +* `Making Sense of Microposts 2016 - Named Entity rEcognition and Linking `_ |OK_ICON| -* `Multi-Domain Sentiment Dataset (version 2.0) `_ +* `Multi-Domain Sentiment Dataset (version 2.0) `_ |OK_ICON| -* `Open Multilingual Wordnet `_ +* `Open Multilingual Wordnet `_ |OK_ICON| -* `POS/NER/Chunk annotated data `_ +* `POS/NER/Chunk annotated data `_ |OK_ICON| -* `Personae Corpus `_ +* `Personae Corpus `_ |OK_ICON| -* `SMS Spam Collection in English `_ +* `SMS Spam Collection in English `_ |OK_ICON| -* `SaudiNewsNet Collection of Saudi Newspaper Articles (Arabic, 30K articles) `_ +* `SaudiNewsNet Collection of Saudi Newspaper Articles (Arabic, 30K articles) `_ |OK_ICON| -* `Stanford Question Answering Dataset (SQuAD) `_ +* `Stanford Question Answering Dataset (SQuAD) `_ |OK_ICON| -* `USENET postings corpus of 2005~2011 `_ +* `USENET postings corpus of 2005~2011 `_ |OK_ICON| -* `Universal Dependencies `_ +* `Universal Dependencies `_ |OK_ICON| -* `Webhose - News/Blogs in multiple languages `_ +* `Webhose - News/Blogs in multiple languages `_ |OK_ICON| -* `Wikidata - Wikipedia databases `_ +* `Wikidata - Wikipedia databases `_ |OK_ICON| -* `Wikipedia Links data - 40 Million Entities in Context `_ +* `Wikipedia Links data - 40 Million Entities in Context `_ |OK_ICON| -* `WordNet databases and tools `_ +* `WordNet databases and tools `_ |OK_ICON| Neuroscience ------------ -* `Allen Institute Datasets `_ +* `Allen Institute Datasets `_ |OK_ICON| -* `Brain Catalogue `_ +* `Brain Catalogue `_ |OK_ICON| -* `Brainomics `_ +* `Brainomics `_ |OK_ICON| -* `CodeNeuro Datasets `_ +* `CodeNeuro Datasets `_ |OK_ICON| -* `Collaborative Research in Computational Neuroscience (CRCNS) `_ +* `Collaborative Research in Computational Neuroscience (CRCNS) `_ |OK_ICON| -* `FCP-INDI `_ +* `FCP-INDI `_ |OK_ICON| -* `Human Connectome Project `_ +* `Human Connectome Project `_ |OK_ICON| -* `NDAR `_ +* `NDAR `_ |OK_ICON| -* `NIMH Data Archive `_ +* `NIMH Data Archive `_ |OK_ICON| -* `NeuroData `_ +* `NeuroData `_ |OK_ICON| -* `Neuroelectro `_ +* `Neuroelectro `_ |OK_ICON| -* `OASIS `_ +* `OASIS `_ |OK_ICON| -* `OpenfMRI `_ +* `OpenfMRI `_ |OK_ICON| -* `Study Forrest `_ +* `Study Forrest `_ |OK_ICON| Physics ------- -* `CERN Open Data Portal `_ +* `CERN Open Data Portal `_ |OK_ICON| -* `Crystallography Open Database `_ +* `Crystallography Open Database `_ |OK_ICON| -* `NASA Exoplanet Archive `_ +* `NASA Exoplanet Archive `_ |OK_ICON| -* `NSSDC (NASA) data of 550 space spacecraft `_ +* `NSSDC (NASA) data of 550 space spacecraft `_ |OK_ICON| -* `Sloan Digital Sky Survey (SDSS) - Mapping the Universe `_ +* `Sloan Digital Sky Survey (SDSS) - Mapping the Universe `_ |OK_ICON| Psychology+Cognition -------------------- -* `OSU Cognitive Modeling Repository Datasets `_ +* `OSU Cognitive Modeling Repository Datasets `_ |FIXME_ICON| PublicDomains ------------- -* `Amazon `_ +* `Amazon `_ |OK_ICON| -* `Archive.org Datasets `_ +* `Archive.org Datasets `_ |OK_ICON| -* `Archive-it from Internet Archive `_ +* `Archive-it from Internet Archive `_ |OK_ICON| -* `CMU JASA data archive `_ +* `CMU JASA data archive `_ |OK_ICON| -* `CMU StatLab collections `_ +* `CMU StatLab collections `_ |OK_ICON| -* `Data.World `_ +* `Data.World `_ |OK_ICON| -* `Data360 `_ +* `Data360 `_ |OK_ICON| -* `Enigma Public `_ +* `Enigma Public `_ |OK_ICON| -* `Google `_ +* `Google `_ |OK_ICON| -* `Infochimps `_ +* `Infochimps `_ |FIXME_ICON| -* `KDNuggets Data Collections `_ +* `KDNuggets Data Collections `_ |OK_ICON| -* `Microsoft Azure Data Market Free DataSets `_ +* `Microsoft Azure Data Market Free DataSets `_ |OK_ICON| -* `Microsoft Data Science for Research `_ +* `Microsoft Data Science for Research `_ |OK_ICON| -* `Numbray `_ +* `Numbray `_ |FIXME_ICON| -* `Open Library Data Dumps `_ +* `Open Library Data Dumps `_ |OK_ICON| -* `Reddit Datasets `_ +* `Reddit Datasets `_ |OK_ICON| -* `RevolutionAnalytics Collection `_ +* `RevolutionAnalytics Collection `_ |OK_ICON| -* `Sample R data sets `_ +* `Sample R data sets `_ |OK_ICON| -* `StatSci.org `_ +* `StatSci.org `_ |OK_ICON| -* `Stats4Stem R data sets `_ +* `Stats4Stem R data sets `_ |FIXME_ICON| -* `The Washington Post List `_ +* `The Washington Post List `_ |OK_ICON| -* `UCLA SOCR data collection `_ +* `UCLA SOCR data collection `_ |OK_ICON| -* `UFO Reports `_ +* `UFO Reports `_ |OK_ICON| -* `Wikileaks 911 pager intercepts `_ +* `Wikileaks 911 pager intercepts `_ |OK_ICON| -* `Yahoo Webscope `_ +* `Yahoo Webscope `_ |FIXME_ICON| SearchEngines ------------- -* `Academic Torrents of data sharing from UMB `_ +* `Academic Torrents of data sharing from UMB `_ |OK_ICON| -* `DataMarket (Qlik) `_ +* `DataMarket (Qlik) `_ |OK_ICON| -* `Datahub.io `_ +* `Datahub.io `_ |OK_ICON| -* `Harvard Dataverse Network of scientific data `_ +* `Harvard Dataverse Network of scientific data `_ |OK_ICON| -* `ICPSR (UMICH) `_ +* `ICPSR (UMICH) `_ |OK_ICON| -* `Institute of Education Sciences `_ +* `Institute of Education Sciences `_ |OK_ICON| -* `National Technical Reports Library `_ +* `National Technical Reports Library `_ |FIXME_ICON| -* `Open Data Certificates (beta) `_ +* `Open Data Certificates (beta) `_ |OK_ICON| -* `OpenDataNetwork - A search engine of all Socrata powered data portals `_ +* `OpenDataNetwork - A search engine of all Socrata powered data portals `_ |OK_ICON| -* `Statista.com - statistics and Studies `_ +* `Statista.com - statistics and Studies `_ |OK_ICON| -* `Zenodo - An open dependable home for the long-tail of science `_ +* `Zenodo - An open dependable home for the long-tail of science `_ |OK_ICON| SocialNetworks -------------- -* `72 hours #gamergate Twitter Scrape `_ +* `72 hours #gamergate Twitter Scrape `_ |OK_ICON| -* `Ancestry.com Forum Dataset over 10 years `_ +* `Ancestry.com Forum Dataset over 10 years `_ |OK_ICON| -* `CMU Enron Email of 150 users `_ +* `CMU Enron Email of 150 users `_ |OK_ICON| -* `Cheng-Caverlee-Lee September 2009 - January 2010 Twitter Scrape `_ +* `Cheng-Caverlee-Lee September 2009 - January 2010 Twitter Scrape `_ |OK_ICON| -* `EDRM Enron EMail of 151 users, hosted on S3 `_ +* `EDRM Enron EMail of 151 users, hosted on S3 `_ |OK_ICON| -* `Facebook Data Scrape (2005) `_ +* `Facebook Data Scrape (2005) `_ |OK_ICON| -* `Facebook Social Networks from LAW (since 2007) `_ +* `Facebook Social Networks from LAW (since 2007) `_ |OK_ICON| -* `Foursquare from UMN/Sarwat (2013) `_ +* `Foursquare from UMN/Sarwat (2013) `_ |OK_ICON| -* `GitHub Collaboration Archive `_ +* `GitHub Collaboration Archive `_ |OK_ICON| -* `Google Scholar citation relations `_ +* `Google Scholar citation relations `_ |OK_ICON| -* `High-Resolution Contact Networks from Wearable Sensors `_ +* `High-Resolution Contact Networks from Wearable Sensors `_ |OK_ICON| -* `Indie Map: social graph and crawl of top IndieWeb sites `_ +* `Indie Map: social graph and crawl of top IndieWeb sites `_ |OK_ICON| -* `Mobile Social Networks from UMASS `_ +* `Mobile Social Networks from UMASS `_ |OK_ICON| -* `Network Twitter Data `_ +* `Network Twitter Data `_ |OK_ICON| -* `Reddit Comments `_ +* `Reddit Comments `_ |OK_ICON| -* `Skytrax' Air Travel Reviews Dataset `_ +* `Skytrax' Air Travel Reviews Dataset `_ |OK_ICON| -* `Social Twitter Data `_ +* `Social Twitter Data `_ |OK_ICON| -* `SourceForge.net Research Data `_ +* `SourceForge.net Research Data `_ |OK_ICON| -* `Twitter Data for Online Reputation Management `_ +* `Twitter Data for Online Reputation Management `_ |OK_ICON| -* `Twitter Data for Sentiment Analysis `_ +* `Twitter Data for Sentiment Analysis `_ |OK_ICON| -* `Twitter Graph of entire Twitter site `_ +* `Twitter Graph of entire Twitter site `_ |OK_ICON| -* `Twitter Scrape Calufa May 2011 `_ +* `Twitter Scrape Calufa May 2011 `_ |FIXME_ICON| -* `UNIMI/LAW Social Network Datasets `_ +* `UNIMI/LAW Social Network Datasets `_ |OK_ICON| -* `Yahoo! Graph and Social Data `_ +* `Yahoo! Graph and Social Data `_ |FIXME_ICON| -* `Youtube Video Social Graph in 2007,2008 `_ +* `Youtube Video Social Graph in 2007,2008 `_ |OK_ICON| SocialSciences -------------- -* `ACLED (Armed Conflict Location & Event Data Project) `_ +* `ACLED (Armed Conflict Location & Event Data Project) `_ |OK_ICON| -* `Canadian Legal Information Institute `_ +* `Canadian Legal Information Institute `_ |FIXME_ICON| -* `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ +* `Center for Systemic Peace Datasets - Conflict Trends, Polities, State Fragility, etc `_ |OK_ICON| -* `Correlates of War Project `_ +* `Correlates of War Project `_ |OK_ICON| -* `Cryptome Conspiracy Theory Items `_ +* `Cryptome Conspiracy Theory Items `_ |OK_ICON| -* `Datacards `_ +* `Datacards `_ |FIXME_ICON| -* `European Social Survey `_ +* `European Social Survey `_ |OK_ICON| -* `FBI Hate Crime 2013 - aggregated data `_ +* `FBI Hate Crime 2013 - aggregated data `_ |OK_ICON| -* `Fragile States Index `_ +* `Fragile States Index `_ |FIXME_ICON| -* `GDELT Global Events Database `_ +* `GDELT Global Events Database `_ |OK_ICON| -* `General Social Survey (GSS) since 1972 `_ +* `General Social Survey (GSS) since 1972 `_ |OK_ICON| -* `German Social Survey `_ +* `German Social Survey `_ |OK_ICON| -* `Global Religious Futures Project `_ +* `Global Religious Futures Project `_ |OK_ICON| -* `Humanitarian Data Exchange `_ +* `Humanitarian Data Exchange `_ |FIXME_ICON| -* `INFORM Index for Risk Management `_ +* `INFORM Index for Risk Management `_ |OK_ICON| -* `Institute for Demographic Studies `_ +* `Institute for Demographic Studies `_ |OK_ICON| -* `International Networks Archive `_ +* `International Networks Archive `_ |OK_ICON| -* `International Social Survey Program ISSP `_ +* `International Social Survey Program ISSP `_ |OK_ICON| -* `International Studies Compendium Project `_ +* `International Studies Compendium Project `_ |OK_ICON| -* `James McGuire Cross National Data `_ +* `James McGuire Cross National Data `_ |OK_ICON| -* `MIT Reality Mining Dataset `_ +* `MIT Reality Mining Dataset `_ |OK_ICON| -* `MacroData Guide by Norsk samfunnsvitenskapelig datatjeneste `_ +* `MacroData Guide by Norsk samfunnsvitenskapelig datatjeneste `_ |OK_ICON| -* `Minnesota Population Center `_ +* `Minnesota Population Center `_ |OK_ICON| -* `Notre Dame Global Adaptation Index (NG-DAIN) `_ +* `Notre Dame Global Adaptation Index (NG-DAIN) `_ |OK_ICON| -* `Open Crime and Policing Data in England, Wales and Northern Ireland `_ +* `Open Crime and Policing Data in England, Wales and Northern Ireland `_ |OK_ICON| -* `Paul Hensel General International Data Page `_ +* `Paul Hensel General International Data Page `_ |OK_ICON| -* `PewResearch Internet Survey Project `_ +* `PewResearch Internet Survey Project `_ |FIXME_ICON| -* `PewResearch Society Data Collection `_ +* `PewResearch Society Data Collection `_ |OK_ICON| -* `Political Polarity Data `_ +* `Political Polarity Data `_ |OK_ICON| -* `StackExchange Data Explorer `_ +* `StackExchange Data Explorer `_ |OK_ICON| -* `Terrorism Research and Analysis Consortium `_ +* `Terrorism Research and Analysis Consortium `_ |OK_ICON| -* `Texas Inmates Executed Since 1984 `_ +* `Texas Inmates Executed Since 1984 `_ |FIXME_ICON| -* `Titanic Survival Data Set `_ +* `Titanic Survival Data Set `_ |OK_ICON| -* `UCB's Archive of Social Science Data (D-Lab) `_ +* `UCB's Archive of Social Science Data (D-Lab) `_ |OK_ICON| -* `UCLA Social Sciences Data Archive `_ +* `UCLA Social Sciences Data Archive `_ |FIXME_ICON| -* `UN Civil Society Database `_ +* `UN Civil Society Database `_ |OK_ICON| -* `UPJOHN for Labor Employment Research `_ +* `UPJOHN for Labor Employment Research `_ |OK_ICON| -* `Universities Worldwide `_ +* `Universities Worldwide `_ |OK_ICON| -* `Uppsala Conflict Data Program `_ +* `Uppsala Conflict Data Program `_ |OK_ICON| -* `World Bank Open Data `_ +* `World Bank Open Data `_ |OK_ICON| -* `WorldPop project - Worldwide human population distributions `_ +* `WorldPop project - Worldwide human population distributions `_ |OK_ICON| Software -------- -* `FLOSSmole data about free, libre, and open source software development `_ +* `FLOSSmole data about free, libre, and open source software development `_ |OK_ICON| Sports ------ -* `Betfair Historical Exchange Data `_ +* `Betfair Historical Exchange Data `_ |OK_ICON| -* `Cricsheet Matches (cricket) `_ +* `Cricsheet Matches (cricket) `_ |OK_ICON| -* `Ergast Formula 1, from 1950 up to date (API) `_ +* `Ergast Formula 1, from 1950 up to date (API) `_ |OK_ICON| -* `Football/Soccer resources (data and APIs) `_ +* `Football/Soccer resources (data and APIs) `_ |OK_ICON| -* `Lahman's Baseball Database `_ +* `Lahman's Baseball Database `_ |OK_ICON| -* `Pinhooker: Thoroughbred Bloodstock Sale Data `_ +* `Pinhooker: Thoroughbred Bloodstock Sale Data `_ |OK_ICON| -* `Retrosheet Baseball Statistics `_ +* `Retrosheet Baseball Statistics `_ |OK_ICON| -* `Tennis database of rankings, results, and stats for ATP `_ +* `Tennis database of rankings, results, and stats for ATP `_ |OK_ICON| TimeSeries ---------- -* `Databanks International Cross National Time Series Data Archive `_ +* `Databanks International Cross National Time Series Data Archive `_ |OK_ICON| -* `Hard Drive Failure Rates `_ +* `Hard Drive Failure Rates `_ |OK_ICON| -* `Heart Rate Time Series from MIT `_ +* `Heart Rate Time Series from MIT `_ |OK_ICON| -* `Time Series Data Library (TSDL) from MU `_ +* `Time Series Data Library (TSDL) from MU `_ |OK_ICON| -* `UC Riverside Time Series Dataset `_ +* `UC Riverside Time Series Dataset `_ |OK_ICON| Transportation -------------- -* `Airlines OD Data 1987-2008 `_ +* `Airlines OD Data 1987-2008 `_ |OK_ICON| -* `Bay Area Bike Share Data `_ +* `Bay Area Bike Share Data `_ |OK_ICON| -* `Bike Share Systems (BSS) collection `_ +* `Bike Share Systems (BSS) collection `_ |OK_ICON| -* `GeoLife GPS Trajectory from Microsoft Research `_ +* `GeoLife GPS Trajectory from Microsoft Research `_ |OK_ICON| -* `German train system by Deutsche Bahn `_ +* `German train system by Deutsche Bahn `_ |OK_ICON| -* `Hubway Million Rides in MA `_ +* `Hubway Million Rides in MA `_ |OK_ICON| -* `Montreal BIXI Bike Share `_ +* `Montreal BIXI Bike Share `_ |OK_ICON| -* `NYC Taxi Trip Data 2009- `_ +* `NYC Taxi Trip Data 2009- `_ |OK_ICON| -* `NYC Taxi Trip Data 2013 (FOIA/FOILed) `_ +* `NYC Taxi Trip Data 2013 (FOIA/FOILed) `_ |OK_ICON| -* `NYC Uber trip data April 2014 to September 2014 `_ +* `NYC Uber trip data April 2014 to September 2014 `_ |OK_ICON| -* `Open Traffic collection `_ +* `Open Traffic collection `_ |OK_ICON| -* `OpenFlights - airport, airline and route data `_ +* `OpenFlights - airport, airline and route data `_ |OK_ICON| -* `Philadelphia Bike Share Stations (JSON) `_ +* `Philadelphia Bike Share Stations (JSON) `_ |FIXME_ICON| -* `Plane Crash Database, since 1920 `_ +* `Plane Crash Database, since 1920 `_ |OK_ICON| -* `RITA Airline On-Time Performance data `_ +* `RITA Airline On-Time Performance data `_ |OK_ICON| -* `RITA/BTS transport data collection (TranStat) `_ +* `RITA/BTS transport data collection (TranStat) `_ |OK_ICON| -* `Toronto Bike Share Stations (XML file) `_ +* `Toronto Bike Share Stations (XML file) `_ |FIXME_ICON| -* `Transport for London (TFL) `_ +* `Transport for London (TFL) `_ |OK_ICON| -* `Travel Tracker Survey (TTS) for Chicago `_ +* `Travel Tracker Survey (TTS) for Chicago `_ |OK_ICON| -* `U.S. Bureau of Transportation Statistics (BTS) `_ +* `U.S. Bureau of Transportation Statistics (BTS) `_ |OK_ICON| -* `U.S. Domestic Flights 1990 to 2009 `_ +* `U.S. Domestic Flights 1990 to 2009 `_ |OK_ICON| -* `U.S. Freight Analysis Framework since 2007 `_ +* `U.S. Freight Analysis Framework since 2007 `_ |OK_ICON| Complementary Collections