diff --git a/README.rst b/README.rst
index ff6a750..1f8b63d 100644
--- a/README.rst
+++ b/README.rst
@@ -13,8 +13,6 @@ Other amazingly awesome lists can be found in the
 `awesome-awesomeness `_ and
 `sindresorhus's awesome `_ list.
 
-* `Visit our Google Group on APD `_
-
 
 Agriculture
 ------------
@@ -339,12 +337,13 @@ Natural Language
 * `ClueWeb12 FACC `_
 * `DBpedia - 4.58M things with 583M facts `_
 * `Flickr Personal Taxonomies `_
+* `Freebase.com of people, places, and things `_
 * `Google Books Ngrams (2.2TB) `_
 * `Google Web 5gram (1TB, 2006) `_
 * `Gutenberg eBooks List `_
 * `Hansards text chunks of Canadian Parliament `_
-* `Machine Translation of European languages `_
 * `Machine Comprehension Test (MCTest) of text from Microsoft Research `_
+* `Machine Translation of European languages `_
 * `SaudiNewsNet Collection of Saudi Newspaper Articles (Arabic, 30K articles) `_
 * `SMS Spam Collection in English `_
 * `USENET postings corpus of 2005~2011 `_
@@ -401,28 +400,18 @@ Search Engines
 * `Archive-it from Internet Archive `_
 * `Datahub.io `_
 * `DataMarket (Qlik) `_
-* `Freebase.com of people, places, and things `_
 * `Harvard Dataverse Network of scientific data `_
 * `ICPSR (UMICH) `_
 * `Open Data Certificates (beta) `_
 * `Statista.com - statistics and Studies `_
 
 
-Social Networks
----------------
-
-* `72 hours #gamergate scrape `_
-* `Cheng-Caverlee-Lee September 2009 - January 2010 Twitter Scrape `_
-* `May 2011 Calufa Twitter Scrape `_
-* `Network Twitter Data `_
-* `Social Twitter Data `_
-* `Twitter Data for Sentiment Analysis `_
-
-
 Social Sciences
 ---------------
 
+* `72 hours #gamergate scrape `_
 * `Ancestry.com Forum Dataset over 10 years `_
+* `Cheng-Caverlee-Lee September 2009 - January 2010 Twitter Scrape `_
 * `CMU Enron Email of 150 users `_
 * `EDRM Enron EMail of 151 users, hosted on S3 `_
 * `Facebook Data Scrape (2005) `_
@@ -436,15 +425,20 @@ Social Sciences
 * `Google Scholar citation relations `_
 * `MIT Reality Mining Dataset `_
 * `Mobile Social Networks from UMASS `_
+* `Network Twitter Data `_
 * `PewResearch Internet Survey Project `_
+* `PewResearch Society Data Collection `_
 * `Political Polarity Data `_
 * `Reddit Comments `_
 * `Skytrax' Air Travel Reviews Dataset `_
+* `Social Twitter Data `_
 * `SourceForge.net Research Data `_
 * `StackExchange Data Explorer `_
 * `Texas Inmates Executed Since 1984 `_
 * `Titanic Survival Data Set `_
+* `Twitter Data for Sentiment Analysis `_
 * `Twitter Graph of entire Twitter site `_
+* `Twitter Scrape Calufa May 2011 `_
 * `UCB's Archive of Social Science Data (D-Lab) `_
 * `UCLA Social Sciences Data Archive `_
 * `UNIMI/LAW Social Network Datasets `_