PUMA bookmarks for /tag/dataset
https://puma.uni-kassel.de/tag/dataset
PUMA RSS Feed for /tag/dataset

Webscope from Yahoo! Labs
URL: http://webscope.sandbox.yahoo.com/catalog.php
User: hotho
Date: 2011-10-04T17:49:26+02:00
Tags: language search web dataset

MIT Media Lab: Reality Mining
URL: http://reality.media.mit.edu/
User: hotho
Date: 2011-09-30T08:49:38+02:00
Tags: data dm everyaware lab media mining reality traces dataset

Citation Network Dataset
URL: http://arnetminer.org/citation
User: stephandoerfel
Date: 2011-09-15T15:22:45+02:00
Tags: arnetminer citation dataset

Academics - Prosper
URL: http://www.prosper.com/about/academics.aspx
User: hotho
Date: 2011-09-05T17:04:47+02:00
Tags: data dataset research
Description: If you are interested in doing research on Prosper or using Prosper data in support of your research, please contact us.

Tweets2011 Twitter Collection
URL: http://trec.nist.gov/data/tweets/
User: hotho
Date: 2011-09-02T10:41:42+02:00
Tags: corpus dataset everyaware twitter
Description: As part of the TREC 2011 microblog track, Twitter provided identifiers for approximately 16 million tweets sampled between January 23rd and February 8th, 2011. The corpus is designed to be a reusable, representative sample of the twittersphere - i.e. both important and spam tweets are included.

RecLab Core -
URL: http://code.richrelevance.com/reclab-core/
User: hotho
Date: 2011-05-26T11:48:26+02:00
Tags: algorithm challenge data dataset development improvement method recommender

Datasets
URL: http://d8taplex.com/directory/directory.html
User: hotho
Date: 2011-05-03T10:19:39+02:00
Tags: dataset series time

d8taplex
URL: http://d8taplex.com/
User: hotho
Date: 2011-05-03T10:18:57+02:00
Tags: data dataset discovery exploration visualization web
Description: d8taplex helps you discover, visualize and explore data found on the web including time series data

What is Twitter, a Social Network or a News Media? - WWW'10
URL: http://an.kaist.ac.kr/traces/WWW2010.html
User: hotho
Date: 2011-03-24T10:18:26+01:00
Tags: dataset network social twitter

Microsoft Research - Speller Challenge Datasets
URL: http://web-ngram.research.microsoft.com/spellerchallenge/DataSets.aspx
User: benz
Date: 2011-03-16T23:23:07+01:00
Tags: challenge dataset search_engine speller_challenge spelling
Description: Microsoft Research Speller Challenge

Welcome to Apache Pig!
URL: http://pig.apache.org/
User: hotho
Date: 2011-03-14T18:59:41+01:00
Tags: analysis dataset datastore large pig

Longman Dictionaries - Dictionaries for Research
URL: http://www.pearsonlongman.com/dictionaries/research/dict-research.html
User: benz
Date: 2011-02-18T23:23:09+01:00
Tags: dataset dictionary disambiguation ldoce
Description: Pearson Longman English Language Teaching (Pearson Longman ELT) is a leading educational publisher of quality resources for all ages and abilities across the curriculum, providing solutions for teachers and students.

Semantically Annotated Snapshot of the English Wikipedia (SW v.1)
URL: http://www.yr-bcn.es/semanticWikipedia
User: benz
Date: 2011-02-04T16:08:40+01:00
Tags: semantics dataset wikipedia annotated ontology

Twapper Keeper - Archive Tweets
URL: http://twapperkeeper.com/
User: benz
Date: 2011-02-04T16:07:48+01:00
Tags: dataset twapper twapper_keeper twitter
Description: Allows you to archive and organize your tweets based upon hash tags.

[twitter-dev] Re: Tweet Corpus creation for NLP research
URL: http://www.mail-archive.com/twitter-development-talk@googlegroups.com/msg05715.html
User: benz
Date: 2011-02-04T16:07:48+01:00
Tags: dataset twitter

Download Wikipedia Category Taxonomy
URL: http://www.eml-research.de/english/research/nlp/download/wikitaxonomy.php
User: benz
Date: 2011-02-04T16:07:33+01:00
Tags: categories category_hierarchy dataset download hierarchy ontology taxonomy wikipedia

Online Social Networks Research @MPI-SWS
URL: http://socialnetworks.mpi-sws.org/
User: benz
Date: 2011-02-04T16:07:28+01:00
Tags: dataset download misvlove social_network

Research
URL: http://www.p2p.tu-darmstadt.de/research/
User: benz
Date: 2011-02-04T16:07:27+01:00
Tags: dataset social_networks socialnetwork

Extracting Text from Wikipedia
URL: http://evanjones.ca/software/wikipedia2text.html
User: benz
Date: 2011-02-04T16:07:25+01:00
Tags: data dataset plain_text python text tool wiki wikipedia

Twitter data sets for download - Infochimps
URL: http://infochimps.org/tags/twitter
User: benz
Date: 2011-02-04T16:07:23+01:00
Tags: dataset download twitter