PUMA bookmarks for /tag/dataset
https://puma.uni-kassel.de/tag/dataset
PUMA RSS Feed for /tag/dataset

Social Spam Detection Benjamin Markines Ciro Cattuto Filippo Menczer
Description: Social Spam Detection
URL: http://givealink.org/Site/socialspam.html
Posted by: hotho, 2009-04-01T17:04:55+02:00
Tags: detection dataset classification bibsonomy spam

Yahoo datasets
URL: http://www.stanford.edu/class/cs345a/YahooData.pdf
Posted by: hotho, 2009-03-13T16:26:34+01:00
Tags: dataset yahoo

CoPhIR - COntent-based Photo Image Retrieval
URL: http://cophir.isti.cnr.it/
Posted by: hotho, 2009-03-03T15:25:25+01:00
Tags: audio dataset flickr ir multimedia search similarity

Tastes, Ties, and Time: Facebook data release | Berkman Center
Description: llaboration with Harvard sociology graduate stu
URL: http://cyber.law.harvard.edu/node/4682
Posted by: hotho, 2009-01-29T15:46:42+01:00
Tags: Facebook dataset

ICT - Information and Communication Theory Group
URL: http://ict.ewi.tudelft.nl/index.php?option=com_sections&id=178&Itemid=328
Posted by: hotho, 2009-01-19T21:22:47+01:00
Tags: dataset folksonomy librarything tagging

Public Data Sets on Amazon Web Services (AWS)
URL: http://aws.amazon.com/publicdatasets/
Posted by: hotho, 2009-01-06T18:07:54+01:00
Tags: amazon dataset ontology public

BibSonomy::faq
URL: http://www.bibsonomy.org/faq#faq-dataset-1
Posted by: stumme, 2008-11-28T11:01:10+01:00
Tags: bibsonomy dataset dump

ICWSM 2009 - International AAAI Conference on Weblogs and Social Media
URL: http://www.icwsm.org/2009/data/
Posted by: hotho, 2008-10-23T20:45:36+02:00
Tags: 2009 blog challenge conference data dataset social web

Some code and datasets
URL: http://www.kyb.mpg.de/bs/people/pgehler/code/index.html
Posted by: hotho, 2008-10-10T17:20:02+02:00
Tags: clustering code matlab plsa dataset

Show Us a Better Way: What public data is already available?
URL: http://www.showusabetterway.co.uk/call/data.html
Posted by: hotho, 2008-07-03T14:42:07+02:00
Tags: data dataset public

Web Community Dataset
URL: http://affsys.com/experiments/HT2008/
Posted by: hotho, 2008-06-21T20:33:47+02:00
Tags: community dataset ht08 hypertext08 web

David Lee's Bookmarks for Corpus-based Linguists
URL: http://devoted.to/corpora
Posted by: hotho, 2008-04-29T15:03:05+02:00
Tags: corpus dataset lecture nlp survey

Geoffrey Sampson: Downloadable Resources
URL: http://www.grsampson.net/Resources.html
Posted by: hotho, 2008-04-29T12:09:45+02:00
Tags: corpus dataset lecture nlp tm

Linguist List - Web Resource Listings
URL: http://www.linguistlist.org/sp/Texts.html
Posted by: hotho, 2008-04-29T12:06:42+02:00
Tags: corpus dataset lecture nlp

Home Page for 20 Newsgroups Data Set
Description: The 20 Newsgroups data set
URL: http://people.csail.mit.edu/jrennie/20Newsgroups/
Posted by: hotho, 2008-04-12T15:32:30+02:00
Tags: 20 dataset newsgroups text

20 Newsgroups
Description:
  Abstract
  This data set consists of 20000 messages taken from 20 Usenet newsgroups.
  Information files:
  description of the data
  Data files:
  20_newsgroups.tar.gz (17.3M; 61.6M uncompressed)
  mini_newsgroups.tar.gz A subset composed of 100 articles from each newsgroup. (1.9M; 6.2M uncompressed)
URL: http://kdd.ics.uci.edu/databases/20newsgroups/20newsgroups.html
Posted by: hotho, 2008-04-12T15:32:12+02:00
Tags: 20 dataset newsgroups text

Trust network datasets - TrustLet
URL: http://www.trustlet.org/wiki/Trust_network_datasets
Posted by: hotho, 2008-02-14T09:48:49+01:00
Tags: dataset network

Google Research Home
URL: http://research.google.com/
Posted by: hotho, 2008-01-22T10:27:09+01:00
Tags: data dataset google research

LETOR: Benchmark Datasets for Learning to Rank
URL: http://research.microsoft.com/users/tyliu/LETOR/
Posted by: hotho, 2008-01-01T13:56:17+01:00
Tags: benchmark dataset learning microsoft ranking

The QWS Dataset
URL: http://www.uoguelph.ca/~qmahmoud/qws/
Posted by: hotho, 2007-12-07T21:02:40+01:00
Tags: answer dataset question semantic service web

Multilabel Classification
Description: Multi-Label Classification
URL: http://mlkd.csd.auth.gr/multilabel.html
Posted by: hotho, 2007-11-23T13:12:59+01:00
Tags: classification dataset extension multilabel text tools weka

Multext
URL: http://aune.lpl.univ-aix.fr/projects/multext/
Posted by: hotho, 2007-11-16T17:36:20+01:00
Tags: corpus dataset text

Index of /WBS/seb/datasets
URL: http://www.aifb.uni-karlsruhe.de/WBS/seb/datasets/
Posted by: hotho, 2007-09-20T12:10:48+02:00
Tags: dataset relation

Stanford Computer Science
URL: http://cs.stanford.edu/research/project.php?id=121
Posted by: hotho, 2007-07-19T01:31:59+02:00
Tags: crawl dataset web

Datasets
URL: http://www.yr-bcn.es/webspam/datasets/
Posted by: hotho, 2007-07-19T01:15:17+02:00
Tags: dataset detection spam webspam

Enron Email Dataset
URL: http://www.cs.cmu.edu/~enron/
Posted by: hotho, 2007-05-18T20:38:46+02:00
Tags: KI2007WebMining dataset email enron

ECML/PKDD Discovery Challenge 2006
URL: http://www.ecmlpkdd2006.org/challenge.html
Posted by: hotho, 2007-05-18T20:38:05+02:00
Tags: KI2007WebMining dataset detection email spam

LETOR: Benchmark Data Sets for Learning to Rank
URL: http://research.microsoft.com/research/downloads/details/22a1b3e9-c5c6-4cfe-86f9-1d2ea1c199e8/details.aspx
Posted by: hotho, 2007-04-17T09:15:32+02:00
Tags: benchmark dataset ranking

Web Information Retrieval / Natural Language Processing Group (WING) - NLP/IR resource page on aye
URL: http://wing.comp.nus.edu.sg/portal/RPNLPIR/
Posted by: hotho, 2007-03-23T15:16:48+01:00
Tags: dataset information ir nlp resource retrieval web

Researchers Yearn to Use AOL Logs, but They Hesitate - New York Times
URL: http://www.nytimes.com/2006/08/23/technology/23search.html?ei=5088&en=cc878412ed34dad0&ex=1313985600&partner=rssnyt&emc=rss&pagewanted=all
Posted by: hotho, 2007-02-19T12:49:31+01:00
Tags: presse dataset aol

Datasets from transcripts of US Congressional floor debates
Description: Congressional speech data
URL: http://www.cs.cornell.edu/home/llee/data/convote.html
Posted by: hotho, 2007-02-06T21:26:30+01:00
Tags: classification dataset text

comp.lang.perl.modules | Google Groups
URL: http://groups.google.com/group/comp.lang.perl.modules/browse_thread/thread/619db8926623c188/dd4500f068555338?lnk=st&q=perl+mysql+large+datasets&rnum=14&hl=en#dd4500f068555338
Posted by: hotho, 2007-02-01T10:41:52+01:00
Tags: perl large mysql dataset

ACM SIGKDD: Special Issue on Learning from Inbalanced Datasets
URL: http://www.acm.org/sigs/sigkdd/explorations/issue.php?volume=6&issue=1&year=2004&month=06
Posted by: hotho, 2007-01-28T16:19:49+01:00
Tags: data dataset inbalanced learning svm

Pajek / How to: Convert text file datasets into Pajek format
URL: http://vlado.fmf.uni-lj.si/pub/networks/pajek/howto/text2pajek.htm
Posted by: hotho, 2007-01-26T13:34:34+01:00
Tags: convert dataset pajek

CLUTO - Family of Data Clustering Software Tools | Karypis Lab
URL: http://glaros.dtc.umn.edu/gkhome/views/cluto
Posted by: hotho, 2006-10-25T09:25:47+02:00
Tags: clustering tools dataset dm ml

Learning Question Classifiers
URL: http://l2r.cs.uiuc.edu/~cogcomp/Data/QA/QC/
Posted by: hotho, 2006-10-11T10:27:47+02:00
Tags: qa classification dataset

AOL search data mirrors
Description: This collection consists of ~20M web queries collected from ~650k users over three months. The data is sorted by anonymous user ID and sequentially arranged.
URL: http://www.gregsadetsky.com/aol-data/
Posted by: hotho, 2006-10-07T11:43:25+02:00
Tags: search dataset

Netflix Prize: Home
URL: http://www.netflixprize.com/
Posted by: hotho, 2006-10-05T22:08:28+02:00
Tags: recommender movie dataset preis

Bibliography
Description: Imbalance Problem
URL: http://www.site.uottawa.ca/~nat/Research/class_imbalance_bibli.html
Posted by: hotho, 2006-09-19T12:09:43+02:00
Tags: data dataset paper imbalance

Trec Spam Corpus
URL: http://plg.uwaterloo.ca/~gvcormac/treccorpus/
Posted by: hotho, 2006-09-04T15:42:51+02:00
Tags: trec spam set data dataset corpus

Where's George? ® 2.2
URL: http://www.wheresgeorge.com/
Posted by: hotho, 2006-09-04T15:42:51+02:00
Tags: dollar dataset

Seuchen-Prognose: Forscher finden das Gesetz des Reisens - Wissenschaft - SPIEGEL ONLINE - Nachrichten (Epidemic forecasting: researchers find the law of travel - Science - SPIEGEL ONLINE - News)
URL: http://www.spiegel.de/wissenschaft/mensch/0,1518,397303,00.html
Posted by: hotho, 2006-09-04T15:42:51+02:00
Tags: bewegung dollar dataset reise vorhersagen

Miscellaneous MATLAB Software, Data, Tricks and Demonstrations
Description: Gunnar Raetsch's Benchmark Datasets
URL: http://theoval.sys.uea.ac.uk/matlab/default.html#benchmarks
Posted by: hotho, 2006-06-23T09:00:57+02:00
Tags: benchmark dataset dm matlab ml kernel

Algorithms for Large Data Sets: Lecture Notes & Slides
URL: http://www.ee.technion.ac.il/courses/049011/index_files/Page337.html
Posted by: hotho, 2006-06-23T07:42:47+02:00
Tags: folien ir large dataset

Benchmark Data Sets used in [RaeOnoMue01] and [MikRaeWesSchMue99]
URL: http://ida.first.fraunhofer.de/projects/bench/benchmarks.htm
Posted by: hotho, 2006-06-23T07:24:21+02:00
Tags: dataset dm ida ml

Datasets
URL: http://www.niaad.liacc.up.pt/old/statlog/datasets.html
Posted by: hotho, 2006-06-23T07:23:30+02:00
Tags: statlog dataset dm ml

UCI Machine Learning Repository
URL: http://www.ics.uci.edu/~mlearn/MLRepository.html
Posted by: hotho, 2006-06-23T07:18:45+02:00
Tags: learning data dataset dm mining machine ml uci

Delve Datasets
URL: http://www.cs.toronto.edu/~delve/data/datasets.html
Posted by: hotho, 2006-06-23T07:18:31+02:00
Tags: learning data delve dataset dm mining machine ml

Martin Hepp
URL: http://www.heppnetz.de/eclassowl/
Posted by: hotho, 2006-06-19T10:00:33+02:00
Tags: ontology dataset

Omega Ontology: Home
URL: http://omega.isi.edu/
Posted by: hotho, 2006-06-14T06:19:56+02:00
Tags: ontology omega dataset nlp

Welcome to the UCR Time Series Classification/Clustering Page
URL: http://www.cs.ucr.edu/~eamonn/time_series_data/
Posted by: hotho, 2006-06-02T18:24:45+02:00
Tags: dataset

HepCorpus - Sinai
URL: http://sinai.ujaen.es/wiki/index.php/HepCorpus#English_version
Posted by: hotho, 2006-05-29T15:53:16+02:00
Tags: text dataset corpus

Manuel Barbera, Corpus based computational linguistic resources. General: E-Texts (§ 2.3).
Description: Electronic Literary Text Archives.
URL: http://www.bmanuel.org/clr2_et.html
Posted by: hotho, 2006-05-26T08:21:51+02:00
Tags: text dataset corpus

dataset
URL: http://www.informatics.bangor.ac.uk/~kuncheva/activities/artificial_data.htm
Posted by: hotho, 2006-05-24T14:14:08+02:00
Tags: clustering dataset

Fundamental Clustering Problem Suite | Databionics
Description: Fundamental Clustering Problem Suite
URL: http://www.mathematik.uni-marburg.de/~databionics/en//?q=data
Posted by: hotho, 2006-05-24T14:13:36+02:00
Tags: clustering dataset

Andrew McCallum's Code and Data
Description: Cora Citation Matching [reference matching, object correspondence]. Text of citations hand-clustered into groups referring to the same paper.
URL: http://www.cs.umass.edu/~mccallum/code-data.html
Posted by: hotho, 2006-05-11T09:55:41+02:00
Tags: ie dataset bibliographic references cora

Lost Boy: SPARQLing the BBC Programme Catalogue
URL: http://www.ldodds.com/blog/archives/000272.html
Posted by: hotho, 2006-04-27T12:05:58+02:00
Tags: data dataset rdf

much.more
Description: A number of resources have been compiled within the context of the MuchMore project. These include: a bilingual, parallel medical corpus; corresponding queries and relevance assessments; evaluation sets of disambiguated terms for GermaNet and UMLS; an evaluation list for morphological decomposition of medical terms.
URL: http://muchmore.dfki.de/resources_index.htm
Posted by: hotho, 2006-04-07T10:58:58+02:00
Tags: dataset corpus

SourceForge.net: Files
Description: New text datasets (donated by George Forman) are available for download on Sourceforge:
URL: http://sourceforge.net/project/showfiles.php?group_id=5091&package_id=95362&release_id=399264
Posted by: hotho, 2006-03-07T08:26:04+01:00
Tags: weka text dataset

Obtaining corpora and text collections for biomedical natural language processing
URL: http://compbio.uchsc.edu/corpora/obtaining.shtml
Posted by: hotho, 2006-01-31T18:10:51+01:00
Tags: dataset nlp bio

Tagged datasets for named entity recognition tasks
URL: http://www.cs.technion.ac.il/~gabr/resources/data/ne_datasets.html
Posted by: hotho, 2006-01-31T15:41:07+01:00
Tags: named dataset entity nlp