%0 %0 Conference Proceedings %A Atzmueller, Martin; Lemmerich, Florian; Krause, Beate & Hotho, Andreas %D 2009 %T Who are the Spammers? Understandable Local Patterns for Concept Description %E %B 7th Conference on Computer Methods and Systems %C Krakow, Poland %I %V %6 %N %P %& %Y %S %7 %8 November %9 %? %! %Z %@ %( %) %* %L %M %1 %2 %3 inproceedings %4 %# %$ %F atze09 %K 2009, bibsonomy, discovery, dm, myown, spam, subgroup %X %Z ISBN 83-916420-5-4 %U http://www.cms.agh.edu.pl/ %+ %^ %0 %0 Conference Proceedings %A %D 2000 %T Web Usage Analysis and User Profiling, International WEBKDD'99 Workshop, San Diego, California, USA, August 15, 1999, Revised Papers %E Masand, Brij M. & Spiliopoulou, Myra %B WEBKDD %C %I Springer %V 1836 %6 %N %P %& %Y %S Lecture Notes in Computer Science %7 %8 %9 %? %! %Z %@ 3-540-67818-2 %( %) %* %L %M %1 %2 DBLP Record 'conf/kdd/1999web' %3 proceedings %4 %# %$ %F DBLP:conf/kdd/1999web %K KI2007WebMining, dm, kdd, mining, web, webkdd, workshop %X %Z %U %+ %^ %0 %0 Journal Article %A Hotho, Andreas; Ulslev Pedersen,, Rasmus & Wurst, Michael %D 2010 %T Ubiquitous Data %E %B Lecture Notes in Computer Science %C %I Springer %V %6 %N 6202 %P 61--74 %& %Y %S %7 %8 %9 %? %! %Z %@ 0302-9743 %( %) %* %L %M %1 %2 %3 article %4 %# %$ %F hotho2010ubiquitous %K 2010, data, dm, mining, myown, social, ubiquitous %X %Z %U http://rd.springer.com/content/pdf/10.1007%2F978-3-642-16392-0_4.pdf %+ %^ %0 %0 Conference Proceedings %A Atzmueller, Martin; Lemmerich, Florian; Krause, Beate & Hotho, Andreas %D 2009 %T Towards Understanding Spammers - Discovering Local Patterns for Concept Characterization and Description %E Knobbe, Johannes F\"urnkranz Arno %B Proc. LeGo-09: From Local Patterns to Global Models, Workshop at the 2009 European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases %C %I %V %6 %N %P %& %Y %S %7 %8 %9 %? %! %Z %@ %( %) %* %L %M %1 %2 %3 inproceedings %4 %# %$ %F ALKH:09 %K 2009, bibsonomy, discovery, dm, myown, spam, subgroup %X %Z accepted %U http://www.ke.tu-darmstadt.de/events/LeGo-09/04-Atzmueller.pdf %+ %^ %0 %0 Journal Article %A Baeza-Yates, Ricardo; Calderón-Benavides, Liliana & González-Caro, Cristina %D 2006 %T The Intention Behind Web Queries %E %B String Processing and Information Retrieval %C %I %V %6 %N %P 98--109 %& %Y %S %7 %8 %9 %? %! %Z %@ %( %) %* %L %M %1 %2 SpringerLink - Buchkapitel %3 article %4 %# %$ %F keyhere %K analysis, dm, intention, ml, query, search, toread %X The identification of the user’s intention or interest through queries that they submit to a search engine can be very useful to offer them more adequate results. In this work we present a framework for the identification of user’s interest in an automaticway, based on the analysis of query logs. This identification is made from two perspectives, the objectives or goals of auser and the categories in which these aims are situated. A manual classification of the queries was made in order to havea reference point and then we applied supervised and unsupervised learning techniques. The results obtained show that fora considerable amount of cases supervised learning is a good option, however through unsupervised learning we found relationshipsbetween users and behaviors that are not easy to detect just taking the query words. Also, through unsupervised learning weestablished that there are categories that we are not able to determine in contrast with other classes that were not consideredbut naturally appear after the clustering process. This allowed us to establish that the combination of supervised and unsupervisedlearning is a good alternative to find user’s goals. From supervised learning we can identify the user interest given certainestablished goals and categories; on the other hand, with unsupervised learning we can validate the goals and categories used,refine them and select the most appropriate to the user’s needs. %Z %U http://dx.doi.org/10.1007/11880561_9 %+ %^ %0 %0 Conference Proceedings %A Krause, Beate; Schmitz, Christoph; Hotho, Andreas & Stumme, Gerd %D 2008 %T The Anti-Social Tagger - Detecting Spam in Social Bookmarking Systems %E %B AIRWeb '08: Proceedings of the 4th international workshop on Adversarial information retrieval on the web %C New York, NY, USA %I ACM %V %6 %N %P 61--68 %& %Y %S %7 %8 %9 %? %! %Z %@ 978-1-60558-159-0 %( %) %* %L %M %1 %2 %3 inproceedings %4 %# %$ %F anti2008krause %K 2008, bookmarking, classification, dm, folksonomy, mining, ml, myown, social, spam %X %Z %U http://airweb.cse.lehigh.edu/2008/submissions/krause_2008_anti_social_tagger.pdf %+ %^ %0 %0 Book %A Weiss, Sholom M.; Indurkhya, Nitin & Zhang, T. %D 2004 %T Text Mining. Predictive Methods for Analyzing Unstructured Information %E %B %C %I Springer, Berlin %V %6 %N %P %& %Y %S %7 1 %8 %9 %? %! %Z %@ 0387954333 %( %) %* %L %M %1 %2 Amazon.de: Text Mining. Predictive Methods for Analyzing Unstructured Information: Sholom M. Weiss,Nitin Indurkhya,T. Zhang: English Books %3 book %4 %# %$ %F 0387954333 %K dm, mining, nlp, software, text, tm %X %Z %U http://www.amazon.de/gp/redirect.html%3FASIN=0387954333%26tag=ws%26lcode=xm2%26cID=2025%26ccmID=165953%26location=/o/ASIN/0387954333%253FSubscriptionId=13CT5CVB80YFWJEPWS02 %+ %^ %0 %0 Conference Proceedings %A Bullock, Beate Navarro; Jäschke, Robert & Hotho, Andreas %D 2011 %T Tagging data as implicit feedback for learning-to-rank %E %B Proceedings of the ACM WebSci'11 %C %I %V %6 %N %P %& %Y %S %7 %8 June %9 %? %! %Z %@ %( %) %* %L %M %1 %2 Poster in the Web Science Repository %3 inproceedings %4 %# %$ %F bullock2011tagging %K 2011, dm, feedback, learning, logsonomy, ml, myown, search, social %X %Z %U http://journal.webscience.org/463/ %+ %^ %0 %0 Book %A Thrun, Sebastian; Burgard, Wolfram & Fox, Dieter %D 2001 %T Probabilistic Robotics (Intelligent Robotics and Autonomous Agents) %E %B %C %I %V %6 %N %P %& %Y %S %7 %8 %9 %? %! %Z %@ %( %) %* %L %M %1 %2 Amazon.com: Probabilistic Robotics (Intelligent Robotics and Autonomous Agents): Books: Sebastian Thrun,Wolfram Burgard,Dieter Fox %3 book %4 %# %$ %F thrun2001 %K dm, learning, machine, ml, probabilistic %X %Z %U http://www.amazon.com/Probabilistic-Robotics-Intelligent-Autonomous-Agents/dp/0262201623/ref=sr_11_1/105-3361811-4085215?ie=UTF8&qid=1190743235&sr=11-1 %+ %^ %0 %0 Journal Article %A Flajolet, Philippe & Martin, G. Nigel %D 1985 %T Probabilistic Counting Algorithms for Data Base Applications %E %B Journal of Computer and System Sciences %C %I %V 31 %6 %N 2 %P 182-209 %& %Y %S %7 %8 %9 %? %! %Z %@ %( %) %* %L %M %1 %2 Probabilistic Counting Algorithms for Data Base Applications - Flajolet, Martin (ResearchIndex) %3 article %4 %# %$ %F flajolet85probabilistic %K association, counting, dm, kdubiq, rule, toread %X %Z %U http://citeseer.ist.psu.edu/flajolet85probabilistic.html %+ %^ %0 %0 Book %A %D 2005 %T Ontology Learning from Text: Methods, Evaluation and Applications %E Buitelaar, Paul; Cimiano, Philipp & Magnini, Bernardo %B Frontiers in Artificial Intelligence %C %I IOS Press %V 123 %6 %N %P %& %Y %S %7 %8 JUL %9 %? %! %Z %@ %( %) %* %L %M %1 %2 %3 book %4 %# %$ %F buitelaar05ontologylearningbook %K dm, learning, ml, ol, semantic, web %X %Z %U %+ %^ %0 %0 Book Section %A Baldi, Pierre; Frasconi, Paolo & Smyth, Padhraic %D 2003 %T Modeling the Internet and the Web: Probabilistic Methods and Algorithms %E %B Modeling the Internet and the Web: Probabilistic Methods and Algorithms %C %I Wiley %V %6 %N %P %& %Y %S %7 %8 April %9 %? %! %Z %@ %( %) %* %L %M %1 %2 %3 inbook %4 %# %$ %F baldi03modelling %K KI2007WebMining, dm, kdd, mining, ml, web %X Modeling the Internet and the Web covers the most important aspects of modeling the Web using a modern mathematical and probabilistic treatment. It focuses on the information and application layers, as well as some of the emerging properties of the Internet.  Provides a comprehensive introduction to the modeling of the Internet and the Web at the information level.  Takes a modern approach based on mathematical, probabilistic, and graphical modeling.  Provides an integrated presentation of theory, examples, exercises and applications.  Covers key topics such as text analysis, link analysis, crawling techniques, human behaviour, and commerce on the Web. Interdisciplinary in nature, Modeling the Internet and the Web will be of interest to students and researchers from a variety of disciplines including computer science, machine learning, engineering, statistics, economics, business, and the social sciences. %Z %U http://eu.wiley.com/WileyCDA/WileyTitle/productCd-0470849061.html %+ %^ %0 %0 Journal Article %A Song, Chaoming; Qu, Zehui; Blumm, Nicholas & Barabási, Albert-László %D 2010 %T Limits of Predictability in Human Mobility %E %B Science %C %I %V 327 %6 %N 5968 %P 1018-1021 %& %Y %S %7 %8 %9 %? %! %Z %@ %( %) %* %L %M %1 %2 %3 article %4 %# %$ %F Song19022010 %K dm, everyaware, position, prediction, toread, vorhersage %X A range of applications, from predicting the spread of human and electronic viruses to city planning and resource management in mobile communications, depend on our ability to foresee the whereabouts and mobility of individuals, raising a fundamental question: To what degree is human behavior predictable? Here we explore the limits of predictability in human dynamics by studying the mobility patterns of anonymized mobile phone users. By measuring the entropy of each individual’s trajectory, we find a 93% potential predictability in user mobility across the whole user base. Despite the significant differences in the travel patterns, we find a remarkable lack of variability in predictability, which is largely independent of the distance users cover on a regular basis. %Z %U http://www.sciencemag.org/content/327/5968/1018.abstract %+ %^ %0 %0 Journal Article %A Morstatter, Fred; \"u,rgen Pfeffer, J; Liu, Huan & Carley, Kathleen M %D 2013 %T Is the Sample Good Enough? Comparing Data from Twitter’s Streaming API with Twitter’s Firehose %E %B %C %I %V %6 %N %P %& %Y %S %7 %8 %9 %? %! %Z %@ %( %) %* %L %M %1 %2 %3 article %4 %# %$ %F morstatter2013sample %K comparing, dm, ml, sample, toread, twitters %X %Z %U http://scholar.google.de/scholar.bib?q=info:NkS2afIrqyQJ:scholar.google.com/&output=citation&hl=de&as_sdt=0,5&ct=citation&cd=0 %+ %^ %0 %0 Book %A %D 2007 %T From Web to Social Web: Discovering and Deploying User and Content Profiles %E Berendt, B.; Hotho, A.; Mladenic, D. & Semeraro, G. %B LNCS %C %I Springer %V 4736 %6 %N %P %& %Y %S %7 %8 %9 %? %! %Z %@ 978-3-540-74950-9 %( %) %* %L %M %1 %2 From Web to Social Web: Discovering and Deploying User and Cont... - Data Mi...Journals, Books & Online Media | Springer %3 book %4 %# %$ %F Berendt2007 %K 2007, data, dm, mining, myown, social, tm, web %X This book constitutes the refereed proceedings of the Workshop on Web Mining, WebMine 2006, held in Berlin, Germany, September 18th, 2006. Topics included are data mining based on analysis of bloggers and tagging, web mining, XML mining and further techniques of knowledge discovery. The book is especially valuable for those interested in the aspects of the Social Web (Web 2.0) and its inherent dynamic and diversity of user-generated content. %Z %U http://www.springer.com/dal/home?SGWID=1-102-22-173759307-0&changeHeader=true&referer=www.springeronline.com&SHORTCUT=www.springer.com/978-3-540-74950-9 %+ %^ %0 %0 Book Section %A Fayyad, Usama M.; Piatetsky-Shapiro, Gregory & Smyth, Padhraic %D 1996 %T From Data Mining to Knowledge Discovery: An Overview. %E %B Advances in Knowledge Discovery and Data Mining %C %I %V %6 %N %P 1-34 %& %Y %S %7 %8 %9 %? %! %Z %@ %( %) %* %L %M %1 %2 dblp %3 incollection %4 %# %$ %F books/mit/fayyadPSU96/FayyadPS96 %K definition, dm, kdd, ml %X %Z %U http://dblp.uni-trier.de/db/books/collections/fayyad96.html#FayyadPS96 %+ %^ %0 %0 Journal Article %A Wurst, Michael & Morik, Katharina %D 2007 %T Distributed feature extraction in a p2p setting: a case study %E %B Future Gener. Comput. Syst. %C %I Elsevier Science Publishers B. V. %V 23 %6 %N 1 %P 69--75 %& %Y %S %7 %8 %9 %? %! %Z %@ 0167-739X %( %) %* %L %M %1 %2 Distributed feature extraction in a p2p setting %3 article %4 %# %$ %F 1276056 %K 2.0, dm, kdubiq, mining, music, p2p, summerschool, tagging, taggingsurvey, web %X %Z %U http://portal.acm.org/citation.cfm?id=1276056 %+ %^ %0 %0 Conference Proceedings %A Balakrishnan, Hemant & Deo, Narsingh %D 2006 %T Discovering communities in complex networks. %E Menezes, Ronaldo %B ACM Southeast Regional Conference %C %I ACM %V %6 %N %P 280-285 %& %Y %S %7 %8 %9 %? %! %Z %@ 1-59593-315-8 %( %) %* %L %M %1 %2 dblp %3 inproceedings %4 conf/ACMse/2006 %# %$ %F conf/ACMse/BalakrishnanD06 %K clustering, communities, dm, graph, networks, soical, toread %X %Z %U http://www.cs.ucf.edu/csdept/faculty/deo/ACMSE-06.pdf %+ %^ %0 %0 Book %A Pyle, Dorian %D 1999 %T Data Preparation for Data Mining %E %B %C %I Morgan Kaufmann %V %6 %N %P %& %Y %S %7 %8 %9 %? %! %Z %@ 1-55860-529-0 %( %) %* %L %M %1 %2 dblp %3 book %4 %# %$ %F books/mk/Pyle99 %K data, dm, kdd, kdubiq, mining, preparation, wg4 %X %Z %U %+ %^ %0 %0 Journal Article %A Dhillon, Inderjit S.; Modha, Dharmendra S. & Spangler, W. Scott %D 2002 %T Class visualization of high-dimensional data with applications %E %B Computational Statistics \& Data Analysis %C %I %V 41 %6 %N 1 %P 59-90 %& %Y %S %7 %8 November %9 %? %! %Z %@ %( %) %* %L %M %1 %2 %3 article %4 %# %$ %F RePEc:eee:csdana:v:41:y:2002:i:1:p:59-90 %K alphaworks, cluster, clustering, dm, visualization %X No abstract is available for this item. %Z %U http://www.cs.utexas.edu/~inderjit/public_papers/csda.pdf %+ %^