TY - CHAP AU - Allen, R. AU - Wu, Yejun A2 - Lim, Ee A2 - Foo, Schubert A2 - Khoo, Chris A2 - Chen, Hsinchun A2 - Fox, Edward A2 - Urs, Shalini A2 - Costantino, Thanos T1 - Generality of Texts T2 - Digital Libraries: People, Knowledge, and Technology PB - Springer CY - Berlin / Heidelberg PY - 2010/ VL - 2555 IS - SP - 111 EP - 116 UR - http://dx.doi.org/10.1007/3-540-36227-4_11 M3 - 10.1007/3-540-36227-4_11 KW - genta11 L1 - SN - N1 - SpringerLink - Abstract N1 - AB - When searching or browsing, a user may be looking for either very general information or very specific information. We explored predictors for characterizing the generality of six encyclopedia texts. We had human subjects rank-order the generality of the texts. We also developed statistics from analysis of word frequency and from comparison to a set of reference terms. We found a statistically significant relationship between the human ratings of text generality and our automatic measure. ER - TY - CONF AU - Cheng, Weiwei AU - Rademaker, Michaël AU - Baets, Bernard De AU - Hüllermeier, Eyke A2 - Balcázar, José L. A2 - Bonchi, Francesco A2 - Gionis, Aristides A2 - Sebag, Michèle T1 - Predicting Partial Orders: Ranking with Abstention. T2 - ECML/PKDD (1) PB - Springer CY - PY - 2010/ M2 - VL - 6321 IS - SP - 215 EP - 230 UR - http://dblp.uni-trier.de/db/conf/pkdd/pkdd2010-1.html#ChengRBH10 M3 - KW - abstention KW - partial_orders KW - genta11 KW - ranking L1 - SN - 978-3-642-15879-7 N1 - N1 - AB - ER - TY - CHAP AU - Haridas, Mandar AU - Caragea, Doina A2 - Meersman, Robert A2 - Dillon, Tharam A2 - Herrero, Pilar T1 - Exploring Wikipedia and DMoz as Knowledge Bases for Engineering a User Interests Hierarchy for Social Network Applications T2 - On the Move to Meaningful Internet Systems: OTM 2009 PB - Springer CY - Berlin / Heidelberg PY - 2009/ VL - 5871 IS - SP - 1238 EP - 1245 UR - http://dx.doi.org/10.1007/978-3-642-05151-7_35 M3 - 10.1007/978-3-642-05151-7_35 KW - dmoz KW - genta11 KW - hierarchy KW - taxonomy KW - wordnet KW - ol_web2.0 KW - data_wikis KW - methods_concepthierarchy L1 - SN - N1 - SpringerLink - Abstract N1 - AB - The outgrowth of social networks in the recent years has resulted in opportunities for interesting data mining problems, such as interest or friendship recommendations. A global ontology over the interests specified by the users of a social network is essential for accurate recommendations. We propose, evaluate and compare three approaches to engineering a hierarchical ontology over user interests. The proposed approaches make use of two popular knowledge bases, Wikipedia and Directory Mozilla, to extract interest definitions and/or relationships between interests. More precisely, the first approach uses Wikipedia to find interest definitions, the latent semantic analysis technique to measure the similarity between interests based on their definitions, and an agglomerative clustering algorithm to group similar interests into higher level concepts. The second approach uses the Wikipedia Category Graph to extract relationships between interests, while the third approach uses Directory Mozilla to extract relationships between interests. Our results show that the third approach, although the simplest, is the most effective for building a hierarchy over user interests. ER - TY - JOUR AU - Landia, N. AU - Anand, S.S. T1 - Personalised Tag Recommendation JO - Recommender Systems & the Social Web PY - 2009/ VL - IS - SP - EP - UR - http://scholar.google.de/scholar.bib?q=info:GhmAwnjPyP0J:scholar.google.com/&output=citation&hl=de&as_sdt=2000&ct=citation&cd=0 M3 - KW - tag_generality KW - tag_recommender KW - genta11 KW - toread L1 - SN - N1 - N1 - AB - ER - TY - CONF AU - Yang, Hui AU - Callan, Jamie A2 - T1 - A Metric-based Framework for Automatic Taxonomy Induction T2 - Proceedings of the 47th Annual Meeting of the Association for Computational Linguistics (ACL2009) PB - CY - Singapore PY - 2009/08 M2 - VL - IS - SP - EP - UR - M3 - KW - genta11 KW - taxonomy_learning L1 - SN - N1 - N1 - AB - ER - TY - JOUR AU - Rayson, Paul T1 - From key words to key semantic domains JO - International Journal of Corpus Linguistics PY - December 2008/ VL - 13 IS - SP - 519 EP - 549(31) UR - http://www.ingentaconnect.com/content/jbp/ijcl/2008/00000013/00000004/art00005 M3 - 10.1075/ijcl.13.4.06ray KW - ol_web2.0 KW - semantics KW - genta11 KW - linguistics L1 - SN - N1 - ingentaconnect From key words to key semantic domains N1 - AB - This paper reports the extension of the key words method for the comparison of corpora. Using automatic tagging software that assigns part-of-speech and semantic field (domain) tags, a method is described which permits the extraction of key domains by applying the keyness calculation to tag frequency lists. The combination of the key words and key domains methods is shown to allow macroscopic analysis (the study of the characteristics of whole texts or varieties of language) to inform the microscopic level (focussing on the use of a particular linguistic feature) and thereby suggesting those linguistic features which should be investigated further. The resulting 'data-driven' approach presented here combines elements of both the 'corpus-based' and 'corpus-driven' paradigms in corpus linguistics. A web-based tool, Wmatrix, implementing the proposed method is applied in a case study: the comparison of UK 2001 general election manifestos of the Labour and Liberal Democratic parties. ER - TY - JOUR AU - Rayson, Paul T1 - From key words to key semantic domains JO - International Journal of Corpus Linguistics PY - 2008/ VL - 13 IS - SP - 519 EP - 549(31) UR - http://www.ingentaconnect.com/content/jbp/ijcl/2008/00000013/00000004/art00005 M3 - 10.1075/ijcl.13.4.06ray KW - background KW - genta11 KW - linguistics KW - ol_web2.0 KW - semantics L1 - SN - N1 - ingentaconnect From key words to key semantic domains N1 - AB - This paper reports the extension of the key words method for the comparison of corpora. Using automatic tagging software that assigns part-of-speech and semantic field (domain) tags, a method is described which permits the extraction of key domains by applying the keyness calculation to tag frequency lists. The combination of the key words and key domains methods is shown to allow macroscopic analysis (the study of the characteristics of whole texts or varieties of language) to inform the microscopic level (focussing on the use of a particular linguistic feature) and thereby suggesting those linguistic features which should be investigated further. The resulting 'data-driven' approach presented here combines elements of both the 'corpus-based' and 'corpus-driven' paradigms in corpus linguistics. A web-based tool, Wmatrix, implementing the proposed method is applied in a case study: the comparison of UK 2001 general election manifestos of the Labour and Liberal Democratic parties. ER - TY - CONF AU - Yan, Xin AU - Li, Xue AU - Song, Dawei A2 - T1 - Document generality: its computation for ranking T2 - ADC '06: Proceedings of the 17th Australasian Database Conference PB - Australian Computer Society, Inc. CY - Darlinghurst, Australia, Australia PY - 2006/ M2 - VL - IS - SP - 109 EP - 118 UR - http://portal.acm.org/citation.cfm?id=1151748 M3 - KW - genta11 L1 - SN - 1-920682-31-7 N1 - N1 - AB - The increased variety of information makes it critical to retrieve documents which are not only relevant but also broad enough to cover as many different aspects of a certain topic as possible. The increased variety of users also makes it critical to retrieve documents that are jargon free and easy-to-understand rather than the specific technical materials. In this paper, we propose a new concept namely document generality computation. Generality of document is of fundamental importance to information retrieval. Document generality is the state or quality of document being general. We compute document generality based on a domain-ontology method that analyzes scope and semantic cohesion of concepts appeared in the text. For test purposes, our proposed approach is then applied to improving the performance of document ranking in bio-medical information retrieval. The retrieved documents are re-ranked by a combined score of similarity and the closeness of documents' generality to that of a query. The experiments have shown that our method can work on a large scale bio-medical text corpus OHSUMED (Hersh, Buckley, Leone & Hickam 1994), which is a subset of MED-LINE collection containing of 348,566 medical journal references and 101 test queries, with an encouraging performance. ER - TY - JOUR AU - Clark, J.M. AU - Paivio, A. T1 - Extensions of the Paivio, Yuille, and Madigan (1968) norms JO - Behavior Research Methods, Instruments, & Computers PY - 2004/ VL - 36 IS - 3 SP - EP - UR - http://scholar.google.de/scholar.bib?q=info:CzAMgvcRAbwJ:scholar.google.com/&output=citation&hl=de&as_sdt=2000&as_vis=1&ct=citation&cd=0 M3 - KW - a_strange_tag KW - psychology KW - generality KW - genta11 L1 - SN - N1 - N1 - AB - ER - TY - JOUR AU - Talavera, Luis AU - Béjar, Javier T1 - Generality-Based Conceptual Clustering with Probabilistic Concepts JO - IEEE Transactions on Pattern Analysis and Machine Intelligence PY - 2001/ VL - 23 IS - 2 SP - 196 EP - 206 UR - http://www.computer.org/portal/web/csdl/doi/10.1109/34.908969 M3 - 10.1109/34.908969, KW - generality KW - genta11 KW - clustering L1 - SN - N1 - Generality-Based Conceptual Clustering with Probabilistic Concepts N1 - AB - ER - TY - JOUR AU - Breland, H.M. T1 - WORD FREQUENCY AND WORD DIFFICULTY JO - Psychological Science PY - 1996/ VL - 7 IS - 2 SP - 96 EP - 99 UR - http://scholar.google.de/scholar.bib?q=info:wgQfeigm3EgJ:scholar.google.com/&output=citation&hl=de&as_sdt=2000&ct=citation&cd=0 M3 - KW - genta11 KW - word_frequency L1 - SN - N1 - N1 - AB - ER - TY - CONF AU - Lin, Chin-Yew A2 - T1 - Knowledge-based automatic topic identification T2 - Proceedings of the 33rd annual meeting on Association for Computational Linguistics PB - Association for Computational Linguistics CY - Morristown, NJ, USA PY - 1995/ M2 - VL - IS - SP - 308 EP - 310 UR - http://dx.doi.org/10.3115/981658.981705 M3 - http://dx.doi.org/10.3115/981658.981705 KW - concept_generality KW - genta11 L1 - SN - N1 - N1 - AB - ER - TY - JOUR AU - Campos, A. AU - González, M.A. T1 - Imagery, concreteness, emotionality, and meaningfulness values of words: replication and extension. JO - Perceptual and motor skills PY - 1992/ VL - 74 IS - 3 Pt 1 SP - EP - UR - http://scholar.google.de/scholar.bib?q=info:yz8nXOXQIlcJ:scholar.google.com/&output=citation&hl=de&as_sdt=2000&as_vis=1&ct=citation&cd=0 M3 - KW - genta11 L1 - SN - N1 - N1 - AB - ER - TY - JOUR AU - Tanaka, James W. AU - Taylor, Marjorie T1 - Object categories and expertise: Is the basic level in the eye of the beholder? JO - Cognitive Psychology PY - 1991/07 VL - 23 IS - 3 SP - 457 EP - 482 UR - http://dx.doi.org.arugula.cc.columbia.edu:2048/10.1016/0010-0285(91)90016-H M3 - 10.1016/0010-0285(91)90016-H KW - genta11 L1 - SN - N1 - CiteULike: Object categories and expertise: Is the basic level in the eye of the beholder? N1 - AB - Classic research on conceptual hierarchies has shown that the interaction between the human perceiver and objects in the environment specifies one level of abstraction for categorizing objects, called the basic level, which plays a primary role in cognition. The question of whether the special psychological status of the basic level can be modified by experience was addressed in three experiments comparing the performance of subjects in expert and novice domains. The main findings were that in the domain of expertise (a) subordinate-level categories were as differentiated as the basic-level categories, (b) subordinate-level names were used as frequently as basic-level names for identifying objects, and (c) subordinate-level categorizations were as fast as basic-level categorizations. Taken together, these results demonstrate that individual differences in domain-specific knowledge affect the extent that the basic level is central to categorization. ER - TY - JOUR AU - Kammann, Richard AU - Streeter, Lynn T1 - Two meanings of word abstractness JO - Journal of Verbal Learning and Verbal Behavior PY - 1971/ VL - 10 IS - 3 SP - 303 EP - 306 UR - http://www.sciencedirect.com/science/article/B7MD4-4H3SDH5-C/2/4cd1d130114a240f69c90eec2391272d M3 - 10.1016/S0022-5371(71)80058-0 KW - genta11 L1 - SN - N1 - ScienceDirect - Journal of Verbal Learning and Verbal Behavior : Two meanings of word abstractness N1 - AB - Word abstractness has been defined in terms of hierarchical superordination or empirical ratings based on accessibility to the senses. Since a high-level superordinate (a generic term) should not be accessible to the senses, the two definitions should be correlated. Four Ss constructed word hierarchies from a pool of 925 nouns. Neither the size of a patriarch's hierarchy, nor its status as a superordinate was noticeably predictive of its abstractness rating, while its particular hierarchy membership was. The two definitions of abstractness appear to be mostly orthogonal. Subjects appear to rate the abstractness of a generic noun in terms of the abstractness of its exemplars. ER - TY - BOOK AU - Brown, Roger A2 - T1 - Words and Things PB - Free Press AD - PY - 1968/ VL - IS - SP - EP - UR - http://www.amazon.com/Words-Things-Roger-Brown/dp/0029048109 M3 - KW - genta11 L1 - SN - 0029048109 N1 - Amazon.com: Words and Things (9780029048108): Roger Brown: Books N1 - AB - ER - TY - JOUR AU - PAIVIO, ALLAN AU - YUILLE, JOHN C. AU - MADIGAN, STEPHEN A. T1 - CONCRETENESS, IMAGERY, AND MEANINGFULNESS VALUES FOR 925 NOUNS JO - Journal of Experimental Psychology PY - 1968/ VL - 76 IS - 1, Part 2 SP - 1 EP - 25 UR - http://www.sciencedirect.com/science/article/B8JB9-4NRM2J8-1/2/e3f0be65c21d3635cee939cc4d6f4d92 M3 - 10.1037/h0025327 KW - genta11 L1 - SN - N1 - ScienceDirect - Journal of Experimental Psychology : CONCRETENESS, IMAGERY, AND MEANINGFULNESS VALUES FOR 925 NOUNS N1 - AB - GROUPS OF SS, 17-46 YR. OLD COLLEGE STUDENTS, WERE USED TO SCALE 925 NOUNS ON ABSTRACTNESS-CONCRETENESS (C), IMAGERY (I), AND MEANINGFULNESS (M). CONCRETENESS WAS DEFINED IN TERMS OF DIRECTNESS OF REFERENCE TO SENSE EXPERIENCE, AND I, IN TERMS OF WORD'S CAPACITY TO AROUSE NONVERBAL IMAGES; C AND I WERE RATED ON 7-POINT SCALES. MEANINGFULNESS WAS DEFINED IN TERMS OF THE MEAN NUMBER OF WRITTEN ASSOCIATIONS IN 30 SEC. THE MEAN SCALE VALUES FOR THESE VARIABLES ARE PRESENTED FOR EACH OF THE 925 NOUNS. ALSO REPORTED ARE THE INTERCORRELATIONS OF THE VARIABLES, TOGETHER WITH AN EXAMINATION OF THE WORDS FOR WHICH C, I, AND M VALUES ARE MOST CLEARLY DIFFERENTIATED; AND RELIABILITY DATA, INCLUDING COMPARISONS WITH SCALE VALUES FOR THE VARIABLES FROM OTHER STUDIES. (45 REF.) (PsycINFO Database Record (c) 2006 APA, all rights reserved) ER - TY - JOUR AU - Paivio, Allan AU - Yuille, John C. T1 - Word abstractness and meaningfulness, and paired-associate learning in children JO - Journal of Experimental Child Psychology PY - 1966/ VL - 4 IS - 1 SP - 81 EP - 89 UR - http://www.sciencedirect.com/science/article/B6WJ9-4D706XW-KC/2/97aaad65d1601b8dd62be7558e3ac34f M3 - 10.1016/0022-0965(66)90052-X KW - genta11 L1 - SN - N1 - ScienceDirect - Journal of Experimental Child Psychology : Word abstractness and meaningfulness, and paired-associate learning in children*1 N1 - AB - Research with adults has shown that paired-associate (PA) learning of nouns, with abstractness-concreteness of the words simultaneously varied on both sides of pairs, is facilitated by concreteness, and this effect is greater on the stimulus than on the response side. The problem was investigated further in the present study with fourth-, sixth-, and eighth-grade children. Since concreteness has been found to correlate with meaningfulness (m), data were first obtained on the m of 32 concrete and 32 abstract, high-frequency nouns. At all three grade levels, the m of concrete nouns was higher than that of abstract nouns, and the words significantly retained their m rank across grades. Four comparable versions of a 16-pair list were constructed from 32 of the nouns, each list including 4 pairs of each possible S-R combination, i.e., concrete-concrete, concrete-abstract, abstract-concrete, and abstract-abstract. Groups of Sa were auditorially presented 4 alternating study trials and recall trials with a list. Analysis of the recall scores for Ss from each of three schools showed that recall increased with grade, and that positive effects of concreteness were generally greater on the stimulus than on the response side of pairs. The differential effect favoring stimulus over response concreteness was, however, smaller than in the earlier research with adults, and somewhat inconsistent across schools. ER - TY - JOUR AU - Hall, J.F. T1 - Learning as a function of word-frequency JO - The American Journal of Psychology PY - 1954/ VL - 67 IS - 1 SP - 138 EP - 140 UR - http://scholar.google.de/scholar.bib?q=info:R5OP8KAzPowJ:scholar.google.com/&output=citation&hl=de&as_sdt=2000&ct=citation&cd=28 M3 - KW - genta11 KW - word_frequency L1 - SN - N1 - N1 - AB - ER -