QuickSearch:   Number of matching entries: 0.

Search Settings

    AuthorTitleYearJournal/ProceedingsReftypeDOI/URL
    Crane, G. What Do You Do with a Million Books? 2006 D-Lib Magazine
    Vol. 12(3) 
    article DOI URL 
    BibTeX:
    @article{march06crane,
      author = {Crane, Gregory},
      title = {What Do You Do with a Million Books?},
      journal = {D-Lib Magazine},
      year = {2006},
      volume = {12},
      number = {3},
      url = {http://www.dlib.org/dlib/march06/crane/03crane.html},
      doi = {http://dx.doi.org/10.1045/march2006-crane}
    }
    
    Li, X., Thelwall, M. & Giustini, D. Validating online reference managers for scholarly impact measurement 2012 Scientometrics
    Vol. 91(2), pp. 461-471 
    article DOI URL 
    Abstract: This paper investigates whether CiteULike and Mendeley are useful for measuring scholarly influence, using a sample of 1,613 papers published in Nature and Science in 2007. Traditional citation counts from the Web of Science (WoS) were used as benchmarks to compare with the number of users who bookmarked the articles in one of the two free online reference manager sites. Statistically significant correlations were found between the user counts and the corresponding WoS citation counts, suggesting that this type of influence is related in some way to traditional citation-based scholarly impact but the number of users of these systems seems to be still too small for them to challenge traditional citation indexes.
    BibTeX:
    @article{li2012validating,
      author = {Li, Xuemei and Thelwall, Mike and Giustini, Dean},
      title = {Validating online reference managers for scholarly impact measurement},
      journal = {Scientometrics},
      publisher = {Springer Netherlands},
      year = {2012},
      volume = {91},
      number = {2},
      pages = {461-471},
      url = {http://dx.doi.org/10.1007/s11192-011-0580-x},
      doi = {http://dx.doi.org/10.1007/s11192-011-0580-x}
    }
    
    Haveliwala, T. & Kamvar, S. The second eigenvalue of the Google matrix 2003   misc URL 
    BibTeX:
    @misc{haveliwala03second,
      author = {Haveliwala, T. and Kamvar, S.},
      title = {The second eigenvalue of the Google matrix},
      year = {2003},
      url = {http://citeseer.ist.psu.edu/haveliwala03second.html}
    }
    
    Haveliwala, T. & Kamvar, S. The second eigenvalue of the Google matrix 2003   misc URL 
    BibTeX:
    @misc{haveliwala03second,
      author = {Haveliwala, T. and Kamvar, S.},
      title = {The second eigenvalue of the Google matrix},
      year = {2003},
      url = {citeseer.ist.psu.edu/haveliwala03second.html}
    }
    
    El Ahmad, A.S., Yan, J. & Tayara, M. The Robustness of Google CAPTCHAs 2011   techreport URL 
    Abstract: We report a novel attack on two CAPTCHAs that have been widely deployed on the Internet, one being Google's home design and the other acquired by Google (i.e. reCAPTCHA). With a minor change, our attack program also works well on the latest ReCAPTCHA version, which uses a new defence mechanism that was unknown to us when we designed our attack. This suggests that our attack works in a fundamental level. Our attack appears to be applicable to a whole family of text CAPTCHAs that build on top of the popular segmentation-resistant mechanism of "crowding character together" for security. Next, we propose a novel framework that guides the application of our well-tested security engineering methodology for evaluating CAPTCHA robustness, and we propose a new general principle for CAPTCHA design.
    BibTeX:
    @techreport{elahmad2011robustness,
      author = {El Ahmad, Ahmad S and Yan, Jeff and Tayara, Mohamad},
      title = {The Robustness of Google CAPTCHAs},
      year = {2011},
      url = {http://homepages.cs.ncl.ac.uk/jeff.yan/google.pdf}
    }
    
    Cilibrasi, R. & Vitanyi, P.M.B. The Google Similarity Distance 2007 IEEE Transactions on Knowledge and Data Engineering
    Vol. 19, pp. 370 
    article URL 
    Abstract: Words and phrases acquire meaning from the way they are used in society, from their relative semantics to other words and phrases. For computers the equivalent of `society' is `database,' and the equivalent of `use' is `way to search the database.' We present a new theory of similarity between words and phrases based on information distance and Kolmogorov complexity. To fix thoughts we use the world-wide-web as database, and Google as search engine. The method is also applicable to other search engines and databases. This theory is then applied to construct a method to automatically extract similarity, the Google similarity distance, of words and phrases from the world-wide-web using Google page counts. The world-wide-web is the largest database on earth, and the context information entered by millions of independent users averages out to provide automatic semantics of useful quality. We give applications in hierarchical clustering, classification, and language translation. We give examples to distinguish between colors and numbers, cluster names of paintings by 17th century Dutch masters and names of books by English novelists, the ability to understand emergencies, and primes, and we demonstrate the ability to do a simple automatic English-Spanish translation. Finally, we use the WordNet database as an objective baseline against which to judge the performance of our method. We conduct a massive randomized trial in binary classification using support vector machines to learn categories based on our Google distance, resulting in an a mean agreement of 87% with the expert crafted WordNet categories.
    BibTeX:
    @article{cilibrasi2007google,
      author = {Cilibrasi, Rudi and Vitanyi, Paul M. B.},
      title = {The Google Similarity Distance},
      journal = {IEEE Transactions on Knowledge and Data Engineering},
      year = {2007},
      volume = {19},
      pages = {370},
      url = {http://www.citebase.org/abstract?id=oai:arXiv.org:cs/0412098}
    }
    
    Brin, S. & Page, L. The anatomy of a large-scale hypertextual Web search engine 1998 Computer Networks and ISDN Systems
    Vol. 30(1-7), pp. 107-117 
    article URL 
    Abstract: In this paper, we present Google, a prototype of a large-scale search engine which makes heavy use of the structure present in hypertext. Google is designed to crawl and index the Web efficiently and produce much more satisfying search results than existing systems. The prototype with a full text and hyperlink database of at least 24 million pages is available at http://infolab.stanford.edu/~backrub/google.html To engineer a search engine is a challenging task. Search engines index tens to hundreds of millions of web pages involving a comparable number of distinct terms. They answer tens of millions of queries every day. Despite the importance of large-scale search engines on the web, very little academic research has been done on them. Furthermore, due to rapid advance in technology and web proliferation, creating a web search engine today is very different from three years ago. This paper provides an in-depth description of our large-scale web search engine -- the first such detailed public description w...
    BibTeX:
    @article{brin1998anatomy,
      author = {Brin, Sergey and Page, Lawrence},
      title = {The anatomy of a large-scale hypertextual Web search engine},
      journal = {Computer Networks and ISDN Systems},
      year = {1998},
      volume = {30},
      number = {1--7},
      pages = {107--117},
      url = {http://citeseer.ist.psu.edu/brin98anatomy.html}
    }
    
    Haley, M.R. Ranking top economics and finance journals using Microsoft academic search versus Google scholar: How does the new publish or perish option compare? 2014 Journal of the Association for Information Science and Technology
    Vol. 65(5), pp. 1079-1084 
    article DOI URL 
    Abstract: Recently, Harzing's Publish or Perish software was updated to include Microsoft Academic Search as a second citation database search option for computing various citation-based metrics. This article explores the new search option by scoring 50 top economics and finance journals and comparing them with the results obtained using the original Google Scholar-based search option. The new database delivers significantly smaller scores for all metrics, but the rank correlations across the two databases for the h-index, g-index, AWCR, and e-index are significantly correlated, especially when the time frame is restricted to more recent years. Comparisons are also made to the Article Influence score from eigenfactor.org and to the RePEc h-index, both of which adjust for journal-level self-citations.
    BibTeX:
    @article{haley2014ranking,
      author = {Haley, M. Ryan},
      title = {Ranking top economics and finance journals using Microsoft academic search versus Google scholar: How does the new publish or perish option compare?},
      journal = {Journal of the Association for Information Science and Technology},
      year = {2014},
      volume = {65},
      number = {5},
      pages = {1079--1084},
      url = {http://dx.doi.org/10.1002/asi.23080},
      doi = {http://dx.doi.org/10.1002/asi.23080}
    }
    
    López-Cózar, E.D., Robinson-García, N. & Torres-Salinas, D. Manipulating Google Scholar Citations and Google Scholar Metrics: simple, easy and tempting 2012   misc URL 
    Abstract: The launch of Google Scholar Citations and Google Scholar Metrics may provoke
    revolution in the research evaluation field as it places within every
    searchers reach tools that allow bibliometric measuring. In order to alert
    e research community over how easily one can manipulate the data and
    bliometric indicators offered by Google s products we present an experiment
    which we manipulate the Google Citations profiles of a research group
    rough the creation of false documents that cite their documents, and
    nsequently, the journals in which they have published modifying their H
    dex. For this purpose we created six documents authored by a faked author and
    uploaded them to a researcher s personal website under the University of
    anadas domain. The result of the experiment meant an increase of 774
    tations in 129 papers (six citations per paper) increasing the authors and
    urnals H index. We analyse the malicious effect this type of practices can
    use to Google Scholar Citations and Google Scholar Metrics. Finally, we
    nclude with several deliberations over the effects these malpractices may
    ve and the lack of control tools these tools offer
    BibTeX:
    @misc{lpezczar2012manipulating,
      author = {López-Cózar, Emilio Delgado and Robinson-García, Nicolás and Torres-Salinas, Daniel},
      title = {Manipulating Google Scholar Citations and Google Scholar Metrics:
    simple, easy and tempting}, year = {2012}, note = {cite arxiv:1212.0638Comment: 10 pages, 4 figures}, url = {http://arxiv.org/abs/1212.0638} }
    Schnabel, C. Keine Sperrung von Google durch Access-Provider 2008   article  
    BibTeX:
    @article{Schnabel2008g,
      author = {Schnabel, Christoph},
      title = {Keine Sperrung von Google durch Access-Provider},
      year = {2008}
    }
    
    Melnik, S., Gubarev, A., Long, J.J., Romer, G., Shivakumar, S., Tolton, M. & Vassilakis, T. Dremel: interactive analysis of web-scale datasets 2010 Proceedings of the VLDB Endowment
    Vol. 3(1-2), pp. 330-339 
    article URL 
    Abstract: Dremel is a scalable, interactive ad-hoc query system for analysis of read-only nested data. By combining multi-level execution trees and columnar data layout, it is capable of running aggregation queries over trillion-row tables in seconds. The system scales to thousands of CPUs and petabytes of data, and has thousands of users at Google. In this paper, we describe the architecture and implementation of Dremel, and explain how it complements MapReduce-based computing. We present a novel columnar storage representation for nested records and discuss experiments on few-thousand node instances of the system.
    BibTeX:
    @article{melnik2010dremel,
      author = {Melnik, Sergey and Gubarev, Andrey and Long, Jing Jing and Romer, Geoffrey and Shivakumar, Shiva and Tolton, Matt and Vassilakis, Theo},
      title = {Dremel: interactive analysis of web-scale datasets},
      journal = {Proceedings of the VLDB Endowment},
      publisher = {VLDB Endowment},
      year = {2010},
      volume = {3},
      number = {1-2},
      pages = {330--339},
      url = {http://dl.acm.org/citation.cfm?id=1920841.1920886}
    }
    
    Rahm, E. & Thor, A. Citation analysis of database publications 2005 SIGMOD Rec.
    Vol. 34(4), pp. 48-53 
    article DOI  
    BibTeX:
    @article{1107505,
      author = {Rahm, Erhard and Thor, Andreas},
      title = {Citation analysis of database publications},
      journal = {SIGMOD Rec.},
      publisher = {ACM Press},
      year = {2005},
      volume = {34},
      number = {4},
      pages = {48--53},
      doi = {http://doi.acm.org/10.1145/1107499.1107505}
    }
    

    Created by JabRef on 19/04/2024.