Sciweavers

1756 search results - page 254 / 352
» Mining Query Logs
Sort
View
SIGMOD
2004
ACM
100views Database» more  SIGMOD 2004»
14 years 10 months ago
Cost-Based Labeling of Groups of Mass Spectra
We make two main contributions in this paper. First, we motivate and introduce a novel class of data mining problems that arise in labeling a group of mass spectra, specifically f...
Lei Chen 0003, Zheng Huang, Raghu Ramakrishnan
WSDM
2010
ACM
199views Data Mining» more  WSDM 2010»
14 years 7 months ago
A Sketch-Based Distance Oracle for Web-Scale Graphs
We study the fundamental problem of computing distances between nodes in large graphs such as the web graph and social networks. Our objective is to be able to answer distance que...
Atish Das Sarma, Sreenivas Gollapudi, Marc Najork,...
WSDM
2009
ACM
113views Data Mining» more  WSDM 2009»
14 years 4 months ago
Time Will Tell: Leveraging Temporal Expressions in IR
Temporal expressions, such as between 1992 and 2000, are frequent across many kinds of documents. Text retrieval, though, treats them as common terms, thus ignoring their inherent...
Irem Arikan, Srikanta J. Bedathur, Klaus Berberich
ADMA
2009
Springer
142views Data Mining» more  ADMA 2009»
14 years 4 months ago
Crawling Deep Web Using a New Set Covering Algorithm
Abstract. Crawling the deep web often requires the selection of an appropriate set of queries so that they can cover most of the documents in the data source with low cost. This ca...
Yan Wang, Jianguo Lu, Jessica Chen
PKDD
2009
Springer
118views Data Mining» more  PKDD 2009»
14 years 4 months ago
Protein Identification from Tandem Mass Spectra with Probabilistic Language Modeling
This paper presents an interdisciplinary investigation of statistical information retrieval (IR) techniques for protein identification from tandem mass spectra, a challenging probl...
Yiming Yang, Abhay Harpale, Subramaniam Ganapathy