Sciweavers

2497 search results - page 347 / 500
» A Partial-Repeatability Approach to Data Mining
WSDM
2010
ACM
199 views, Data Mining
16 years 1 month ago
A Sketch-Based Distance Oracle for Web-Scale Graphs
We study the fundamental problem of computing distances between nodes in large graphs such as the web graph and social networks. Our objective is to be able to answer distance que...
Atish Das Sarma, Sreenivas Gollapudi, Marc Najork,...
WSDM
2009
ACM
113 views, Data Mining
15 years 10 months ago
Time Will Tell: Leveraging Temporal Expressions in IR
Temporal expressions, such as "between 1992 and 2000", are frequent across many kinds of documents. Text retrieval, though, treats them as common terms, thus ignoring their inherent...
Irem Arikan, Srikanta J. Bedathur, Klaus Berberich
WSDM
2009
ACM
131 views, Data Mining
15 years 10 months ago
Diversifying search results
We study the problem of answering ambiguous web queries in a setting where there exists a taxonomy of information, and where both queries and documents may belong to more than one ...
Rakesh Agrawal, Sreenivas Gollapudi, Alan Halverso...
ICDM
2009
IEEE
165 views, Data Mining
15 years 10 months ago
Cross-Guided Clustering: Transfer of Relevant Supervision across Domains for Improved Clustering
Lack of supervision in clustering algorithms often leads to clusters that are not useful or interesting to human reviewers. We investigate if supervision can be automatically tr...
Indrajit Bhattacharya, Shantanu Godbole, Sachindra...
ICDM
2009
IEEE
109 views, Data Mining
15 years 10 months ago
Semi-naive Exploitation of One-Dependence Estimators
It is well known that the key to Bayesian classifier learning is balancing two important issues, that is, the exploration of attribute dependencies in high orders for ensu...
Nan Li, Yang Yu, Zhi-Hua Zhou