WSDM
2010
ACM
188views Data Mining» more  WSDM 2010»
14 years 6 months ago
Anatomy of the Long Tail: Ordinary People with Extraordinary Tastes
The success of "infinite-inventory" retailers such as Amazon.com and Netflix has been ascribed to a "long tail" phenomenon. To wit, while the majority of their...
Andrei Z. Broder, Bo Pang, Evgeniy Gabrilovich, Sh...
WSDM
2010
ACM
Corroborating Information from Disagreeing Views
We consider a set of views stating possibly conflicting facts. Negative facts in the views may come, e.g., from functional dependencies in the underlying database schema. We want ...
Alban Galland, Serge Abiteboul, Amélie Mari...
WSDM
2010
ACM
On Compressing the Textual Web
Paolo Ferragina, Giovanni Manzini
WSDM
2010
ACM
Leveraging Temporal Dynamics of Document Content in Relevance Ranking
Many web documents are dynamic, with content changing in varying amounts at varying frequencies. However, current document search algorithms have a static view of the document con...
Jonathan L. Elsas, Susan T. Dumais
WSDM
2010
ACM
Adapting Information Bottleneck Method for Automatic Construction of Domain-oriented Sentiment Lexicon
Domain-oriented sentiment lexicons are widely used for fine-grained sentiment analysis on reviews; therefore, the automatic construction of domain-oriented sentiment lexicon is a f...
Songbo Tan, Weifu Du, Xiaochun Yun, Xueqi Cheng
WSDM
2010
ACM
Towards Recency Ranking in Web Search
In web search, recency ranking refers to ranking documents by relevance which takes freshness into account. In this paper, we propose a retrieval system which automatically detect...
Anlei Dong, Yi Chang, Zhaohui Zheng, Gilad Mishne,...
WSDM
2010
ACM
Query Reformulation Using Anchor Text
Query reformulation techniques based on query logs have been studied as a method of capturing user intent and improving retrieval effectiveness. The evaluation of these techniques...
Van Dang, Bruce W. Croft
WSDM
2010
ACM
Personalized Click Prediction in Sponsored Search
Sponsored search is a multi-billion dollar business that generates most of the revenue for search engines. Predicting the probability that users click on ads is crucial to sponsor...
Erick Cantú-Paz, Haibin Cheng
WSDM
2010
ACM
Measuring the Reusability of Test Collections
While test collection construction is a time-consuming and expensive process, the true cost is amortized by reusing the collection over hundreds or thousands of experiments. Some ...
Ben Carterette, Evgeniy Gabrilovich, Vanja Josifov...
WSDM
2010
ACM
Coupled Semi-Supervised Learning for Information Extraction
Andrew Carlson, Justin Betteridge, Richard C. Wang...