Sciweavers

985 search results - page 161 / 197
» XML Information Retrieval - Achievements and Challenges
Sort
View
WWW
2009
ACM
14 years 9 months ago
Ranking community answers via analogical reasoning
Due to the lexical gap between questions and answers, automatically detecting right answers becomes very challenging for community question-answering sites. In this paper, we prop...
Xudong Tu, Xin-Jing Wang, Dan Feng, Lei Zhang
WWW
2006
ACM
14 years 9 months ago
Using graph matching techniques to wrap data from PDF documents
Wrapping is the process of navigating a data source, semiautomatically extracting data and transforming it into a form suitable for data processing applications. There are current...
Tamir Hassan, Robert Baumgartner
KDD
2007
ACM
206views Data Mining» more  KDD 2007»
14 years 9 months ago
Automatic labeling of multinomial topic models
Multinomial distributions over words are frequently used to model topics in text collections. A common, major challenge in applying all such topic models to any text mining proble...
Qiaozhu Mei, Xuehua Shen, ChengXiang Zhai
KDD
2007
ACM
139views Data Mining» more  KDD 2007»
14 years 9 months ago
Raising the baseline for high-precision text classifiers
Many important application areas of text classifiers demand high precision and it is common to compare prospective solutions to the performance of Naive Bayes. This baseline is us...
Aleksander Kolcz, Wen-tau Yih
KDD
2002
ACM
186views Data Mining» more  KDD 2002»
14 years 9 months ago
Topic-conditioned novelty detection
Automated detection of the first document reporting each new event in temporally-sequenced streams of documents is an open challenge. In this paper we propose a new approach which...
Yiming Yang, Jian Zhang, Jaime G. Carbonell, Chun ...