Sciweavers

» Image mining by content
KDD
2009
ACM
170 views, Data Mining
16 years 4 months ago
Genre-based decomposition of email class noise
Corruption of data by class-label noise is an important practical concern impacting many classification problems. Studies of data cleaning techniques often assume a uniform label ...
Aleksander Kolcz, Gordon V. Cormack
KDD
2002
ACM
147 views, Data Mining
16 years 4 months ago
A parallel learning algorithm for text classification
Text classification is the process of classifying documents into predefined categories based on their content. Existing supervised learning algorithms to automatically classify te...
Canasai Kruengkrai, Chuleerat Jaruskulchai
KDD
2002
ACM
169 views, Data Mining
16 years 4 months ago
A Framework for Customizable Sports Video Management and Retrieval
Several domain-specific approaches for sports video management have shown the benefits of integrating low- and high-level video contents in supporting more robust retrieval. Howev...
Dian Tjondronegoro, Yi-Ping Phoebe Chen, Binh Pham
WSDM
2009
ACM
148 views, Data Mining
15 years 11 months ago
Information arbitrage across multi-lingual Wikipedia
The rapid globalization of Wikipedia is generating a parallel, multi-lingual corpus of unprecedented scale. Pages for the same topic in many different languages emerge both as a r...
Eytan Adar, Michael Skinner, Daniel S. Weld
CIKM
2009
Springer
15 years 10 months ago
Identifying comparable entities on the web
Web search engines are often presented with user queries that involve comparisons of real-world entities. Thus far, this interaction has typically been captured by users submittin...
Alpa Jain, Patrick Pantel