Sciweavers

429 search results - page 35 / 86
» Machine Learning for Intelligent Processing of Printed Docum...
Sort
View
PRICAI
2000
Springer
13 years 11 months ago
A Comparative Study on Chinese Text Categorization Methods
Abstract. This paper reports our comparative evaluation of three machine learning methods on Chinese text categorization. Whereas a wide range of methods have been applied to Engli...
Ji He, Ah-Hwee Tan, Chew Lim Tan
SIGIR
2003
ACM
14 years 1 months ago
eBizSearch: a niche search engine for e-business
Niche Search Engines offer an efficient alternative to traditional search engines when the results returned by general-purpose search engines do not provide a sufficient degree of...
C. Lee Giles, Yves Petinot, Pradeep B. Teregowda, ...
EMNLP
2009
13 years 5 months ago
Active Learning by Labeling Features
Methods that learn from prior information about input features such as generalized expectation (GE) have been used to train accurate models with very little effort. In this paper,...
Gregory Druck, Burr Settles, Andrew McCallum
SIGMOD
2008
ACM
123views Database» more  SIGMOD 2008»
14 years 8 months ago
SchemaScope: a system for inferring and cleaning XML schemas
We present SchemaScope, a system to derive Document Type Definitions and XML Schemas from corpora of sample XML documents. Tools are provided to visualize, clean, and refine exist...
Geert Jan Bex, Frank Neven, Stijn Vansummeren
CIKM
2007
Springer
14 years 2 months ago
The role of documents vs. queries in extracting class attributes from text
Challenging the implicit reliance on document collections, this paper discusses the pros and cons of using query logs rather than document collections, as self-contained sources o...
Marius Pasca, Benjamin Van Durme, Nikesh Garera