Sciweavers

160 search results - page 6 / 32
» Exploiting structural information for semi-structured docume...
Sort
View
ECML
2006
Springer
13 years 9 months ago
Distributional Features for Text Categorization
Abstract-- Text categorization is the task of assigning predefined categories to natural language text. With the widely used `bag of words' representation, previous researches...
Xiao-Bing Xue, Zhi-Hua Zhou
ECIR
2003
Springer
13 years 9 months ago
Discretizing Continuous Attributes in AdaBoost for Text Categorization
Abstract. We focus on two recently proposed algorithms in the family of “boosting”-based learners for automated text classification, AdaBoost.MH and AdaBoost.MHKR . While the ...
Pio Nardiello, Fabrizio Sebastiani, Alessandro Spe...
DKE
2006
139views more  DKE 2006»
13 years 7 months ago
Information extraction from structured documents using k-testable tree automaton inference
Information extraction (IE) addresses the problem of extracting specific information from a collection of documents. Much of the previous work on IE from structured documents, suc...
Raymond Kosala, Hendrik Blockeel, Maurice Bruynoog...
IJCAI
1997
13 years 8 months ago
Toward Structured Retrieval in Semi-structured Information Spaces
A semi-structured information space consists of multiple collections of textual documents containing fielded or tagged sections. The space can be highly heterogeneous, because eac...
Scott B. Huffman, Catherine Baudin
RIAO
2004
13 years 9 months ago
Integrating XLink and XPath to Retrieve Structured Multimedia Documents in Digital Libraries
To support the retrieval of multimedia data according to user information needs, multimedia information retrieval in digital libraries must be based on semantics and not just prim...
Zhigang Kong, Mounia Lalmas