Sciweavers

1052 search results - page 8 / 211
» Improved CHAID algorithm for document structure modelling
KDD
2008
ACM
14 years 8 months ago
Structured entity identification and document categorization: two tasks with one joint model
Traditionally, research in identifying structured entities in documents has proceeded independently of document categorization research. In this paper, we observe that these two t...
Indrajit Bhattacharya, Shantanu Godbole, Sachindra...
ACL
2010
13 years 5 months ago
A Hybrid Hierarchical Model for Multi-Document Summarization
Scoring sentences in documents given abstract summaries created by humans is important in extractive multi-document summarization. In this paper, we formulate extractive summariza...
Asli Çelikyilmaz, Dilek Hakkani-Tur
APWEB
2010
Springer
13 years 6 months ago
Suggesting Topic-Based Query Terms as You Type
Query term suggestion that interactively expands the queries is an indispensable technique to help users formulate high-quality queries and has attracted much attention in the ...
Ju Fan, Hao Wu, Guoliang Li, Lizhu Zhou
BNCOD
2007
13 years 9 months ago
Indexing and Searching XML Documents Based on Content and Structure Synopses
We present a novel framework for indexing and searching schema-less XML documents based on concise summaries of their structural and textual content. Our search query language is X...
Weimin He, Leonidas Fegaras, David Levine
ICDAR
2009
IEEE
14 years 2 months ago
Enhanced Text Extraction from Arabic Degraded Document Images Using EM Algorithm
This paper presents a new enhanced text extraction algorithm from degraded document images on the basis of the probabilistic models. The observed document image is considered as a...
Wafa Boussellaa, Aymen Bougacha, Abderrazak Zahour...