Sciweavers

80 search results - page 4 / 16
» Automatic Category Generation for Text Documents by Self-Org...
Sort
View
WWW
2007
ACM
14 years 7 months ago
Academic web search engine: generating a survey automatically
Given a document repository, search engine is very helpful to retrieve information. Currently, vertical search is a hot topic, and Google Scholar [4] is an example for academic se...
Ye Wang, Zhihua Geng, Sheng Huang, Xiaoling Wang, ...
DOCENG
2008
ACM
13 years 9 months ago
Constructing a know-how repository of advices and warnings from procedural texts
In this paper, we show how a domain dependent know-how textual database of advices and warnings can be constructed from procedural texts. We show how arguments of type warnings an...
Lionel Fontan, Patrick Saint-Dizier
DOCENG
2009
ACM
14 years 1 months ago
From rhetorical structures to document structure: shallow pragmatic analysis for document engineering
In this paper, we extend previous work on the automatic structuring of medical documents using content analysis. Our long-term objective is to take advantage of specific rhetoric ...
Gersende Georg, Hugo Hernault, Marc Cavazza, Helmu...
DOCENG
2005
ACM
13 years 9 months ago
Schema matching for transforming structured documents
Structured document content reuse is the problem of restructuring and translating data structured under a source schema into an instance of a target schema. A notion closely tied ...
Aida Boukottaya, Christine Vanoirbeek
CIKM
2003
Springer
14 years 10 days ago
Extracting unstructured data from template generated web documents
We propose a novel approach that identifies web page templates and extracts the unstructured data. Extracting only the body of the page and eliminating the template increases the ...
Ling Ma, Nazli Goharian, Abdur Chowdhury, Misun Ch...