Sciweavers

» Unsupervised Relation Extraction From Web Documents
RIAO
2000
13 years 9 months ago
Combining linguistic and spatial information for document analysis
We present a framework to analyze color documents of complex layout. In addition, no assumption is made on the layout. Our framework combines in a content-driven bottom-up approac...
Marco Aiello, Christof Monz, Leon Todoran
DEXAW
2008
IEEE
13 years 10 months ago
Segmentation of Legislative Documents Using a Domain-Specific Lexicon
The amount of legal information is continuously growing. New legislative documents appear every day on the Web. Legal documents are produced on a daily basis in briefing format, cont...
Ismael Hasan, Javier Parapar, Roi Blanco
CIKM
2011
Springer
12 years 8 months ago
Integrating and querying web databases and documents
There exist many interrelated information sources on the Internet that can be categorized into structured (database) and semistructured (documents). A key challenge is to integrat...
Carlos Garcia-Alvarado, Carlos Ordonez
WWW
2010
ACM
14 years 3 months ago
CETR: content extraction via tag ratios
We present Content Extraction via Tag Ratios (CETR) – a method to extract content text from diverse webpages by using the HTML document’s tag ratios. We describe how to comput...
Tim Weninger, William H. Hsu, Jiawei Han
TREC
2000
13 years 9 months ago
The PISAB Question Answering System
The PISAB Question Answering system is based on a combination of Information Extraction and Information Retrieval techniques. Knowledge extracted from documents is modeled as a se...
Giuseppe Attardi, Cristian Burrini