Sciweavers

684 search results - page 4 / 137
» Extracting semantic structure of web documents using content...
Sort
View
KAIS
2006
102views more  KAIS 2006»
13 years 7 months ago
Visual information extraction
Typographic and visual information is an integral part of textual documents. Most information extraction systems ignore most of this visual information, processing the text as a l...
Yonatan Aumann, Ronen Feldman, Yair Liberzon, Biny...
WWW
2003
ACM
14 years 8 months ago
Improving pseudo-relevance feedback in web information retrieval using web page segmentation
In contrast to traditional document retrieval, a web page as a whole is not a good information unit to search because it often contains multiple topics and a lot of irrelevant inf...
Shipeng Yu, Deng Cai, Ji-Rong Wen, Wei-Ying Ma
CIKM
2008
Springer
13 years 9 months ago
Using structured text for large-scale attribute extraction
We propose a weakly-supervised approach for extracting class attributes from structured text available within Web documents. The overall precision of the extracted attributes is a...
Sujith Ravi, Marius Pasca
SIGIR
2004
ACM
14 years 27 days ago
Query-related data extraction of hidden web documents
The larger amount of information on the Web is stored in document databases and is not indexed by general-purpose search engines (i.e., Google and Yahoo). Such information is dyna...
Yih-Ling Hedley, Muhammad Younas, Anne E. James, M...
CIKM
2010
Springer
13 years 6 months ago
Automatic metadata extraction from multilingual enterprise content
Enterprises provide professionally authored content about their products/services in different languages for use in web sites and customer care. For customer care, personalization...
Melike Sah, Vincent Wade