Sciweavers

» Extracting Structured Data from Web Pages
ESWA
2008
13 years 8 months ago
Image semantics discovery from web pages for semantic-based image retrieval using self-organizing maps
Traditional content-based image retrieval (CBIR) systems often fail to meet a user's need due to the 'semantic gap' between the extracted features of the systems and the...
Hsin-Chang Yang, Chung-Hong Lee
PODS
2004
ACM
14 years 8 months ago
The Lixto Data Extraction Project - Back and Forth between Theory and Practice
We present the Lixto project, which is both a research project in database theory and a commercial enterprise that develops Web data extraction (wrapping) and Web service definiti...
Georg Gottlob, Christoph Koch, Robert Baumgartner,...
ACL
2008
13 years 10 months ago
Mining Parenthetical Translations from the Web by Word Alignment
Documents in languages such as Chinese, Japanese and Korean sometimes annotate terms with their translations in English inside a pair of parentheses. We present a method to extrac...
Dekang Lin, Shaojun Zhao, Benjamin Van Durme, Mari...
WWW
2011
ACM
13 years 3 months ago
Identifying primary content from web pages and its application to web search ranking
Web pages are usually highly structured documents. In some documents, content with different functionality is laid out in blocks, some merely supporting the main discourse. In ot...
Srinivas Vadrevu, Emre Velipasaoglu
FLAIRS
2003
13 years 10 months ago
Structural Web Search Engine
We present a new approach in web search engines. The web creates new challenges for information retrieval. The vast improvement in information access is not the only advantage res...
Arash Rakhshan, Lawrence B. Holder, Diane J. Cook