Sciweavers

98 search results - page 7 / 20
» Towards domain-independent information extraction from web t...
Sort
View
RULEML
2004
Springer
14 years 27 days ago
Rule Learning for Feature Values Extraction from HTML Product Information Sheets
The Web is now a huge information repository with a rich semantic structure that, however, is primarily addressed to human understanding rather than automated processing by a compu...
Costin Badica, Amelia Badica
JCDL
2011
ACM
218views Education» more  JCDL 2011»
12 years 10 months ago
Retrieving attributes using web tables
In this paper we propose an attribute retrieval approach which extracts and ranks attributes from Web tables. We use simple heuristics to filter out improbable attributes and we ...
Arlind Kopliku, Karen Pinel-Sauvagnat, Mohand Boug...
WWW
2011
ACM
13 years 2 months ago
FACTO: a fact lookup engine based on web tables
Recently answers for fact lookup queries have appeared on major search engines. For example, for the query {Barack Obama date of birth} Google directly shows “4 August 1961” a...
Xiaoxin Yin, Wenzhao Tan, Chao Liu
IJMSO
2008
149views more  IJMSO 2008»
13 years 7 months ago
Categorisation of web documents using extraction ontologies
: Automatically recognising which HTML documents on the Web contain items of interest for a user is non-trivial. As a step toward solving this problem, we propose an approach based...
Li Xu, David W. Embley
KDD
2007
ACM
155views Data Mining» more  KDD 2007»
14 years 8 months ago
Mining templates from search result records of search engines
Metasearch engine, Comparison-shopping and Deep Web crawling applications need to extract search result records enwrapped in result pages returned from search engines in response ...
Hongkun Zhao, Weiyi Meng, Clement T. Yu