Sciweavers

On the Automatic Extraction of Data from the Hidden Web
VLDB
2011
ACM
13 years 3 months ago
Harvesting relational tables from lists on the web
A large number of web pages contain data structured in the form of “lists”. Many such lists can be further split into multi-column tables, which can then be used in more seman...
Hazem Elmeleegy, Jayant Madhavan, Alon Y. Halevy
DOCENG
2009
ACM
14 years 3 months ago
Web document text and images extraction using DOM analysis and natural language processing
Web Document Text and Images Extraction using DOM Analysis and Natural Language Processing. Parag Mulendra Joshi, Sam Liu. HP Laboratories, HPL-2009-187. Web page text extraction,...
Parag Mulendra Joshi, Sam Liu
IJCAI
2003
13 years 10 months ago
Information Extraction from Tree Documents by Learning Subtree Delimiters
Information extraction from HTML pages has been conventionally treated as plain text documents extended with HTML tags. However, the growing maturity and correct usage of HTML/XHT...
Boris Chidlovskii
HIM
1997
Springer
14 years 29 days ago
High Precision Hypertext Navigation Based on NLP Automatic Extractions
The growing popularity of hypertext navigation systems and the availability of large documentary databases is leading to the design of navigation systems that allow to explore the...
Frantz Vichot, Francis Wolinski, Joseph Tomeh, Syl...
SIGPRO
2008
13 years 8 months ago
Recognition of off-line printed Arabic text using Hidden Markov Models
This paper describes a technique for automatic recognition of off-line printed Arabic text using Hidden Markov Models. In this work different sizes of overlapping and non-overlapp...
Husni A. Al-Muhtaseb, Sabri A. Mahmoud, Rami Qahwa...