Sciweavers

2677 search results - page 52 / 536
» Extracting Structured Data from Web Pages
Sort
View
ICDAR
2007
IEEE
13 years 10 months ago
Example-Based Logical Labeling of Document Title Page Images
This paper presents a flexible and effective examplebased approach for labeling title pages which can be used for automated extraction of bibliographic data. The labels of intere...
Joost van Beusekom, Daniel Keysers, Faisal Shafait...
DEXA
2008
Springer
181views Database» more  DEXA 2008»
13 years 10 months ago
Query Recommendation Using Large-Scale Web Access Logs and Web Page Archive
Query recommendation suggests related queries for search engine users when they are not satisfied with the results of an initial input query, thus assisting users in improving sear...
Lin Li, Shingo Otsuka, Masaru Kitsuregawa
NAACL
2003
13 years 10 months ago
A Web-Trained Extraction Summarization System
A serious bottleneck in the development of trainable text summarization systems is the shortage of training data. Constructing such data is a very tedious task, especially because...
Liang Zhou, Eduard H. Hovy
CIS
2005
Springer
14 years 2 months ago
A Method for Automating the Extraction of Specialized Information from the Web
The World Wide Web can be viewed as a gigantic distributed database including millions of interconnected hosts some of which publish information via web servers or peer-to-peer sys...
Ling Lin, Antonio Liotta, Andrew Hippisley
WWW
2008
ACM
14 years 9 months ago
Can chinese web pages be classified with english data source?
As the World Wide Web in China grows rapidly, mining knowledge in Chinese Web pages becomes more and more important. Mining Web information usually relies on the machine learning ...
Xiao Ling, Gui-Rong Xue, Wenyuan Dai, Yun Jiang, Q...