Sciweavers

910 search results - page 34 / 182
» Testbed for information extraction from deep web
Sort
View
WWW
2010
ACM
14 years 24 days ago
Web-scale knowledge extraction from semi-structured tables
A wealth of knowledge is encoded in the form of tables on the World Wide Web. We propose a classification algorithm and a rich feature set for automatically recognizing layout tab...
Eric Crestan, Patrick Pantel
CIKM
2008
Springer
13 years 10 months ago
Coreex: content extraction from online news articles
We developed and tested a heuristic technique for extracting the main article from news site Web pages. We construct the DOM tree of the page and score every node based on the amo...
Jyotika Prasad, Andreas Paepcke
COLING
2010
13 years 3 months ago
A Method for Automatically Generating a Mediatory Summary to Verify Credibility of Information on the Web
In this paper, we propose a method for mediatory summarization, which is a novel technique for facilitating users' assessments of the credibility of information on the Web. A...
Hideyuki Shibuki, Takahiro Nagai, Masahiro Nakano,...
WWW
2010
ACM
14 years 3 months ago
Automatic extraction of clickable structured web contents for name entity queries
Today the major web search engines answer queries by showing ten result snippets, which need to be inspected by users for identifying relevant results. In this paper we investigat...
Xiaoxin Yin, Wenzhao Tan, Xiao Li, Yi-Chin Tu
WWW
2010
ACM
14 years 3 months ago
Entity relation discovery from web tables and links
The World-Wide Web consists not only of a huge number of unstructured texts, but also a vast amount of valuable structured data. Web tables [2] are a typical type of structured in...
Cindy Xide Lin, Bo Zhao, Tim Weninger, Jiawei Han,...