Sciweavers

1145 search results - page 104 / 229
» Open Information Extraction from the Web
Sort
View
COMAD
2009
13 years 9 months ago
Business Insight from Collection of Unstructured Formatted Documents with IBM Content Harvester
In this paper, we report the development and experiments of IBM Content Harvester (CH), a tool to analyze and recover templates and content from word processor created text docume...
Biplav Srivastava, Yuan-Chi Chang
WWW
2007
ACM
14 years 8 months ago
U-REST: an unsupervised record extraction system
In this paper, we describe a system that can extract record structures from web pages with no direct human supervision. Records are commonly occurring HTML-embedded data tuples th...
Yuan Kui Shen, David R. Karger
SIGMOD
2007
ACM
188views Database» more  SIGMOD 2007»
14 years 8 months ago
Intel Mash Maker: join the web
Intel? Mash Maker is an interactive tool that tracks what the user is doing and tries to infer what information and visualizations they might find useful for their current task. M...
Robert Ennals, Eric A. Brewer, Minos N. Garofalaki...
MM
2004
ACM
151views Multimedia» more  MM 2004»
14 years 1 months ago
Grouping web image search result
In this paper, we propose a Web image search result organizing method to facilitate user browsing. We formalize this problem as a salient image region pattern extraction problem. ...
Xin-Jing Wang, Wei-Ying Ma, Qi-Cai He, Xing Li
ITCC
2000
IEEE
14 years 11 days ago
Towards Knowledge Discovery from WWW Log Data
As the result of interactions between visitors and a web site, an http log file contains very rich knowledge about users on-site behaviors, which, if fully exploited, can better c...
Feng Tao, Fionn Murtagh