Sciweavers

» From Web Data to Entities and Back
ICDE
2007
IEEE
14 years 9 months ago
Collaborative Wrapping: A Turbo Framework for Web Data Extraction
To access data sources on the Web, a crucial step is wrapping, which translates query responses, rendered in textual HTML, back into their relational form. Traditionally, this pro...
Shui-Lung Chuang, Kevin Chen-Chuan Chang, ChengXia...
TAL
2010
Springer
13 years 6 months ago
Portable Extraction of Partially Structured Facts from the Web
A novel fact extraction task is defined to fill a gap between current information retrieval and information extraction technologies. It is shown that it is possible to extract usef...
Andrew Salway, Liadh Kelly, Inguna Skadina, Gareth...
ADMA
2006
Springer
14 years 1 months ago
Web Scale Competitor Discovery Using Mutual Information
The web, with its rapid expansion, has become an excellent resource for gathering information and people’s opinions. A company owner wants to know who is the competitor, a...
Rui Li, Shenghua Bao, Jin Wang, Yuanjie Liu, Yong ...
WWW
2010
ACM
13 years 11 months ago
Enabling entity-based aggregators for web 2.0 data
Selecting and presenting content culled from multiple heterogeneous and physically distributed sources is a challenging task. The exponential growth of the web data in modern time...
Ekaterini Ioannou, Claudia Niederée, Yannis...
CIDR
2011
12 years 11 months ago
Longitudinal Analytics on Web Archive Data: It's About Time!
Organizations like the Internet Archive have been capturing Web contents over decades, building up huge repositories of time-versioned pages. The timestamp annotations and the she...
Gerhard Weikum, Nikos Ntarmos, Marc Spaniol, Peter...