Sciweavers

74 search results - page 6 / 15
» Towards The Web of Concepts: Extracting Concepts from Large ...
WWW
2009
ACM
14 years 8 months ago
Extracting article text from the web with maximum subsequence segmentation
Much of the information on the Web is found in articles from online news outlets, magazines, encyclopedias, review collections, and other sources. However, extracting this content...
Jeff Pasternack, Dan Roth
ICADL
2010
Springer
14 years 14 days ago
Thesaurus Extension Using Web Search Engines
Maintaining and extending large thesauri is an important challenge facing digital libraries and IT businesses alike. In this paper we describe a method building on and extending ex...
Robert Meusel, Mathias Niepert, Kai Eckert, Heiner...
WEBDB
2010
Springer
14 years 22 days ago
Redundancy-Driven Web Data Extraction and Integration
A large number of web sites publish pages containing structured information about recognizable concepts, but these data are only partially used by current applications. Although s...
Paolo Papotti, Valter Crescenzi, Paolo Merialdo, M...
ITCC
2000
IEEE
14 years 1 days ago
Towards Knowledge Discovery from WWW Log Data
As the result of interactions between visitors and a web site, an HTTP log file contains very rich knowledge about users' on-site behaviors, which, if fully exploited, can better c...
Feng Tao, Fionn Murtagh
ASWC
2008
Springer
13 years 9 months ago
SAOR: Authoritative Reasoning for the Web
In this paper we discuss the challenges of performing reasoning on large-scale RDF datasets from the Web. We discuss issues and practical solutions relating to reasoning ...
Aidan Hogan, Andreas Harth, Axel Polleres