Sciweavers

2677 search results - page 55 / 536
» Extracting Structured Data from Web Pages
Sort
View
SEMWEB
2007
Springer
14 years 2 months ago
DBpedia: A Nucleus for a Web of Open Data
Abstract DBpedia is a community effort to extract structured information from Wikipedia and to make this information available on the Web. DBpedia allows you to ask sophisticated ...
Sören Auer, Christian Bizer, Georgi Kobilarov...
AINA
2008
IEEE
13 years 10 months ago
Formalization of Link Farm Structure Using Graph Grammar
A link farm is a set of web pages constructed to mislead the importance of target pages in search engine results by boosting their link-based ranking scores. In this paper, we int...
Kiattikun Chobtham, Athasit Surarerks, Arnon Rungs...
SIGMOD
2000
ACM
236views Database» more  SIGMOD 2000»
14 years 1 months ago
XTRACT: A System for Extracting Document Type Descriptors from XML Documents
XML is rapidly emerging as the new standard for data representation and exchange on the Web. An XML document can be accompanied by a Document Type Descriptor (DTD) which plays the...
Minos N. Garofalakis, Aristides Gionis, Rajeev Ras...
HUMAN
2005
Springer
14 years 2 months ago
How to Evaluate the Effectiveness of URL Normalizations
Syntactically different URLs could represent the same web page on the World Wide Web, and duplicate representation for web pages causes web applications to handle a large amount of...
Sang Ho Lee, Sung Jin Kim, Hyo Sook Jeong