Query Relaxation by Structure and Semantics for Retrieval of Logical Web Documents

15 years 5 months ago

Download www.public.asu.edu

Since WWW encourages hypertext and hypermedia document authoring (e.g. HTML or XML), Web authors tend to create documents that are composed of multiple pages connected with hyperlinks. A Web document may be authored in multiple ways, such as (1) all information in one physical page, or (2) a main page and the related information in separate linked pages. Existing Web search engines, however, return only physical pages containing keywords. In this paper, we introduce the concept of information unit, which can be viewed as a logical Web document consisting of multiple physical pages as one atomic retrieval unit. We present an algorithm to efficiently retrieve information units. Our algorithm can perform progressive query processing. These functionalities are essential for information retrieval on the Web and a large XML database. We also present experimental results on synthetic graphs and real Web data. Keywords. Web proximity search, link structures, query relaxation, progressive proc...

Wen-Syan Li, K. Selçuk Candan, Quoc Vu, Div

Real-time Traffic

Document | Pages | Physical Pages | TKDE 2002 |

claim paper

» A Unified Approach to Retrieving Web Documents and Semantic Web Data

» Relaxing XML Preference Queries for Cooperative Retrieval

» Incremental Query Answering for Implementing Document Retrieval Services

» Optimization Techniques for Retrieving Resources Described in OWLRDF Documents First Resul...

» Indexing Documents by Discourse and Semantic Contents from Automatic Annotations of Texts

» Narrowing the semantic gap improved textbased web document retrieval using visual feature...

» A Methodology to Create OntologyBased Information Retrieval Systems

» An Ontology for Domainoriented Semantic Similarity Search on XML Data

Post Info
More Details (n/a)

Added	23 Dec 2010
Updated	23 Dec 2010
Type	Journal
Year	2002
Where	TKDE
Authors	Wen-Syan Li, K. Selçuk Candan, Quoc Vu, Divyakant Agrawal

Comments (0)

Sciweavers

Query Relaxation by Structure and Semantics for Retrieval of Logical Web Documents

Document | Pages | Physical Pages | TKDE 2002 |

Explore & Download

Productivity Tools

Sciweavers