Sciweavers

1380 search results - page 28 / 276
» Combination of Document Priors in Web Information Retrieval
Sort
View
CLEF
2009
Springer
13 years 9 months ago
Prior Art Search Using International Patent Classification Codes and All-Claims-Queries
In this study, we describe our system at the Intellectual Property track of the 2009 CrossLanguage Evaluation Forum campaign (CLEF-IP). The CLEF-IP track addressed prior art searc...
Benjamin Herbert, György Szarvas, Iryna Gurev...
SIGIR
2004
ACM
14 years 1 months ago
Query-related data extraction of hidden web documents
The larger amount of information on the Web is stored in document databases and is not indexed by general-purpose search engines (i.e., Google and Yahoo). Such information is dyna...
Yih-Ling Hedley, Muhammad Younas, Anne E. James, M...
CIKM
2003
Springer
14 years 1 months ago
Extracting unstructured data from template generated web documents
We propose a novel approach that identifies web page templates and extracts the unstructured data. Extracting only the body of the page and eliminating the template increases the ...
Ling Ma, Nazli Goharian, Abdur Chowdhury, Misun Ch...
ISDA
2009
IEEE
14 years 2 months ago
A Combined Query Expansion Technique for Retrieving Opinions from Blogs
In this paper, we discuss the the role of the retrieval component in an TREC style opinion question answering system. Since blog retrieval differs from traditional ad-hoc document...
Saeedeh Momtazi, Stefan Kazalski, Dietrich Klakow
WWW
2003
ACM
14 years 8 months ago
Improving pseudo-relevance feedback in web information retrieval using web page segmentation
In contrast to traditional document retrieval, a web page as a whole is not a good information unit to search because it often contains multiple topics and a lot of irrelevant inf...
Shipeng Yu, Deng Cai, Ji-Rong Wen, Wei-Ying Ma