Sciweavers

1380 search results
» Combination of Document Priors in Web Information Retrieval
CIKM
2005
Springer
14 years 1 month ago
Document quality models for web ad hoc retrieval
The quality of document content, which is an issue that is usually ignored for the traditional ad hoc retrieval task, is a critical issue for Web search. Web pages have a huge var...
Yun Zhou, W. Bruce Croft
CIKM
2005
Springer
14 years 1 month ago
Web-centric language models
We investigate language models for informational and navigational web search. Retrieval on the web is a task that differs substantially from ordinary ad hoc retrieval. We perfor...
Jaap Kamps
SIGIR
2010
ACM
13 years 12 months ago
Linking Wikipedia to the web
We investigate the task of finding links from Wikipedia pages to external web pages. Such external links significantly extend the information in Wikipedia with information from ...
Rianne Kaptein, Pavel Serdyukov, Jaap Kamps
JCDL
2006
ACM
14 years 1 month ago
Combining DOM tree and geometric layout analysis for online medical journal article segmentation
We describe an HTML web page segmentation algorithm, which is applied to segment online medical journal articles (regular HTML and PDF-Converted-HTML files). The web page content ...
Jie Zou, Daniel X. Le, George R. Thoma
IIWAS
2008
13 years 9 months ago
Combining content extraction heuristics: the CombinE system
The main text content of an HTML document on the WWW is typically surrounded by additional contents, such as navigation menus, advertisements, link lists or design elements. Conte...
Thomas Gottron