Sciweavers

» Robust web content extraction
KES
2004
Springer
14 years 4 days ago
Intelligent Web Site: Understanding the Visitor Behavior
Abstract. An intelligent web site is a new generation of portal, able to improve its structure and content based on analysis of user behavior. This paper focuses on modeling the ...
Juan D. Velásquez, Pablo A. Estévez,...
SIGIR
2005
ACM
14 years 10 days ago
Title extraction from bodies of HTML documents and its application to web page retrieval
This paper is concerned with automatic extraction of titles from the bodies of HTML documents. Titles of HTML documents should be correctly defined in the title fields; however, i...
Yunhua Hu, Guomao Xin, Ruihua Song, Guoping Hu, Sh...
AAAI
2008
13 years 9 months ago
Extracting Relevant Snippets for Web Navigation
Search engines present fixed-length passages from documents ranked by relevance against the query. In this paper, we present and compare novel, language-model-based methods for extr...
Qing Li, K. Selçuk Candan, Qi Yan
WWW
2006
ACM
14 years 7 months ago
Toward tighter integration of web search with a geographic information system
Integration of Web search with geographic information has recently attracted much attention. There are a number of local Web search systems enabling users to find location-specific...
Taro Tezuka, Takeshi Kurashima, Katsumi Tanaka
WWW
2009
ACM
14 years 7 months ago
Extracting article text from the web with maximum subsequence segmentation
Much of the information on the Web is found in articles from online news outlets, magazines, encyclopedias, review collections, and other sources. However, extracting this content...
Jeff Pasternack, Dan Roth