Sciweavers

591 search results - page 27 / 119
» Extracting Route Directions from Web Pages
Sort
View
ICDM
2007
IEEE
149views Data Mining» more  ICDM 2007»
14 years 4 months ago
Extracting Author Meta-Data from Web Using Visual Features
Enriching digital library’s author meta-data can lead to valuable services and applications. This paper addresses the problem of extracting authors’ information from their hom...
Shuyi Zheng, Ding Zhou, Jia Li, C. Lee Giles
BMCBI
2008
91views more  BMCBI 2008»
13 years 10 months ago
PageRank without hyperlinks: Reranking with PubMed related article networks for biomedical text retrieval
Background: Graph analysis algorithms such as PageRank and HITS have been successful in Web environments because they are able to extract important inter-document relationships fr...
Jimmy J. Lin
PERCOM
2005
ACM
14 years 9 months ago
Efficient Browsing of Web Search Results on Mobile Devices Based on Block Importance Model
It is expected that more and more people will search the web when they are on the move. Though conventional search engines can be directly visited from mobile devices with web bro...
Xing Xie, Gengxin Miao, Ruihua Song, Ji-Rong Wen, ...
WEBDB
1999
Springer
196views Database» more  WEBDB 1999»
14 years 2 months ago
Web Ecology: Recycling HTML Pages as XML Documents Using W4F
In this paper we present the World-Wide Web Wrapper Factory (W4F), a Java toolkit to generate wrappers for Web data sources. Some key features of W4F are an expressive language to...
Arnaud Sahuguet, Fabien Azavant
LREC
2010
216views Education» more  LREC 2010»
13 years 11 months ago
BlogBuster: A Tool for Extracting Corpora from the Blogosphere
This paper presents BlogBuster, a tool for extracting a corpus from the blogosphere. The topic of cleaning arbitrary web pages with the goal of extracting a corpus from web data, ...
Georgios Petasis, Dimitrios Petasis