Search Sciweavers | Sciweavers

591 search results - page 27 / 119

» Extracting Route Directions from Web Pages

214

click to vote

ICDM
2007
IEEE

149views Data Mining» more ICDM 2007»

Extracting Author Meta-Data from Web Using Visual Features

16 years 1 months ago

Download www.cse.psu.edu

Enriching digital library’s author meta-data can lead to valuable services and applications. This paper addresses the problem of extracting authors’ information from their hom...

Shuyi Zheng, Ding Zhou, Jia Li, C. Lee Giles

claim paper

Read More »

179

Voted

BMCBI
2008

91views more BMCBI 2008»

PageRank without hyperlinks: Reranking with PubMed related article networks for biomedical text retrieval

15 years 7 months ago

Download www.biomedcentral.com

Background: Graph analysis algorithms such as PageRank and HITS have been successful in Web environments because they are able to extract important inter-document relationships fr...

Jimmy J. Lin

claim paper

Read More »

262

Voted

PERCOM
2005
ACM

208views Computer Networks» more PERCOM 2005»

Efficient Browsing of Web Search Results on Mobile Devices Based on Block Importance Model

16 years 7 months ago

Download research.microsoft.com

It is expected that more and more people will search the web when they are on the move. Though conventional search engines can be directly visited from mobile devices with web bro...

Xing Xie, Gengxin Miao, Ruihua Song, Ji-Rong Wen, ...

claim paper

Read More »

222

click to vote

WEBDB
1999
Springer

196views Database» more WEBDB 1999»

Web Ecology: Recycling HTML Pages as XML Documents Using W4F

15 years 11 months ago

Download db.cis.upenn.edu

In this paper we present the World-Wide Web Wrapper Factory (W4F), a Java toolkit to generate wrappers for Web data sources. Some key features of W4F are an expressive language to...

Arnaud Sahuguet, Fabien Azavant

claim paper

Read More »

174

Voted

LREC
2010

216views Education» more LREC 2010»

BlogBuster: A Tool for Extracting Corpora from the Blogosphere

15 years 9 months ago

Download www.lrec-conf.org

This paper presents BlogBuster, a tool for extracting a corpus from the blogosphere. The topic of cleaning arbitrary web pages with the goal of extracting a corpus from web data, ...

Georgios Petasis, Dimitrios Petasis

claim paper

Read More »

« Prev « First page 27 / 119 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers