Search Sciweavers | Sciweavers

472 search results - page 56 / 95

» Crawling the Hidden Web

WSDM
2010
ACM

204 views, Data Mining

Learning URL patterns for webpage de-duplication

15 years 10 months ago

Download www.wsdm-conference.org

Presence of duplicate documents in the World Wide Web adversely affects crawling, indexing and relevance, which are the core building blocks of web search. In this paper, we pres...

Hema Swetha Koppula, Krishna P. Leela, Amit Agarwa...

ICDE
2008
IEEE

153 views, Database

Automatically Extracting Form Labels

16 years 5 months ago

Download www.cs.utah.edu

We describe a machine-learning-based approach for extracting attribute labels from Web form interfaces. Having these labels is a requirement for several techniques that attempt to ...

Hoa Nguyen, Eun Yong Kang, Juliana Freire

WWW
2009
ACM

125 views, Internet Technology

Triplify: light-weight linked data publication from relational databases

16 years 4 months ago

Download www.informatik.uni-leipzig.de

In this paper we present Triplify, a simplistic but effective approach to publish Linked Data from relational databases. Triplify is based on mapping HTTP-URI requests onto relat...

Sören Auer, Sebastian Dietzold, Jens Lehmann,...

SAC
2005
ACM

138 views, Applied Computing

Pollock: automatic generation of virtual web services from web sites

15 years 9 months ago

Download pike.psu.edu

As the usage of Web Services proliferates dramatically, new tools to help quickly generate web services are needed. In this paper, we propose a methodology that helps to automatic...

Yi-Hsuan Lu, Yoojin Hong, Jinesh Varia, Dongwon Le...

IJWIS
2007

77 views

World's first web census

15 years 3 months ago

Download cs.acadiau.ca

Purpose: To measure the exact size of the World Wide Web (i.e., a census). The measure used is the number of publicly accessible web servers on port 80. Design/methodology/app...

Darcy G. Benoit, André Trudel
