Search Sciweavers | Sciweavers

591 search results - page 20 / 119

» Extracting Route Directions from Web Pages

197

click to vote

KDD
2002
ACM

170views Data Mining» more KDD 2002»

Web site mining: a new way to spot competitors, customers and suppliers in the world wide web

16 years 7 months ago

Download www.cs.sfu.ca

When automatically extracting information from the world wide web, most established methods focus on spotting single HTMLdocuments. However, the problem of spotting complete web s...

Martin Ester, Hans-Peter Kriegel, Matthias Schuber...

claim paper

Read More »

216

click to vote

COOPIS
1999
IEEE

107views Information Technology» more COOPIS 1999»

Looking at the Web through XML Glasses

15 years 11 months ago

Download db.cis.upenn.edu

The Web so far has been incredibly successful at delivering information to human users. So successful actually, that there is now an urgent need to go beyond a browsing human and ...

Arnaud Sahuguet, Fabien Azavant

claim paper

Read More »

195

click to vote

CIKM
2006
Springer

186views Information Technology» more CIKM 2006»

A fast and robust method for web page template detection and removal

15 years 11 months ago

Download www.cs.utah.edu

The widespread use of templates on the Web is considered harmful for two main reasons. Not only do they compromise the relevance judgment of many web IR and web mining methods suc...

Karane Vieira, Altigran Soares da Silva, Nick Pint...

claim paper

Read More »

195

Voted

SIGIR
2000
ACM

160views Information Technology» more SIGIR 2000»

OCELOT: a system for summarizing Web pages

15 years 11 months ago

Download www.cs.cmu.edu

Abstract We introduce OCELOT, a prototype system for automatically generating the “gist” of a web page by summarizing it. Although most text summarization research to date has ...

Adam L. Berger, Vibhu O. Mittal

claim paper

Read More »

161

Voted

HUMAN
2005
Springer

144views Social Sciences» more HUMAN 2005»

How to Evaluate the Effectiveness of URL Normalizations

16 years 27 days ago

Download dblab.ssu.ac.kr

Syntactically different URLs could represent the same web page on the World Wide Web, and duplicate representation for web pages causes web applications to handle a large amount of...

Sang Ho Lee, Sung Jin Kim, Hyo Sook Jeong

claim paper

Read More »

« Prev « First page 20 / 119 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers