Search Sciweavers | Sciweavers

563 search results - page 59 / 113

» Crawling the web for structured documents


WWW
2005
ACM

173 views, Internet Technology

Automatically learning document taxonomies for hierarchical classification

16 years 4 months ago

Download www.ideal.ece.utexas.edu

While several hierarchical classification methods have been applied to web content, such techniques invariably rely on a pre-defined taxonomy of documents. We propose a new techni...

Kunal Punera, Suju Rajan, Joydeep Ghosh


DIS
2001
Springer

93 views, Theoretical Computer Science

Eliminating Useless Parts in Semi-structured Documents Using Alternation Counts

15 years 8 months ago

Download www.i.kyushu-u.ac.jp

We propose a preprocessing method for Web mining which, given semi-structured documents with the same structure and style, distinguishes useless parts and non-useless parts in each...

Daisuke Ikeda, Yasuhiro Yamada, Sachio Hirokawa


WEBDB
2004
Springer

125 views, Database

Best-Match Querying from Document-Centric XML

15 years 9 months ago

Download webdb2004.cs.columbia.edu

On the Web, there is a pervasive use of XML to give lightweight semantics to textual collections. Such document-centric XML collections require a query language that can gracefully...

Jaap Kamps, Maarten Marx, Maarten de Rijke, Bö...


SIGDOC
2006
ACM

104 views, Document Analysis

Taming the inaccessible web

15 years 10 months ago

Download www.simonharper.info

Visually impaired users are hindered in their efforts to access the largest repository of electronic information in the world, namely the World Wide Web (Web). A visually impaired...

Simon Harper, Sean Bechhofer, Darren Lunn


KDD
2005
ACM

194 views, Data Mining

Web object indexing using domain knowledge

16 years 4 months ago

Download research.microsoft.com

Web object is defined to represent any meaningful object embedded in web pages (e.g. images, music) or pointed to by hyperlinks (e.g. downloadable files). Users usually search for...

Muyuan Wang, Zhiwei Li, Lie Lu, Wei-Ying Ma, Naiya...
