Search Sciweavers | Sciweavers

563 search results - page 46 / 113

» Crawling the web for structured documents

ICML
2005
IEEE

126 views, Machine Learning

Hierarchical Dirichlet model for document classification

16 years 4 months ago

Download www.machinelearning.org

The proliferation of text documents on the web as well as within institutions necessitates their convenient organization to enable efficient retrieval of information. Although tex...

Sriharsha Veeramachaneni, Diego Sona, Paolo Avesan...

FEGC
2006

92 views, Biometrics

Maintaining an Online Bibliographical Database: The Problem of Data Quality

15 years 5 months ago

Download dblp.uni-trier.de

CiteSeer and Google-Scholar are huge digital libraries which provide access to (computer-)science publications. Both collections are operated like specialized search engines, they ...

Michael Ley, Patrick Reuther

ICTIR
2009
Springer

137 views, Information Technology

What's in a Link? From Document Importance to Topical Relevance

15 years 10 months ago

Download staff.science.uva.nl

Web information retrieval is best known for its use of the Web’s link structure as a source of evidence. Global link evidence is by nature query-independent, and is therefore no ...

Marijn Koolen, Jaap Kamps

IEEESCC
2008
IEEE

110 views, Applied Computing

Exploiting XML Schema for Interpreting XML Documents as RDF

15 years 10 months ago

Download uclab.khu.ac.kr

Interpreting legacy XML documents is a great challenge for realizing the vision of the Semantic Web (SW). This paper presents an algorithm to transform XML data into RDF- foundati...

Pham Thi Thu Thuy, Young-Koo Lee, Sungyoung Lee, B...

DOCENG
2007
ACM

105 views, Document Analysis

Editing with style

15 years 5 months ago

Download hal.archives-ouvertes.fr

HTML has popularized the use of style sheets, and the advent of XML has stressed the importance of style as a key area complementing document structure and content. A number of to...

Vincent Quint, Irène Vatton
