Search Sciweavers | Sciweavers

DIS
2001
Springer

93 views, Theoretical Computer Science

Eliminating Useless Parts in Semi-structured Documents Using Alternation Counts

14 years 3 months ago

We propose a preprocessing method for Web mining which, given semi-structured documents with the same structure and style, distinguishes useless parts and non-useless parts in each...

Daisuke Ikeda, Yasuhiro Yamada, Sachio Hirokawa

ICDAR
1999
IEEE

140 views, Document Analysis

DjVu: Analyzing and Compressing Scanned Documents for Internet Distribution

14 years 3 months ago

Download yann.lecun.com

DjVu is an image compression technique specifically geared towards the compression of scanned documents in color at high resolution. Typical magazine pages in color scanned at 300...

Patrick Haffner, Léon Bottou, Paul G. Howar...

ESWS
2008
Springer

103 views, Internet Technology

Combining Fact and Document Retrieval with Spreading Activation for Semantic Desktop Search

14 years 17 days ago

Download www.dfki.uni-kl.de

The Semantic Desktop is a means to support users in Personal Information Management (PIM). It provides an excellent test bed for Semantic Web technology: resources (e. g....

Kinga Schumacher, Michael Sintek, Leo Sauermann

LREC
2008

113 views, Education

Integration of a Multilingual Keyword Extractor in a Document Management System

14 years 7 days ago

Download www.lrec-conf.org

In this paper we present a new Document Management System called DrStorage. This DMS is multi-platform, JCR-170 compliant, supports WebDAV, versioning, user authentication and aut...

Andrea Agili, Marco Fabbri, Alessandro Panunzi, Ma...

ACL
2003

123 views, Computational Linguistics

Orthogonal Negation in Vector Spaces for Modelling Word-Meanings and Document Retrieval

14 years 6 days ago

Download acl.ldc.upenn.edu

Standard IR systems can process queries such as “web NOT internet”, enabling users who are interested in arachnids to avoid documents about computing. The documents retrieved ...

Dominic Widdows
