Sciweavers

2190 search results - page 150 / 438
» Unweaving a web of documents
DIS
2001
Springer
14 years 3 months ago
Eliminating Useless Parts in Semi-structured Documents Using Alternation Counts
We propose a preprocessing method for Web mining which, given semi-structured documents with the same structure and style, distinguishes useless parts and non-useless parts in each...
Daisuke Ikeda, Yasuhiro Yamada, Sachio Hirokawa
ICDAR
1999
IEEE
14 years 3 months ago
DjVu: Analyzing and Compressing Scanned Documents for Internet Distribution
DjVu is an image compression technique specifically geared towards the compression of scanned documents in color at high resolution. Typical magazine pages in color scanned at 300...
Patrick Haffner, Léon Bottou, Paul G. Howar...
ESWS
2008
Springer
14 years 17 days ago
Combining Fact and Document Retrieval with Spreading Activation for Semantic Desktop Search
The Semantic Desktop is a means to support users in Personal Information Management (PIM). It provides an excellent test bed for Semantic Web technology: resources (e.g. ...
Kinga Schumacher, Michael Sintek, Leo Sauermann
LREC
2008
14 years 7 days ago
Integration of a Multilingual Keyword Extractor in a Document Management System
In this paper we present a new Document Management System called DrStorage. This DMS is multi-platform, JCR-170 compliant, supports WebDAV, versioning, user authentication and aut...
Andrea Agili, Marco Fabbri, Alessandro Panunzi, Ma...
ACL
2003
14 years 6 days ago
Orthogonal Negation in Vector Spaces for Modelling Word-Meanings and Document Retrieval
Standard IR systems can process queries such as “web NOT internet”, enabling users who are interested in arachnids to avoid documents about computing. The documents retrieved ...
Dominic Widdows