Sciweavers

DELOS
2000
14 years 24 days ago
Intelligent Information Retrieval in a Digital Library Service
We present the Private Digital Library (PDL) project that represents a service of the Corporate Digital Library (CDL) prototype. The main ideas underlying this project are the foll...
Giovanni Semeraro, Fabio Abbattista, Nicola Fanizz...
WEBNET
2001
14 years 25 days ago
Deriving Context Specific Information on the Web
The Web is huge, unstructured and diverse in quality, which makes searching for information difficult. In practice, few of the documents returned by a search engine are valuable ...
Christo Dichev, Darina Dicheva
SSWMC
2004
14 years 25 days ago
Signature-embedding in printed documents for security and forensic applications
Despite the increase in email and other forms of digital communication, the use of printed documents continues to increase every year. Many types of printed documents need to be ...
Aravind K. Mikkilineni, Gazi N. Ali, Pei-Ju Chiang...
IJCAI
2003
14 years 25 days ago
Learning to Classify Texts Using Positive and Unlabeled Data
In traditional text classification, a classifier is built using labeled training documents of every class. This paper studies a different problem. Given a set P of documents of a ...
Xiaoli Li, Bing Liu
IJCAI
2003
14 years 25 days ago
A semantic framework for multimedia document adaptation
With the proliferation of heterogeneous devices (desktop computers, personal digital assistants, phones), multimedia documents must be played under various constraints (small scre...
Jérôme Euzenat, Nabil Layaïda, V...
IIS
2003
14 years 25 days ago
Adaptive Translation between User's Vocabulary and Internet Queries
The paper starts with a short overview of areas of application for user profiles. Subsequently a method to represent a user profile in the field of document retrieval by using que...
Agnieszka Indyka-Piasecka, Maciej Piasecki
SDM
2004
SIAM
14 years 25 days ago
Classifying Documents Without Labels
Automatic classification of documents is an important area of research with many applications in the fields of document searching, forensics and others. Methods to perform classif...
Daniel Barbará, Carlotta Domeniconi, Ning K...
PICS
2001
14 years 25 days ago
Reduction of Bleed-through in Scanned Manuscript Documents
Many old manuscript documents were written on both sides of the paper, and the bleed-through from one side of the document to the other increases the difficulty in reading or deci...
Eric Dubois, Anita Pathak
ECIR
2003
Springer
14 years 25 days ago
Hierarchical Classification of HTML Documents with WebClassII
This paper describes a new method for the classification of an HTML document into a hierarchy of categories. The hierarchy of categories is involved in all phases of automated docum...
Michelangelo Ceci, Donato Malerba
FLAIRS
2006
14 years 25 days ago
Corpus Based Unsupervised Labeling of Documents
Text categorization involves mapping of documents to a fixed set of labels. A similar but equally important problem is that of assigning labels to large corpora. With a deluge of ...
Delip Rao, Deepak P, Deepak Khemani