Sciweavers

SIGIR
2008
ACM
13 years 11 months ago
A bayesian logistic regression model for active relevance feedback
Relevance feedback, which traditionally uses the terms in the relevant documents to enrich the user's initial query, is an effective method for improving retrieval performanc...
Zuobing Xu, Ram Akella
MTA
2006
122views more  MTA 2006»
13 years 11 months ago
Context-aware design of adaptable multimodal documents
In this paper we present a model and an adaptation architecture for context-aware multimodal documents. A compound virtual document describes the different ways in which multimodal...
Augusto Celentano, Ombretta Gaggi
ML
2006
ACM
13 years 11 months ago
XRules: An effective algorithm for structural classification of XML data
Abstract XML documents have recently become ubiquitous because of their varied applicability in a number of applications. Classification is an important problem in the data mining ...
Mohammed Javeed Zaki, Charu C. Aggarwal
KAIS
2006
102views more  KAIS 2006»
13 years 11 months ago
Visual information extraction
Typographic and visual information is an integral part of textual documents. Most information extraction systems ignore most of this visual information, processing the text as a l...
Yonatan Aumann, Ronen Feldman, Yair Liberzon, Biny...
KAIS
2006
247views more  KAIS 2006»
13 years 11 months ago
XCQ: A queriable XML compression system
XML has already become the de facto standard for specifying and exchanging data on the Web. However, XML is by nature verbose and thus XML documents are usually large in size, a fa...
Wilfred Ng, Wai Yeung Lam, Peter T. Wood, Mark Lev...
KES
2008
Springer
13 years 11 months ago
Data Mining for Navigation Generating System with Unorganized Web Resources
Users prefer to navigate subjects from organized topics in an abundance resources than to list pages retrieved from search engines. We propose a framework to cluster frequent items...
Diana Purwitasari, Yasuhisa Okazaki, Kenzi Watanab...
JUCS
2008
130views more  JUCS 2008»
13 years 11 months ago
Feature Selection for the Classification of Large Document Collections
: Feature selection methods are often applied in the context of document classification. They are particularly important for processing large data sets that may contain millions of...
Janez Brank, Dunja Mladenic, Marko Grobelnik, Nata...
JUCS
2008
138views more  JUCS 2008»
13 years 11 months ago
A New and Efficient Algorithm to Binarize Document Images Removing Back-to-Front Interference
: "Back-to-front interference", "bleeding" and "show-through" is the name given to the phenomenon found whenever documents are written on both sides o...
João Marcelo Monte da Silva, Rafael Dueire ...
JUCS
2008
118views more  JUCS 2008»
13 years 11 months ago
Detailing a Quantitative Method for Assessing Algorithms to Remove Back-to-Front Interference in Documents
: Documents written on both sides on translucent paper make visible the ink from one side on the other. This artefact is called "back-to-front interference", "bleedi...
Rafael Dueire Lins, João Marcelo Monte da S...
JTAER
2008
100views more  JTAER 2008»
13 years 11 months ago
Securing Uniqueness of Rights e-Documents: A Deontic Process Perspective
We typically think of documents as carrying information. However, certain kinds of documents do more than that: they are not only informative but also performative in that they re...
Ronald M. Lee, Vu Nguyen, Anastasia Pagnoni