Sciweavers

CICLING
2001
Springer
14 years 5 months ago
Chi-Square Classifier for Document Categorization
The problem of document categorization is considered. The set of domains and the keywords specific to these domains are assumed to be selected beforehand as initial data. We apply...
Mikhail Alexandrov, Alexander F. Gelbukh, George L...
SAC
2009
ACM
14 years 5 months ago
Discovering XML keys and foreign keys in queries
XML has undoubtedly become a standard for data representation and manipulation. But most XML documents are still created without the respective description of their structu...
Martin Necaský, Irena Mlýnková...
IEEEICCI
2002
IEEE
14 years 5 months ago
An Agent-Assisted Document Storage for Software Process Environments
Traditional software process environments store documents using either a centralized or a distributed approach. With the assistance of a web agent, this paper presents a new document st...
Jason Jen-Yen Chen, Chun-Yi Lin
ICPPW
2002
IEEE
14 years 5 months ago
Hebbian Algorithms for a Digital Library Recommendation System
generally meta-data, so that documents on any specific subject can be transparently retrieved. While quality control can in principle still rely on the traditional methods of peer-...
Francis Heylighen, Johan Bollen
ICDM
2002
IEEE
14 years 5 months ago
Phrase-based Document Similarity Based on an Index Graph Model
Document clustering techniques mostly rely on single term analysis of the document data set, such as the Vector Space Model. To better capture the structure of documents, the unde...
Khaled M. Hammouda, Mohamed S. Kamel
WEBDB
2010
Springer
14 years 5 months ago
Reconciling two models of multihierarchical markup
For documents with complex or atypical annotations, multihierarchical structures play the role of the document tree in traditional XML documents. We define a model of overlapping...
Neil Moore
ISCIS
2003
Springer
14 years 5 months ago
Comparison of New Simple Weighting Functions for Web Documents against Existing Methods
Term weighting is one of the most important aspects of modern Web retrieval systems. The weight associated with a given term in a document shows the importance of the ter...
Byurhan Hyusein, Ahmed Patel, Ferad Zyulkyarov
DBPL
2003
Springer
14 years 5 months ago
Updates and Incremental Validation of XML Documents
We consider the incremental validation of updates on XML documents. When a valid XML document (i.e., one satisfying some constraints) is updated, it has to be verified that the n...
Béatrice Bouchou, Mirian Halfeld Ferrari Al...
CIKM
2003
ACM
14 years 5 months ago
Online duplicate document detection: signature reliability in a dynamic retrieval environment
As online document collections continue to expand, both on the Web and in proprietary environments, the need for duplicate detection becomes more critical. Few users wish to retri...
Jack G. Conrad, Xi S. Guo, Cindy P. Schriber
SIGIR
2003
ACM
14 years 5 months ago
Text categorization by boosting automatically extracted concepts
Term-based representations of documents have found widespread use in information retrieval. However, one of the main shortcomings of such methods is that they largely disregard le...
Lijuan Cai, Thomas Hofmann