document | Sciweavers

162

CICLING
2001
Springer

119views Natural Language Processing» more CICLING 2001»

Chi-Square Classifier for Document Categorization

15 years 11 months ago

The problem of document categorization is considered. The set of domains and the keywords specific for these domains is supposed to be selected beforehand as initial data. We apply...

Mikhail Alexandrov, Alexander F. Gelbukh, George L...

claim paper

Read More »

177

click to vote

SAC
2009
ACM

123views Applied Computing» more SAC 2009»

Discovering XML keys and foreign keys in queries

15 years 11 months ago

Download www.ksi.mff.cuni.cz

The XML has undoubtedly become a standard for data representation and manipulation. But most of XML documents are still created without the respective description of their structu...

Martin Necaský, Irena Mlýnková...

claim paper

Read More »

176

click to vote

IEEEICCI
2002
IEEE

97views Artificial Intelligence» more IEEEICCI 2002»

An Agent-Assisted Document Storage for Software Process Environments

15 years 11 months ago

Download www.semgrid.net

Traditional software process environment stores documents using either centralized or distributed approach. With the assistance of web agent, this paper presents a new document st...

Jason Jen-Yen Chen, Chun-Yi Lin

claim paper

Read More »

165

click to vote

ICPPW
2002
IEEE

168views Distributed And Parallel Com...» more ICPPW 2002»

Hebbian Algorithms for a Digital Library Recommendation System

15 years 11 months ago

Download pespmc1.vub.ac.be

generally meta-data, so that documents on any specific subject can be transparently retrieved. While quality control can in principle still rely on the traditional methods of peer-...

Francis Heylighen, Johan Bollen

claim paper

Read More »

170

click to vote

ICDM
2002
IEEE

162views Data Mining» more ICDM 2002»

Phrase-based Document Similarity Based on an Index Graph Model

15 years 11 months ago

Download pami.uwaterloo.ca

Document clustering techniques mostly rely on single term analysis of the document data set, such as the Vector Space Model. To better capture the structure of documents, the unde...

Khaled M. Hammouda, Mohamed S. Kamel

claim paper

Read More »

156

click to vote

WEBDB
2010
Springer

152views Database» more WEBDB 2010»

Reconciling two models of multihierarchical markup

15 years 11 months ago

Download webdb2010.org

For documents with complex or atypical annotations, multihierarchical structures play the role of the document tree in traditional XML documents. We deﬁne a model of overlapping...

Neil Moore

claim paper

Read More »

152

click to vote

ISCIS
2003
Springer

127views Information Technology» more ISCIS 2003»

Comparison of New Simple Weighting Functions for Web Documents against Existing Methods

15 years 11 months ago

Download www.feradz.com

Abstract. Term weighting is one of the most important aspects of modern Web retrieval systems. The weight associated with a given term in a document shows the importance of the ter...

Byurhan Hyusein, Ahmed Patel, Ferad Zyulkyarov

claim paper

Read More »

177

click to vote

DBPL
2003
Springer

73views Database» more DBPL 2003»

Updates and Incremental Validation of XML Documents

15 years 11 months ago

Download www.info.univ-tours.fr

We consider the incremental validation of updates on XML documents. When a valid XML document (i.e., one satisfying some constraints) is updated, it has to be veriﬁed that the n...

Béatrice Bouchou, Mirian Halfeld Ferrari Al...

claim paper

Read More »

167

click to vote

CIKM
2003
Springer

130views Information Technology» more CIKM 2003»

Online duplicate document detection: signature reliability in a dynamic retrieval environment

15 years 11 months ago

Download www.conradweb.org

As online document collections continue to expand, both on the Web and in proprietary environments, the need for duplicate detection becomes more critical. Few users wish to retri...

Jack G. Conrad, Xi S. Guo, Cindy P. Schriber

claim paper

Read More »

202

click to vote

SIGIR
2003
ACM

147views Information Technology» more SIGIR 2003»

Text categorization by boosting automatically extracted concepts

15 years 11 months ago

Download www.cs.brown.edu

Term-based representations of documents have found widespread use in information retrieval. However, one of the main shortcomings of such methods is that they largely disregard le...

Lijuan Cai, Thomas Hofmann

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers