Search Sciweavers | Sciweavers

1052 search results - page 55 / 211

» Improved CHAID algorithm for document structure modelling

198

click to vote

ICDAR
2009
IEEE

178views Document Analysis» more ICDAR 2009»

Text Lines and Snippets Extraction for 19th Century Handwriting Documents Layout Analysis

16 years 1 months ago

Download liris.cnrs.fr

In this paper we propose a new approach to improve electronic editions of human science corpus, providing an efﬁcient estimation of manuscripts pages structure. In any handwriti...

Vincent Malleron, Véronique Eglin, Hubert E...

claim paper

Read More »

133

click to vote

ACL
2008

106views Computational Linguistics» more ACL 2008»

Learning Bigrams from Unigrams

15 years 8 months ago

Download aclweb.org

Traditional wisdom holds that once documents are turned into bag-of-words (unigram count) vectors, word orders are completely lost. We introduce an approach that, perhaps surprisi...

Xiaojin Zhu, Andrew B. Goldberg, Michael Rabbat, R...

claim paper

Read More »

202

click to vote

ICDE
2005
IEEE

122views Database» more ICDE 2005»

Signature-based Filtering Techniques for Structural Joins of XML Data

16 years 18 days ago

Download web.mst.edu

Queries on XML documents typically combine selections on element contents, and, via path expressions, the structural relationships between tagged elements. Efﬁcient support for ...

Huan Huo, Guoren Wang, Chuan Yang, Rui Zhou

claim paper

Read More »

341

click to vote

ICDE
2003
IEEE

143views Database» more ICDE 2003»

Index-Based Approximate XML Joins

16 years 8 months ago

Download www.research.att.com

XML data integration tools are facing a variety of challenges for their efficient and effective operation. Among these is the requirement to handle a variety of inconsistencies or...

Sudipto Guha, Nick Koudas, Divesh Srivastava, Ting...

claim paper

Read More »

164

click to vote

IPM
2006

77views more IPM 2006»

A general matrix framework for modelling Information Retrieval

15 years 7 months ago

Download www.dcs.vein.hu

Content-oriented retrieval models are based on a document-term matrix, whereas link-oriented retrieval models are based on an adjacent (parentchild) matrix. Term frequency and inv...

Thomas Rölleke, Theodora Tsikrika, Gabriella ...

claim paper

Read More »

« Prev « First page 55 / 211 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers