Sciweavers

161 search results - page 7 / 33
» Counting Distinct Strings
Sort
View
COLING
1996
13 years 10 months ago
The Automatic Extraction of Open Compounds from Text Corpora
This paper describes a new method for extracting open compounds (uninterrupted sequences of words) from text corpora of languages, such as Thai, Japanese and Korea that exhibit un...
Virach Sornlertlamvanich, Hozumi Tanaka
AMSTERDAM
2009
13 years 6 months ago
The Semantics of Count Nouns
This paper offers an account of the semantics of count nous. I show that neither the atomic/nonatomic distinction nor the homogeneous/non-homogeneous distinction is sufficient to ...
Susan Rothstein
ICDAR
1999
IEEE
14 years 25 days ago
Models and Algorithms for Duplicate Document Detection
This paper introduces a framework for clarifying and formalizing the duplicate document detection problem. Four distinct models are presented, each with a corresponding algorithm ...
Daniel P. Lopresti
JCT
2011
66views more  JCT 2011»
13 years 3 months ago
Enumeration of non-crossing pairings on bit strings
A non-crossing pairing on a bitstring matches 1s and 0s in a manner such that the pairing diagram is nonintersecting. By considering such pairings on arbitrary bitstrings 1n1 0m1 ....
Todd Kemp, Karl Mahlburg, Amarpreet Rattan, Cliffo...
TOIS
2002
97views more  TOIS 2002»
13 years 8 months ago
Burst tries: a fast, efficient data structure for string keys
Many applications depend on efficient management of large sets of distinct strings in memory. For example, during index construction for text databases a record is held for each d...
Steffen Heinz, Justin Zobel, Hugh E. Williams