Sciweavers

1756 search results - page 264 / 352
» Mining Query Logs
Sort
View
SODA
2012
ACM
174views Algorithms» more  SODA 2012»
12 years 14 days ago
Using hashing to solve the dictionary problem
We consider the dictionary problem in external memory and improve the update time of the wellknown buffer tree by roughly a logarithmic factor. For any λ ≥ max{lg lg n, logM/B(...
John Iacono, Mihai Patrascu
ADC
2005
Springer
122views Database» more  ADC 2005»
14 years 3 months ago
Finding Similarity in Time Series Data by Method of Time Weighted Moments
Similarity search in time series data is an active area of research in data mining. In this paper we introduce a new approach for performing similarity search over time series dat...
Durga Toshniwal, Ramesh C. Joshi
CIKM
1999
Springer
14 years 2 months ago
Automatically Extracting Structure and Data from Business Reports
A considerable amount of clean semistructured data is internally available to companies in the form of business reports. However, business reports are untapped for data mining, da...
Stephen W. Liddle, Douglas M. Campbell, Chad Crawf...
ANLP
2000
163views more  ANLP 2000»
13 years 11 months ago
Automatic construction of parallel English-Chinese corpus for cross-language information retrieval
A major obstacle to the construction of a probabilistic translation model is the lack of large parallel corpora. In this paper we first describe a parallel text mining system that...
Jiang Chen, Jian-Yun Nie
KDD
2009
ACM
227views Data Mining» more  KDD 2009»
14 years 10 months ago
Efficiently learning the accuracy of labeling sources for selective sampling
Many scalable data mining tasks rely on active learning to provide the most useful accurately labeled instances. However, what if there are multiple labeling sources (`oracles...
Pinar Donmez, Jaime G. Carbonell, Jeff Schneider