Search Sciweavers | Sciweavers

1768 search results - page 113 / 354

» Mining Very Large Databases

256

click to vote

KDD
2012
ACM

205views Data Mining» more KDD 2012»

Searching and mining trillions of time series subsequences under dynamic time warping

13 years 10 months ago

Download www.cs.ucr.edu

Most time series data mining algorithms use similarity search as a core subroutine, and thus the time taken for similarity search is the bottleneck for virtually all time series d...

Thanawin Rakthanmanon, Bilson J. L. Campana, Abdul...

claim paper

Read More »

318

click to vote

ICDE
2009
IEEE

121views Database» more ICDE 2009»

Large-Scale Deduplication with Constraints Using Dedupalog

16 years 9 months ago

Download pages.cs.wisc.edu

We present a declarative framework for collective deduplication of entity references in the presence of constraints. Constraints occur naturally in many data cleaning domains and c...

Arvind Arasu, Christopher Ré, Dan Suciu

claim paper

Read More »

169

click to vote

EUSFLAT
2007

105views Fuzzy Logic» more EUSFLAT 2007»

SPoID: Do Not Throw Meaningful Incomplete Sequences Away!

15 years 9 months ago

Download www.eusflat.org

Industrial databases often contain a large amount of unﬁlled information. During the knowledge discovery process one processing step is often necessary in order to remove these ...

Céline Fiot, Anne Laurent, Maguelonne Teiss...

claim paper

Read More »

196

click to vote

BIRTHDAY
2005
Springer

132views Applied Computing» more BIRTHDAY 2005»

Toward Automated Large-Scale Information Integration and Discovery

16 years 1 months ago

Download www.almaden.ibm.com

The high cost of data consolidation is the key market inhibitor to the adoption of traditional information integration and data warehousing solutions. In this paper, we outline a n...

Paul Brown, Peter J. Haas, Jussi Myllymaki, Hamid ...

claim paper

Read More »

281

click to vote

VLDB
2007
ACM

128views Database» more VLDB 2007»

Periscope/SQ: Interactive Exploration of Biological Sequence Databases

16 years 7 months ago

Download www.vldb.org

Life science laboratories today have to rely on procedural techniques to store and manage large sequence datasets. Procedural techniques are cumbersome to use and are often very i...

Sandeep Tata, Willis Lang, Jignesh M. Patel

claim paper

Read More »

« Prev « First page 113 / 354 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers