Sciweavers

259 search results - page 38 / 52
» Towards parameter-free data mining
Sort
View
EDBT
2009
ACM
277views Database» more  EDBT 2009»
14 years 5 days ago
G-hash: towards fast kernel-based similarity search in large graph databases
Structured data including sets, sequences, trees and graphs, pose significant challenges to fundamental aspects of data management such as efficient storage, indexing, and simila...
Xiaohong Wang, Aaron M. Smalter, Jun Huan, Gerald ...
ICDM
2009
IEEE
199views Data Mining» more  ICDM 2009»
14 years 2 months ago
Active Learning with Adaptive Heterogeneous Ensembles
—One common approach to active learning is to iteratively train a single classifier by choosing data points based on its uncertainty, but it is nontrivial to design uncertainty ...
Zhenyu Lu, Xindong Wu, Josh Bongard
ICDM
2008
IEEE
147views Data Mining» more  ICDM 2008»
14 years 2 months ago
Clustering Documents with Active Learning Using Wikipedia
Wikipedia has been applied as a background knowledge base to various text mining problems, but very few attempts have been made to utilize it for document clustering. In this pape...
Anna Huang, David N. Milne, Eibe Frank, Ian H. Wit...
CAISE
2007
Springer
14 years 1 months ago
Declarative XML Data Cleaning with XClean
Data cleaning is the process of correcting anomalies in a data source, that may for instance be due to typographical errors, or duplicate representations of an entity. It is a cruc...
Melanie Weis, Ioana Manolescu
DAWAK
2005
Springer
14 years 1 months ago
Event-Feeded Dimension Solution
Abstract. From the point of view of a data warehouse system its part of collecting and receiving information from other systems is crucial for all subsequent business intelligence ...
Tho Manh Nguyen, Jaromir Nemec, Martin Windisch