Sciweavers

588 search results - page 100 / 118
» Discovering data quality rules
Sort
View
BPM
2008
Springer
192views Business» more  BPM 2008»
13 years 9 months ago
Trace Clustering in Process Mining
Process mining has proven to be a valuable tool for analyzing operational process executions based on event logs. Existing techniques perform well on structured processes, but stil...
Minseok Song, Christian W. Günther, Wil M. P....
DKE
2007
199views more  DKE 2007»
13 years 7 months ago
QMatch - Using paths to match XML schemas
Integration of multiple heterogeneous data sources continues to be a critical problem for many application domains and a challenge for researchers world-wide. With the increasing ...
Naiyana Tansalarak, Kajal T. Claypool
PRL
2010
130views more  PRL 2010»
13 years 6 months ago
Automatic configuration of spectral dimensionality reduction methods
In this paper, our main contribution is a framework for the automatic configuration of any spectral dimensionality reduction methods. This is achieved, first, by introducing the m...
Michal Lewandowski, Dimitrios Makris, Jean-Christo...
EMNLP
2010
13 years 5 months ago
Evaluating Models of Latent Document Semantics in the Presence of OCR Errors
Models of latent document semantics such as the mixture of multinomials model and Latent Dirichlet Allocation have received substantial attention for their ability to discover top...
Daniel David Walker, William B. Lund, Eric K. Ring...
ACL
2009
13 years 5 months ago
Employing Topic Models for Pattern-based Semantic Class Discovery
A semantic class is a collection of items (words or phrases) which have semantically peer or sibling relationship. This paper studies the employment of topic models to automatical...
Huibin Zhang, Mingjie Zhu, Shuming Shi, Ji-Rong We...