Sciweavers

186 search results - page 15 / 38
» Process Knowledge and Data Quality Outcomes
GECCO
2008
Springer
13 years 9 months ago
Informative sampling for large unbalanced data sets
Selective sampling is a form of active learning which can reduce the cost of training by only drawing informative data points into the training set. This selected training set is ...
Zhenyu Lu, Anand I. Rughani, Bruce I. Tranmer, Jos...
IPPS
2010
IEEE
13 years 5 months ago
Large-scale multi-dimensional document clustering on GPU clusters
Document clustering plays an important role in data mining systems. Recently, a flocking-based document clustering algorithm has been proposed to solve the problem through simulat...
Yongpeng Zhang, Frank Mueller, Xiaohui Cui, Thomas...
ICMLC
2010
Springer
13 years 5 months ago
Data mining model in analyzing Portuguese studies as the second language acquisition
Portuguese is specifically a difficult language with luxuriant tenses, and second language acquisition (SLA) is regarded as highly variable. Many (Chinese) students who learn Por...
Sam Chao, Fai Wong, Custódio Cavaco Martins
ISIPTA
2005
IEEE
14 years 1 months ago
Conservative Rules for Predictive Inference with Incomplete Data
This paper addresses the following question: how should we update our beliefs after observing some incomplete data, in order to make credible predictions about new, and possibly i...
Marco Zaffalon
ECML
2006
Springer
13 years 11 months ago
Unsupervised Multiple-Instance Learning for Functional Profiling of Genomic Data
Multiple-instance learning (MIL) is a popular concept among the AI community to support supervised learning applications in situations where only incomplete knowledge is available....
Corneliu Henegar, Karine Clément, Jean-Dani...