Two-level clustering approach to training data instance selection: A case study for the steel industry

16 years 1 months ago

Download www.ee.oulu.fi

— Nowadays, huge amounts of information from different industrial processes are stored into databases and companies can improve their production efﬁciency by mining some new knowledge from this information. However, when these databases becomes too large, it is not efﬁcient to process all the available data with practical data mining applications. As a solution, different approaches for intelligent selection of training data for model ﬁtting have to be developed. In this article, training instances are selected to ﬁt predictive regression models developed for optimization of the steel manufacturing process settings beforehand, and the selection is approached from a clustering point of view. Because basic k-means clustering was found to consume too much time and memory for the purpose, a new algorithm was developed to divide the data coarsely, after which k-means clustering could be performed. The instances were selected using the cluster structure by weighting more the observ...

Heli Koskimäki, Ilmari Juutilainen, Perttu La

Real-time Traffic

Artificial Intelligence | IJCNN 2008 | K-means Clustering | Practical Data Mining | Predictive Regression Models |

claim paper

Added	31 May 2010
Updated	31 May 2010
Type	Conference
Year	2008
Where	IJCNN
Authors	Heli Koskimäki, Ilmari Juutilainen, Perttu Laurinen, Juha Röning

Sciweavers

Two-level clustering approach to training data instance selection: A case study for the steel industry

Artificial Intelligence | IJCNN 2008 | K-means Clustering | Practical Data Mining | Predictive Regression Models |

Explore & Download

Productivity Tools

Sciweavers