Leveraging Common Structure to Improve Prediction across Related Datasets

9 years 1 days ago

Download www.cs.cmu.edu

In many applications, training data is provided in the form of related datasets obtained from several sources, which typically affects the sample distribution. The learned classiﬁcation models, which are expected to perform well on similar data coming from new sources, often suffer due to bias introduced by what we call ‘spurious’ samples – those due to source characteristics and not representative of any other part of the data. As standard outlier detection and robust classiﬁcation usually fall short of determining groups of spurious samples, we propose a procedure which identiﬁes the common structure across datasets by minimizing a multi-dataset divergence metric, increasing accuracy for new datasets. Problem statement Often, the data available for learning is collected from different sources, making it likely that the differences between these groups break typical assumptions such as the samples being independent and identically distributed. It is often the case that da...

Matt Barnes, Nick Gisolfi, Madalina Fiterau, Artur

Real-time Traffic

AAAI 2015 | Intelligent Agents |

claim paper

Post Info
More Details (n/a)

Added	27 Mar 2016
Updated	27 Mar 2016
Type	Journal
Year	2015
Where	AAAI
Authors	Matt Barnes, Nick Gisolfi, Madalina Fiterau, Artur Dubrawski

Comments (0)

Sciweavers

Leveraging Common Structure to Improve Prediction across Related Datasets

AAAI 2015 | Intelligent Agents |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers