Yuri A. Ivanov, Bruce Blumberg, Alex Pentland

We call data weakly labeled if it carries no exact label but rather a numerical indication of the correctness of a label "guessed" by the learning algorithm, a situation commonly encountered in reinforcement learning. The term emphasizes the similarity of our approach to known techniques for solving unsupervised and transductive problems. In this paper we present an on-line algorithm that casts the problem as a multi-armed bandit with hidden state and solves it iteratively within the Expectation-Maximization framework. The hidden state is represented by a parameterized probability distribution over states tied to the reward. The parameterization is formally justified and allows for smooth blending between likelihood- and reward-based costs.
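Since the abstract only outlines the method, the following is a minimal illustrative sketch, not the authors' algorithm: it shows one on-line EM-style update for a bandit whose hidden state (the unknown true label) receives a posterior computed from a blend of model likelihood and a scalar reward attached to the guessed label. All names (`e_step`, `m_step`, the blend weight `alpha`) and the exponential tilting rule are assumptions made for illustration.

```python
import numpy as np

# Illustrative sketch only: an online EM-flavored update for a bandit with
# hidden state, where a scalar reward (the "weak label") tilts the posterior
# over states. The blending rule and all parameter names are assumptions,
# not the formulation from the paper.

rng = np.random.default_rng(0)

n_states = 3        # possible hidden labels (bandit arms)
n_features = 4
means = rng.normal(size=(n_states, n_features))  # per-state Gaussian means
prior = np.full(n_states, 1.0 / n_states)        # p(state)
alpha = 0.5         # blend weight: 0 = pure likelihood, 1 = pure reward
lr = 0.1            # step size for the online M-step

def likelihood(x):
    """Unnormalized isotropic-Gaussian likelihood of x under each state."""
    d2 = ((x - means) ** 2).sum(axis=1)
    return np.exp(-0.5 * d2)

def e_step(x, reward, guess):
    """Posterior over hidden states, tilted toward or away from the guessed
    label according to the scalar reward (illustrative rule)."""
    lik = likelihood(x) * prior
    tilt = np.ones(n_states)
    tilt[guess] = np.exp(reward)          # reinforce or suppress the guess
    post = lik ** (1 - alpha) * tilt ** alpha
    return post / post.sum()

def m_step(x, resp):
    """Online update of the per-state means, weighted by responsibilities."""
    global means
    means += lr * resp[:, None] * (x - means)

# One interaction: observe x, "guess" an arm, receive a scalar reward
# indicating how correct the guess was, then update the model.
x = rng.normal(size=n_features)
guess = int(np.argmax(likelihood(x) * prior))
reward = 1.0 if guess == 0 else -1.0      # stand-in for the environment's signal
resp = e_step(x, reward, guess)
m_step(x, resp)
print("responsibilities:", np.round(resp, 3))
```

Setting `alpha` to 0 recovers a purely likelihood-driven (unsupervised) update, while `alpha` near 1 makes the update reward-driven, mirroring the smooth blending between likelihood- and reward-based costs described above.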