Sciweavers


Publication

233views

14 years 5 months ago

We introduce a class of learning problems where the agent is presented with a series of tasks. Intuitively, if there is a relation among those tasks, then the information gained duri...

Christos Dimitrakakis

posted by olethros



Publication

154views

Preference elicitation and inverse reinforcement learning

14 years 9 months ago

Download arxiv.org

We state the problem of inverse reinforcement learning in terms of preference elicitation, resulting in a principled (Bayesian) statistical formulation. This generalises previous w...

Constantin Rothkopf, Christos Dimitrakakis

posted by olethros



EJASMP
2011

291views Applied Computing

Phoneme and Sentence-Level Ensembles for Speech Recognition

14 years 10 months ago

Download bengio.abracadoudou.com

We address the question of whether and how boosting and bagging can be used for speech recognition. In order to do this, we compare two different boosting schemes, one at the pho...

Christos Dimitrakakis, Samy Bengio

posted by olethros



JMLR
2010

175views

Bayesian variable order Markov models

15 years 1 month ago

Download jmlr.csail.mit.edu

Christos Dimitrakakis

posted by olethros



CORR
2006
Springer

140views Education

Nearly optimal exploration-exploitation decision thresholds

15 years 7 months ago

Download www.idiap.ch

While in general trading off exploration and exploitation in reinforcement learning is hard, under some formulations relatively simple solutions exist. Optimal decision thresholds ...

Christos Dimitrakakis

posted by olethros

