Search Sciweavers | Sciweavers

116

ICML
2008
IEEE

147views Machine Learning» more ICML 2008»

Apprenticeship learning using linear programming

16 years 5 months ago

In apprenticeship learning, the goal is to learn a policy in a Markov decision process that is at least as good as a policy demonstrated by an expert. The difficulty arises in tha...

Umar Syed, Michael H. Bowling, Robert E. Schapire

claim paper

Read More »

120

click to vote

ICML
2008
IEEE

135views Machine Learning» more ICML 2008»

Reinforcement learning with limited reinforcement: using Bayes risk for active learning in POMDPs

16 years 5 months ago

Download mapleleaf.csail.mit.edu

Partially Observable Markov Decision Processes (POMDPs) have succeeded in planning domains that require balancing actions that increase an agent's knowledge and actions that ...

Finale Doshi, Joelle Pineau, Nicholas Roy

claim paper

Read More »

135

click to vote

PKDD
2009
Springer

129views Data Mining» more PKDD 2009»

Considering Unseen States as Impossible in Factored Reinforcement Learning

15 years 10 months ago

Download www-desir.lip6.fr

Abstract. The Factored Markov Decision Process (FMDP) framework is a standard representation for sequential decision problems under uncertainty where the state is represented as a ...

Olga Kozlova, Olivier Sigaud, Pierre-Henri Wuillem...

claim paper

Read More »

93

click to vote

DIGITEL
2007
IEEE

81views Artificial Intelligence» more DIGITEL 2007»

Shadow Box: an interactive learning toy for children

15 years 10 months ago

Download synlab.gatech.edu

The Shadow Box is a tangible computing project that exploits visual association and auditory clues to teach children the representational relationship between words and their mean...

Ja-Young Sung, Aaron Levisohn, Ji-won Song, Ben To...

claim paper

Read More »

128

click to vote

ECML
2007
Springer

183views Machine Learning» more ECML 2007»

An Unsupervised Learning Algorithm for Rank Aggregation

15 years 10 months ago

Download l2r.cs.uiuc.edu

Many applications in information retrieval, natural language processing, data mining, and related ﬁelds require a ranking of instances with respect to a speciﬁed criteria as op...

Alexandre Klementiev, Dan Roth, Kevin Small

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers