Search Sciweavers | Sciweavers

109 search results - page 17 / 22

» Policy teaching through reward function learning

184

Voted

ATAL
2009
Springer

135views Intelligent Agents» more ATAL 2009»

An empirical analysis of value function-based and policy search reinforcement learning

16 years 1 months ago

Download userweb.cs.utexas.edu

In several agent-oriented scenarios in the real world, an autonomous agent that is situated in an unknown environment must learn through a process of trial and error to take actio...

Shivaram Kalyanakrishnan, Peter Stone

claim paper

Read More »

203

click to vote

ICWL
2004
Springer

200views Internet Technology» more ICWL 2004»

Learning Algorithms with an Electronic Chalkboard over the Web

16 years 2 days ago

Download www.inf.fu-berlin.de

This paper describes a system for the animation of algorithms on an electronic chalkboard. The instructor teaching an algorithm can enter data directly through a drawing- the algor...

Margarita Esponda Argüero, Raúl Rojas

claim paper

Read More »

183

click to vote

ICML
2006
IEEE

144views Machine Learning» more ICML 2006»

Probabilistic inference for solving discrete and continuous state Markov Decision Processes

16 years 7 months ago

Download eprints.pascal-network.org

Inference in Markov Decision Processes has recently received interest as a means to infer goals of an observed action, policy recognition, and also as a tool to compute policies. ...

Marc Toussaint, Amos J. Storkey

claim paper

Read More »

156

click to vote

AAAI
2010

134views Intelligent Agents» more AAAI 2010»

Reinforcement Learning Via Practice and Critique Advice

15 years 8 months ago

Download web.engr.oregonstate.edu

We consider the problem of incorporating end-user advice into reinforcement learning (RL). In our setting, the learner alternates between practicing, where learning is based on ac...

Kshitij Judah, Saikat Roy, Alan Fern, Thomas G. Di...

claim paper

Read More »

176

click to vote

PKDD
2009
Springer

129views Data Mining» more PKDD 2009»

Considering Unseen States as Impossible in Factored Reinforcement Learning

16 years 1 months ago

Download www-desir.lip6.fr

Abstract. The Factored Markov Decision Process (FMDP) framework is a standard representation for sequential decision problems under uncertainty where the state is represented as a ...

Olga Kozlova, Olivier Sigaud, Pierre-Henri Wuillem...

claim paper

Read More »

« Prev « First page 17 / 22 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers