Search Sciweavers | Sciweavers

109 search results - page 15 / 22

» Policy teaching through reward function learning

ECML 2004, Springer

Batch Reinforcement Learning with State Importance

16 years 3 days ago

Download www.research.rutgers.edu

Abstract. We investigate the problem of using function approximation in reinforcement learning where the agent’s policy is represented as a classifier mapping states to actions....

Lihong Li, Vadim Bulitko, Russell Greiner

ICRA 2007, IEEE

A formal framework for robot learning and control under model uncertainty

16 years 1 months ago

Download www.cs.mcgill.ca

While the Partially Observable Markov Decision Process (POMDP) provides a formal framework for the problem of robot control under uncertainty, it typically assumes a known and ...

Robin Jaulmes, Joelle Pineau, Doina Precup

KCAP 2009, ACM

Interactively shaping agents via human reinforcement: the TAMER framework

16 years 1 months ago

Download userweb.cs.utexas.edu

As computational learning agents move into domains that incur real costs (e.g., autonomous driving or financial investment), it will be necessary to learn good policies without n...

W. Bradley Knox, Peter Stone

CORR 2008, Springer

Algorithms for Dynamic Spectrum Access with Learning for Cognitive Radio

15 years 6 months ago

Download www.ifp.illinois.edu

We study the problem of dynamic spectrum sensing and access in cognitive radio systems as a partially observed Markov decision process (POMDP). A group of cognitive users cooperati...

Jayakrishnan Unnikrishnan, Venugopal V. Veeravalli

ICML 2010, IEEE

Feature Selection as a One-Player Game

15 years 7 months ago

Download www.lri.fr

This paper formalizes Feature Selection as a Reinforcement Learning problem, leading to a provably optimal though intractable selection policy. As a second contribution, this pape...

Romaric Gaudel, Michèle Sebag
