Search Sciweavers | Sciweavers

75 search results - page 3 / 15

» Reinforcement Learning for MDPs with Constraints


AAAI
2007

Intelligent Agents

A Reinforcement Learning Algorithm with Polynomial Interaction Complexity for Only-Costly-Observable MDPs

15 years 9 months ago

Download www.aaai.org

An Unobservable MDP (UMDP) is a POMDP in which there are no observations. An Only-Costly-Observable MDP (OCOMDP) is a POMDP which extends an UMDP by allowing a particular costly a...

Roy Fox, Moshe Tennenholtz



ICML
2002
IEEE

Machine Learning

Algorithm-Directed Exploration for Model-Based Reinforcement Learning in Factored MDPs

16 years 8 months ago

Download select.cs.cmu.edu

One of the central challenges in reinforcement learning is to balance the exploration/exploitation tradeoff while scaling up to large problems. Although model-based reinforcement ...

Carlos Guestrin, Relu Patrascu, Dale Schuurmans



ICAART
2010
INSTICC

Intelligent Agents

Exploiting Similarity Information in Reinforcement Learning - Similarity Models for Multi-Armed Bandits and MDPs

16 years 4 months ago

Download personal.unileoben.ac.at

Ronald Ortner



ICML
2003
IEEE

Machine Learning

The Influence of Reward on the Speed of Reinforcement Learning: An Analysis of Shaping

16 years 22 days ago

Download www.hpl.hp.com

Shaping can be an effective method for improving the learning rate in reinforcement systems. Previously, shaping has been heuristically motivated and implemented. We provide a for...

Adam Laud, Gerald DeJong



JCP
2008


Agent Learning in Relational Domains based on Logical MDPs with Negation

15 years 7 months ago

Download www.academypublisher.com

In this paper, we propose a model named Logical Markov Decision Processes with Negation for Relational Reinforcement Learning for applying Reinforcement Learning algorithms on the ...

Song Zhiwei, Chen Xiaoping, Cong Shuang

