Search Sciweavers | Sciweavers

215 search results - page 15 / 43

» Model-Based Reinforcement Learning with Continuous States an...

136

click to vote

ICMLA
2007

92views Machine Learning» more ICMLA 2007»

Control of a re-entrant line manufacturing model with a reinforcement learning approach

15 years 7 months ago

Download www.smitlab.uc.edu

This paper presents the application of a reinforcement learning (RL) approach for the near-optimal control of a re-entrant line manufacturing (RLM) model. The RL approach utilizes...

José A. Ramírez-Hernández, Em...

claim paper

Read More »

123

click to vote

AAAI
2007

68views Intelligent Agents» more AAAI 2007»

A Reinforcement Learning Algorithm with Polynomial Interaction Complexity for Only-Costly-Observable MDPs

15 years 8 months ago

Download www.aaai.org

An Unobservable MDP (UMDP) is a POMDP in which there are no observations. An Only-Costly-Observable MDP (OCOMDP) is a POMDP which extends an UMDP by allowing a particular costly a...

Roy Fox, Moshe Tennenholtz

claim paper

Read More »

171

click to vote

JCP
2008

139views more JCP 2008»

Agent Learning in Relational Domains based on Logical MDPs with Negation

15 years 6 months ago

Download www.academypublisher.com

In this paper, we propose a model named Logical Markov Decision Processes with Negation for Relational Reinforcement Learning for applying Reinforcement Learning algorithms on the ...

Song Zhiwei, Chen Xiaoping, Cong Shuang

claim paper

Read More »

154

click to vote

ICML
2004
IEEE

167views Machine Learning» more ICML 2004»

Bellman goes relational

16 years 6 months ago

Download people.csail.mit.edu

Motivated by the interest in relational reinforcement learning, we introduce a novel relational Bellman update operator called ReBel. It employs a constraint logic programming lan...

Kristian Kersting, Martijn Van Otterlo, Luc De Rae...

claim paper

Read More »

164

click to vote

IWANN
1999
Springer

115views Neural Networks» more IWANN 1999»

Using Temporal Neighborhoods to Adapt Function Approximators in Reinforcement Learning

15 years 10 months ago

Download www.cs.colostate.edu

To avoid the curse of dimensionality, function approximators are used in reinforcement learning to learn value functions for individual states. In order to make better use of comp...

R. Matthew Kretchmar, Charles W. Anderson

claim paper

Read More »

« Prev « First page 15 / 43 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers