Search Sciweavers | Sciweavers

21

PRIMA
2009
Springer

102views Intelligent Agents» more PRIMA 2009»

Recursive Adaptation of Stepsize Parameter for Non-stationary Environments

14 years 2 months ago

In this article, we propose a method to adapt stepsize parameters used in reinforcement learning for dynamic environments. In general reinforcement learning situations, a stepsize...

Itsuki Noda

claim paper

Read More »

25

click to vote

AAAI
2010

173views Intelligent Agents» more AAAI 2010»

Integrating Sample-Based Planning and Model-Based Reinforcement Learning

13 years 9 months ago

Download paul.rutgers.edu

Recent advancements in model-based reinforcement learning have shown that the dynamics of many structured domains (e.g. DBNs) can be learned with tractable sample complexity, desp...

Thomas J. Walsh, Sergiu Goschin, Michael L. Littma...

claim paper

Read More »

24

click to vote

AR
2007

105views more AR 2007»

Reinforcement learning of a continuous motor sequence with hidden states

13 years 7 months ago

Download www.bdc.brain.riken.go.jp

—Reinforcement learning is the scheme for unsupervised learning in which robots are expected to acquire behavior skills through self-explorations based on reward signals. There a...

Hiroaki Arie, Tetsuya Ogata, Jun Tani, Shigeki Sug...

claim paper

Read More »

18

click to vote

NIPS
1998

88views Information Technology» more NIPS 1998»

Scheduling Straight-Line Code Using Reinforcement Learning and Rollouts

13 years 8 months ago

Download www.cs.ou.edu

The execution order of a block of computer instructions can make a difference in its running time by a factor of two or more. In order to achieve the best possible speed, compiler...

Amy McGovern, J. Eliot B. Moss

claim paper

Read More »

25

click to vote

ATAL
2007
Springer

155views Intelligent Agents» more ATAL 2007»

Reinforcement learning in extensive form games with incomplete information: the bargaining case study

14 years 1 months ago

Download home.dei.polimi.it

We consider the problem of ﬁnding optimal strategies in inﬁnite extensive form games with incomplete information that are repeatedly played. This problem is still open in lite...

Alessandro Lazaric, Jose Enrique Munoz de Cote, Ni...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers