Search Sciweavers | Sciweavers

1234 search results - page 7 / 247

» Multi-criteria Reinforcement Learning

ICML
1998
IEEE

202 views · Machine Learning

Learning to Drive a Bicycle Using Reinforcement Learning and Shaping

15 years 11 months ago

Download www.cs.mcgill.ca

We present and solve a real-world problem of learning to drive a bicycle. We solve the problem by online reinforcement learning using the Sarsa(λ)-algorithm. Then we solve the ...

Jette Randløv, Preben Alstrøm

JMLR
2002

125 views

Lyapunov Design for Safe Reinforcement Learning

15 years 7 months ago

Download www-anw.cs.umass.edu

Lyapunov design methods are used widely in control engineering to design controllers that achieve qualitative objectives, such as stabilizing a system or maintaining a system'...

Theodore J. Perkins, Andrew G. Barto

ECML
2003
Springer

118 views · Machine Learning

A New Way to Introduce Knowledge into Reinforcement Learning

16 years 19 days ago

Download www.irisa.fr

We present in this paper a method to introduce a priori knowledge into reinforcement learning using temporally extended actions. The aim of our work is to reduce the learning time ...

Pascal Garcia

ML
2002
ACM

114 views · Machine Learning

Building a Basic Block Instruction Scheduler with Reinforcement Learning and Rollouts

15 years 7 months ago

Download www.cs.ou.edu

The execution order of a block of computer instructions on a pipelined machine can make a difference in running time by a factor of two or more. Compilers use heuristic schedulers...

Amy McGovern, J. Eliot B. Moss, Andrew G. Barto

NECO
2010

103 views

Posterior Weighted Reinforcement Learning with State Uncertainty

15 years 5 months ago

Download www.maths.bris.ac.uk

Reinforcement learning models generally assume that a stimulus is presented that allows a learner to unambiguously identify the state of nature, and the reward received is drawn f...

Tobias Larsen, David S. Leslie, Edmund J. Collins,...
