Search Sciweavers | Sciweavers

165 search results - page 13 / 33

» Exploration and apprenticeship learning in reinforcement lea...

click to vote

EWRL
2008

186views Machine Learning» more EWRL 2008»

Efficient Reinforcement Learning in Parameterized Models: Discrete Parameter Case

13 years 9 months ago

Download webee.technion.ac.il

We consider reinforcement learning in the parameterized setup, where the model is known to belong to a parameterized family of Markov Decision Processes (MDPs). We further impose ...

Kirill Dyagilev, Shie Mannor, Nahum Shimkin

claim paper

Read More »

click to vote

NN
2002
Springer

113views Neural Networks» more NN 2002»

Control of exploitation-exploration meta-parameter in reinforcement learning

13 years 7 months ago

Download www.fil.ion.ucl.ac.uk

In reinforcement learning (RL), the duality between exploitation and exploration has long been an important issue. This paper presents a new method that controls the balance betwe...

Shin Ishii, Wako Yoshida, Junichiro Yoshimoto

claim paper

Read More »

click to vote

ML
1998
ACM

148views Machine Learning» more ML 1998»

Colearning in Differential Games

13 years 7 months ago

Download www.cs.jhu.edu

Game playing has been a popular problem area for research in artiﬁcial intelligence and machine learning for many years. In almost every study of game playing and machine learnin...

John W. Sheppard

claim paper

Read More »

click to vote

AUSAI
2005
Springer

123views Artificial Intelligence» more AUSAI 2005»

Global Versus Local Constructive Function Approximation for On-Line Reinforcement Learning

14 years 1 months ago

Download eprints.utas.edu.au

: In order to scale to problems with large or continuous state-spaces, reinforcement learning algorithms need to be combined with function approximation techniques. The majority of...

Peter Vamplew, Robert Ollington

claim paper

Read More »

click to vote

ICML
2006
IEEE

101views Machine Learning» more ICML 2006»

Qualitative reinforcement learning

14 years 8 months ago

Download www.cs.uiuc.edu

When the transition probabilities and rewards of a Markov Decision Process are specified exactly, the problem can be solved without any interaction with the environment. When no s...

Arkady Epshteyn, Gerald DeJong

claim paper

Read More »

« Prev « First page 13 / 33 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers