Search Sciweavers | Sciweavers

582 search results - page 22 / 117

» Gaussian Processes in Reinforcement Learning

CEC
2010
IEEE

Artificial Intelligence

Learning to overtake in TORCS using simple reinforcement learning

13 years 10 months ago

Download home.dei.polimi.it

In modern racing games, programming non-player characters with believable and sophisticated behaviors is getting increasingly challenging. Recently, several works in the literature ...

Daniele Loiacono, Alessandro Prete, Pier Luca Lanz...

SIGGRAPH
2010
ACM

Computer Graphics

Learning behavior styles with inverse reinforcement learning

14 years 2 months ago

Download grail.cs.washington.edu

We present a method for inferring the behavior styles of character controllers from a small set of examples. We show that a rich set of behavior variations can be captured by dete...

Seong Jae Lee, Zoran Popovic

COLT
2000
Springer

Machine Learning

Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning

14 years 2 months ago

Download www.cs.iastate.edu

We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process (POMDP), and focus on gradient ascent approache...

Peter L. Bartlett, Jonathan Baxter

ICML
2006
ACM

Machine Learning

PAC model-free reinforcement learning

14 years 10 months ago

Download cseweb.ucsd.edu

For a Markov Decision Process with finite state (size S) and action spaces (size A per state), we propose a new algorithm, Delayed Q-Learning. We prove it is PAC, achieving near o...

Alexander L. Strehl, Lihong Li, Eric Wiewiora, Joh...

IJON
2010

Multi-task preference learning with an application to hearing aid personalization

13 years 8 months ago

Download www.cs.ru.nl

We present an EM-algorithm for the problem of learning preferences with Gaussian processes in the context of multi-task learning. We validate our approach on an audiological data ...

Adriana Birlutiu, Perry Groot, Tom Heskes
