Search Sciweavers | Sciweavers

93 search results - page 16 / 19

» Learning to overtake in TORCS using simple reinforcement lea...

click to vote

Publication

222views

Algorithms and Bounds for Rollout Sampling Approximate Policy Iteration

14 years 4 months ago

Download arxiv.org

Abstract: Several approximate policy iteration schemes without value functions, which focus on policy representation using classifiers and address policy learning as a supervis...

Christos Dimitrakakis, Michail G. Lagoudakis

posted by olethros

Read More »

click to vote

ICML
2009
IEEE

155views Machine Learning» more ICML 2009»

Near-Bayesian exploration in polynomial time

14 years 8 months ago

Download ai.stanford.edu

We consider the exploration/exploitation problem in reinforcement learning (RL). The Bayesian approach to model-based RL offers an elegant solution to this problem, by considering...

J. Zico Kolter, Andrew Y. Ng

claim paper

Read More »

click to vote

NIPS
1997

121views Information Technology» more NIPS 1997»

Generalized Prioritized Sweeping

13 years 9 months ago

Download www.cs.huji.ac.il

Prioritized sweeping is a model-based reinforcement learning method that attempts to focus an agent’s limited computational resources to achieve a good estimate of the value of ...

David Andre, Nir Friedman, Ronald Parr

claim paper

Read More »

click to vote

CAINE
2008

127views Computer Science» more CAINE 2008»

Scripted Artificially Intelligent Basic Online Tactical Simulation

13 years 9 months ago

Download www.cse.unr.edu

For many years, introductory Computer Science courses have followed the same teaching paradigms. These paradigms utilize only simple console windows; more interactive approaches t...

Jesse D. Phillips, Roger V. Hoang, Joseph D. Mahsm...

claim paper

Read More »

click to vote

BC
2008

134views more BC 2008»

Interacting with an artificial partner: modeling the role of emotional aspects

13 years 7 months ago

Download homes.dsi.unimi.it

In this paper we introduce a simple model based on probabilistic finite state automata to describe an emotional interaction between a robot and a human user, or between simulated a...

Isabella Cattinelli, Massimiliano Goldwurm, N. Alb...

claim paper

Read More »

« Prev « First page 16 / 19 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers