Search Sciweavers | Sciweavers

121 search results - page 15 / 25

» Toward Off-Policy Learning Control with Function Approximati...

click to vote

NIPS
1996

192views Information Technology» more NIPS 1996»

Multidimensional Triangulation and Interpolation for Reinforcement Learning

13 years 10 months ago

Download www.cs.cmu.edu

Dynamic Programming, Q-learning and other discrete Markov Decision Process solvers can be applied to continuous d-dimensional state-spaces by quantizing the state space into an arr...

Scott Davies

claim paper

Read More »

click to vote

ACMSE
2007
ACM

151views Theoretical Computer Science» more ACMSE 2007»

BehaviorSim: towards an educational tool for behavior-based agent

14 years 1 months ago

Download www.cs.gsu.edu

A major paradigm of modeling the decision making of autonomous agents is through behavior-based network models. The network consists of distributed behaviors that compete (or coop...

Pavel Lakhtanau, Xiaolin Hu, Fasheng Qiu

claim paper

Read More »

click to vote

COLT
2003
Springer

119views Machine Learning» more COLT 2003»

Learning with Rigorous Support Vector Machines

14 years 2 months ago

Download www.cs.rpi.edu

We examine the so-called rigorous support vector machine (RSVM) approach proposed by Vapnik (1998). The formulation of RSVM is derived by explicitly implementing the structural ris...

Jinbo Bi, Vladimir Vapnik

claim paper

Read More »

click to vote

AAAI
2011

178views Intelligent Agents» more AAAI 2011»

Combining Learned Discrete and Continuous Action Models

12 years 9 months ago

Download www.eecs.umich.edu

Action modeling is an important skill for agents that must perform tasks in novel domains. Previous work on action modeling has focused on learning STRIPS operators in discrete, r...

Joseph Z. Xu, John E. Laird

claim paper

Read More »

click to vote

PKDD
2010
Springer

179views Data Mining» more PKDD 2010»

Gaussian Processes for Sample Efficient Reinforcement Learning with RMAX-Like Exploration

13 years 7 months ago

Download www.cs.utexas.edu

Abstract. We present an implementation of model-based online reinforcement learning (RL) for continuous domains with deterministic transitions that is specifically designed to achi...

Tobias Jung, Peter Stone

claim paper

Read More »

« Prev « First page 15 / 25 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers