Search Sciweavers | Sciweavers

651 search results - page 72 / 131

» Algorithms for Inverse Reinforcement Learning

click to vote

ICML
2005
IEEE

137views Machine Learning» more ICML 2005»

Learning to compete, compromise, and cooperate in repeated general-sum games

14 years 11 months ago

Download www.mit.edu

Learning algorithms often obtain relatively low average payoffs in repeated general-sum games between other learning agents due to a focus on myopic best-response and one-shot Nas...

Jacob W. Crandall, Michael A. Goodrich

claim paper

Read More »

click to vote

ATAL
2009
Springer

146views Intelligent Agents» more ATAL 2009»

Online exploration in least-squares policy iteration

14 years 4 months ago

Download www.aamas-conference.org

One of the key problems in reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large or even continuous Markov decision processes (...

Lihong Li, Michael L. Littman, Christopher R. Mans...

claim paper

Read More »

click to vote

JCP
2008

139views more JCP 2008»

Agent Learning in Relational Domains based on Logical MDPs with Negation

13 years 10 months ago

Download www.academypublisher.com

In this paper, we propose a model named Logical Markov Decision Processes with Negation for Relational Reinforcement Learning for applying Reinforcement Learning algorithms on the ...

Song Zhiwei, Chen Xiaoping, Cong Shuang

claim paper

Read More »

click to vote

ICML
2001
IEEE

185views Machine Learning» more ICML 2001»

Off-Policy Temporal Difference Learning with Function Approximation

14 years 11 months ago

Download www.cs.ualberta.ca

We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...

Doina Precup, Richard S. Sutton, Sanjoy Dasgupta

claim paper

Read More »

click to vote

HIS
2008

122views Information Technology» more HIS 2008»

New Crossover Operator for Evolutionary Rule Discovery in XCS

13 years 11 months ago

Download www.salle.url.edu

XCS is a learning classifier system that combines a reinforcement learning scheme with evolutionary algorithms to evolve rule sets on-line by means of the interaction with an envi...

Sergio Morales-Ortigosa, Albert Orriols-Puig, Este...

claim paper

Read More »

« Prev « First page 72 / 131 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers