Search Sciweavers | Sciweavers

473 search results - page 16 / 95

» Optimal policy switching algorithms for reinforcement learni...

click to vote

ICML
2009
IEEE

172views Machine Learning» more ICML 2009»

Model-free reinforcement learning as mixture learning

14 years 8 months ago

Download user.cs.tu-berlin.de

We cast model-free reinforcement learning as the problem of maximizing the likelihood of a probabilistic mixture model via sampling, addressing both the infinite and finite horizo...

Nikos Vlassis, Marc Toussaint

claim paper

Read More »

click to vote

ATAL
2003
Springer

154views Intelligent Agents» more ATAL 2003»

Coordination in multiagent reinforcement learning: a Bayesian approach

14 years 28 days ago

Download www.cs.toronto.edu

Much emphasis in multiagent reinforcement learning (MARL) research is placed on ensuring that MARL algorithms (eventually) converge to desirable equilibria. As in standard reinfor...

Georgios Chalkiadakis, Craig Boutilier

claim paper

Read More »

click to vote

NECO
2010

97views more NECO 2010»

Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning

13 years 6 months ago

Download www.kyb.tuebingen.mpg.de

Most conventional Policy Gradient Reinforcement Learning (PGRL) algorithms neglect (or do not explicitly make use of) a term in the average reward gradient with respect to the pol...

Tetsuro Morimura, Eiji Uchibe, Junichiro Yoshimoto...

claim paper

Read More »

click to vote

ICML
2004
IEEE

158views Machine Learning» more ICML 2004»

Adaptive cognitive orthotics: combining reinforcement learning and constraint-based temporal reasoning

14 years 8 months ago

Download www.eecs.umich.edu

Reminder systems support people with impaired prospective memory and/or executive function, by providing them with reminders of their functional daily activities. We integrate tem...

Matthew R. Rudary, Satinder P. Singh, Martha E. Po...

claim paper

Read More »

click to vote

ICML
2003
IEEE

121views Machine Learning» more ICML 2003»

Q-Decomposition for Reinforcement Learning Agents

14 years 8 months ago

Download www.hpl.hp.com

The paper explores a very simple agent design method called Q-decomposition, wherein a complex agent is built from simpler subagents. Each subagent has its own reward function and...

Stuart J. Russell, Andrew Zimdars

claim paper

Read More »

« Prev « First page 16 / 95 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers