Sciweavers

» Reinforcement Learning with Time
JMLR
2010
13 years 2 months ago
A Convergent Online Single Time Scale Actor Critic Algorithm
Actor-Critic based approaches were among the first to address reinforcement learning in a general setting. Recently, these algorithms have gained renewed interest due to their gen...
Dotan Di Castro, Ron Meir
ICML
1998
IEEE
13 years 11 months ago
Learning to Drive a Bicycle Using Reinforcement Learning and Shaping
We present and solve a real-world problem of learning to drive a bicycle. We solve the problem by online reinforcement learning using the Sarsa(λ)-algorithm. Then we solve the ...
Jette Randløv, Preben Alstrøm
COST
2009
Springer
13 years 5 months ago
How an Agent Can Detect and Use Synchrony Parameter of Its Own Interaction with a Human?
Synchrony is claimed by psychology as a crucial parameter of any social interaction: to give to human a feeling of natural interaction, a feeling of agency [17], an agent must be a...
Ken Prepin, Philippe Gaussier
ATAL
2009
Springer
13 years 5 months ago
Replicator Dynamics for Multi-agent Learning: An Orthogonal Approach
Today's society is largely connected and many real life applications lend themselves to be modeled as multi-agent systems. Although such systems as well as their models are d...
Michael Kaisers, Karl Tuyls