Sciweavers

4544 search results - page 23 / 909
» Reinforcement Learning with Time
Sort
View
145
Voted
JMLR
2010
119views more  JMLR 2010»
14 years 9 months ago
A Convergent Online Single Time Scale Actor Critic Algorithm
Actor-Critic based approaches were among the first to address reinforcement learning in a general setting. Recently, these algorithms have gained renewed interest due to their gen...
Dotan Di Castro, Ron Meir
151
Voted
ICML
1998
IEEE
15 years 7 months ago
Learning to Drive a Bicycle Using Reinforcement Learning and Shaping
We present and solve a real-world problem of learning to drive a bicycle. We solve the problem by online reinforcement learning using the Sarsa(   )-algorithm. Then we solve the ...
Jette Randløv, Preben Alstrøm
160
Voted
COST
2009
Springer
185views Multimedia» more  COST 2009»
15 years 17 days ago
How an Agent Can Detect and Use Synchrony Parameter of Its Own Interaction with a Human?
Synchrony is claimed by psychology as a crucial parameter of any social interaction: to give to human a feeling of natural interaction, a feeling of agency [17], an agent must be a...
Ken Prepin, Philippe Gaussier
128
Voted
ATAL
2009
Springer
15 years 18 days ago
Replicator Dynamics for Multi-agent Learning: An Orthogonal Approach
Today's society is largely connected and many real life applications lend themselves to be modeled as multi-agent systems. Although such systems as well as their models are d...
Michael Kaisers, Karl Tuyls
122
Voted
ICML
2000
IEEE
16 years 3 months ago
Learning to Fly: An Application of Hierarchical Reinforcement Learning
Malcolm R. K. Ryan, Mark D. Reid