Search Sciweavers | Sciweavers

343 search results - page 57 / 69

» Action discovery for reinforcement learning

click to vote

NIPS
2008

271views Information Technology» more NIPS 2008»

Goal-directed decision making in prefrontal cortex: a computational framework

13 years 9 months ago

Download www.princeton.edu

Research in animal learning and behavioral neuroscience has distinguished between two forms of action control: a habit-based form, which relies on stored action values, and a goal...

Matthew Botvinick, James An

claim paper

Read More »

click to vote

AROBOTS
1998

111views more AROBOTS 1998»

Emergence and Categorization of Coordinated Visual Behavior Through Embodied Interaction

13 years 7 months ago

Download www.informatics.sussex.ac.uk

This paper discusses the emergence of sensorimotor coordination for ESCHeR, a 4DOF redundant foveated robot-head, by interaction with its environment. A feedback-error-learning(FEL...

Luc Berthouze, Yasuo Kuniyoshi

claim paper

Read More »

click to vote

AINA
2006
IEEE

179views Computer Networks» more AINA 2006»

Constrained Flooding: A Robust and Efficient Routing Framework for Wireless Sensor Networks

13 years 11 months ago

Download www.parc.com

Flooding protocols for wireless networks in general have been shown to be very inefficient and therefore are mainly used in network initialization or route discovery and maintenan...

Ying Zhang, Markus P. J. Fromherz

claim paper

Read More »

click to vote

CORR
2010
Springer

152views Education» more CORR 2010»

Neuroevolutionary optimization

13 years 7 months ago

Download jmlr.csail.mit.edu

Temporal difference methods are theoretically grounded and empirically effective methods for addressing reinforcement learning problems. In most real-world reinforcement learning ...

Eva Volná

claim paper

Read More »

click to vote

ICML
2010
IEEE

231views Machine Learning» more ICML 2010»

Toward Off-Policy Learning Control with Function Approximation

13 years 8 months ago

Download www.sztaki.hu

We present the first temporal-difference learning algorithm for off-policy control with unrestricted linear function approximation whose per-time-step complexity is linear in the ...

Hamid Reza Maei, Csaba Szepesvári, Shalabh ...

claim paper

Read More »

« Prev « First page 57 / 69 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers