Search Sciweavers | Sciweavers

312 search results - page 18 / 63

» Learning Partially Observable Deterministic Action Models

185

click to vote

ECML
2007
Springer

192views Machine Learning» more ECML 2007»

Policy Gradient Critics

16 years 27 days ago

Download www.idsia.ch

We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...

Daan Wierstra, Jürgen Schmidhuber

claim paper

Read More »

163

Voted

ICRA
2007
IEEE

126views Robotics» more ICRA 2007»

A formal framework for robot learning and control under model uncertainty

16 years 1 months ago

Download www.cs.mcgill.ca

— While the Partially Observable Markov Decision Process (POMDP) provides a formal framework for the problem of robot control under uncertainty, it typically assumes a known and ...

Robin Jaulmes, Joelle Pineau, Doina Precup

claim paper

Read More »

190

click to vote

ICCV
2009
IEEE

638views Computer Vision» more ICCV 2009»

Time Series Prediction by Chaotic Modeling of Nonlinear Dynamical Systems

16 years 11 months ago

Download eecs.ucf.edu

We use concepts from chaos theory in order to model nonlinear dynamical systems that exhibit deterministic behavior. Observed time series from such a system can be embedded into...

Arslan Basharat, Mubarak Shah

claim paper

Read More »

204

Voted

TSMC
2008

117views more TSMC 2008»

Discovery of High-Level Behavior From Observation of Human Performance in a Strategic Game

15 years 5 months ago

Download www.soartech.com

This paper explores the issues faced in creating a sys-4 tem that can learn tactical human behavior merely by observing5 a human perform the behavior in a simulation. More specific...

Brian S. Stensrud, Avelino J. Gonzalez

claim paper

Read More »

150

click to vote

ICANN
2005
Springer

132views Neural Networks» more ICANN 2005»

Action Understanding and Imitation Learning in a Robot-Human Task

16 years 7 days ago

Download www6.in.tum.de

We report results of an interdisciplinary project which aims at endowing a real robot system with the capacity for learning by goaldirected imitation. The control architecture is b...

Wolfram Erlhagen, Albert Mukovskiy, Estela Bicho, ...

claim paper

Read More »

« Prev « First page 18 / 63 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers