Search Sciweavers | Sciweavers

2100 search results - page 11 / 420

» Observation Can Be as Effective as Action in Problem Solving

click to vote

CAEPIA
2007
Springer

153views Artificial Intelligence» more CAEPIA 2007»

Fast and Informed Action Selection for Planning with Sensing

14 years 1 months ago

Download www.tecn.upf.es

Consider a robot whose task is to pick up some colored balls from a grid, taking the red balls to a red spot, the blue balls to a blue spot and so on, one by one, without knowing e...

Alexandre Albore, Héctor Palacios, Hector G...

claim paper

Read More »

click to vote

ECML
2007
Springer

192views Machine Learning» more ECML 2007»

Policy Gradient Critics

14 years 1 months ago

Download www.idsia.ch

We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...

Daan Wierstra, Jürgen Schmidhuber

claim paper

Read More »

click to vote

AAAI
2008

130views Intelligent Agents» more AAAI 2008»

Reasoning about Large Taxonomies of Actions

13 years 8 months ago

Download www.cs.toronto.edu

We design a representation based on the situation calculus to facilitate development, maintenance and elaboration of very large taxonomies of actions. This representation leads to...

Yilan Gu, Mikhail Soutchanski

claim paper

Read More »

click to vote

ATAL
2008
Springer

150views Intelligent Agents» more ATAL 2008»

Continual collaborative planning for mixed-initiative action and interaction

13 years 9 months ago

Download www.informatik.uni-freiburg.de

Multiagent environments are often highly dynamic and only partially observable which makes deliberative action planning computationally hard. In many such environments, however, a...

Michael Brenner

claim paper

Read More »

click to vote

ICML
1995
IEEE

110views Machine Learning» more ICML 1995»

Learning by Observation and Practice: An Incremental Approach for Planning Operator Acquisition

14 years 8 months ago

Download reference.kfupm.edu.sa

This paper describes an approach to automatically learn planning operators by observing expert solution traces and to further refine the operators through practice in a learning-b...

Xuemei Wang

claim paper

Read More »

« Prev « First page 11 / 420 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers