Search Sciweavers | Sciweavers

58 search results - page 11 / 12

» Using Learned Policies in Heuristic-Search Planning

click to vote

ICML
2008
IEEE

122views Machine Learning» more ICML 2008»

Reinforcement learning in the presence of rare events

14 years 10 months ago

Download www.ece.mcgill.ca

We consider the task of reinforcement learning in an environment in which rare significant events occur independently of the actions selected by the controlling agent. If these ev...

Jordan Frank, Shie Mannor, Doina Precup

claim paper

Read More »

click to vote

AAAI
2010

154views Intelligent Agents» more AAAI 2010»

Towards Multiagent Meta-level Control

13 years 11 months ago

Download coitweb.uncc.edu

Embedded systems consisting of collaborating agents capable of interacting with their environment are becoming ubiquitous. It is crucial for these systems to be able to adapt to t...

Shanjun Cheng, Anita Raja, Victor R. Lesser

claim paper

Read More »

click to vote

JMLR
2010

149views more JMLR 2010»

Coherent Inference on Optimal Play in Game Trees

13 years 4 months ago

Download jmlr.csail.mit.edu

Round-based games are an instance of discrete planning problems. Some of the best contemporary game tree search algorithms use random roll-outs as data. Relying on a good policy, ...

Philipp Hennig, David H. Stern, Thore Graepel

claim paper

Read More »

click to vote

ATAL
2010
Springer

123views Intelligent Agents» more ATAL 2010»

Linear options

13 years 11 months ago

Download www.eecs.umich.edu

Learning, planning, and representing knowledge in large state t multiple levels of temporal abstraction are key, long-standing challenges for building flexible autonomous agents. ...

Jonathan Sorg, Satinder P. Singh

claim paper

Read More »

click to vote

HRI
2007
ACM

133views Human Computer Interaction» more HRI 2007»

Efficient model learning for dialog management

14 years 1 months ago

Download www.eecs.ucf.edu

Intelligent planning algorithms such as the Partially Observable Markov Decision Process (POMDP) have succeeded in dialog management applications [10, 11, 12] because of their rob...

Finale Doshi, Nicholas Roy

claim paper

Read More »

« Prev « First page 11 / 12 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers