Search Sciweavers | Sciweavers

1512 search results - page 274 / 303

» Qualitative reinforcement learning

AR
1998

A cognitive robot architecture based on tactile and visual information

15 years 5 months ago

Download www-kasm.nii.ac.jp

In this paper, we propose an architecture for a cognitive robot based on tactile and visual information. Visual information contains various features such as location and area of ...

Kazunori Terada, Takayuki Nakamura, Hideaki Takeda...

JETAI
2002

The interaction of representations and planning objectives for decision-theoretic planning tasks

15 years 5 months ago

Download idm-lab.org

We study decision-theoretic planning or reinforcement learning in the presence of traps such as steep slopes for outdoor robots or staircases for indoor robots. In this case, achi...

Sven Koenig, Yaxin Liu

ICCBR
2010
Springer

Category: Automated Reasoning

A General Introspective Reasoning Approach to Web Search for Case Adaptation

15 years 4 months ago

Download www.cs.indiana.edu

Acquiring adaptation knowledge for case-based reasoning systems is a challenging problem. Such knowledge is typically elicited from domain experts or extracted from the c...

David B. Leake, Jay H. Powell

NN
2010
Springer

Category: Neural Networks

Parameter-exploring policy gradients

15 years 4 months ago

Download www.kyb.mpg.de

We present a model-free reinforcement learning method for partially observable Markov decision problems. Our method estimates a likelihood gradient by sampling directly in paramet...

Frank Sehnke, Christian Osendorfer, Thomas Rü...

IAT
2010
IEEE

Category: Intelligent Agents

Selecting Operator Queries Using Expected Myopic Gain

15 years 3 months ago

Download www.eecs.umich.edu

When its human operator cannot continuously supervise (much less teleoperate) an agent, the agent should be able to recognize its limitations and ask for help when it risks making...

Robert Cohn, Michael Maxim, Edmund H. Durfee, Sati...
