Search Sciweavers | Sciweavers

1234 search results - page 178 / 247

» Multi-criteria Reinforcement Learning

KI
2002
Springer

108 views, Artificial Intelligence

Qualitative Velocity and Ball Interception

15 years 3 months ago

Download fstolzenburg.hs-harz.de

In many approaches for qualitative spatial reasoning, navigation of an agent in a more or less static environment is considered (e.g. in the double-cross calculus [12]). However, i...

Frieder Stolzenburg, Oliver Obst, Jan Murray

ICCBR
2010
Springer

229 views, Automated Reasoning

A General Introspective Reasoning Approach to Web Search for Case Adaptation

15 years 2 months ago

Download www.cs.indiana.edu

Abstract. Acquiring adaptation knowledge for case-based reasoning systems is a challenging problem. Such knowledge is typically elicited from domain experts or extracted from the c...

David B. Leake, Jay H. Powell

NN
2010
Springer

125 views, Neural Networks

Parameter-exploring policy gradients

15 years 2 months ago

Download www.kyb.mpg.de

We present a model-free reinforcement learning method for partially observable Markov decision problems. Our method estimates a likelihood gradient by sampling directly in paramet...

Frank Sehnke, Christian Osendorfer, Thomas Rü...

IAT
2010
IEEE

167 views, Intelligent Agents

Selecting Operator Queries Using Expected Myopic Gain

15 years 1 month ago

Download www.eecs.umich.edu

When its human operator cannot continuously supervise (much less teleoperate) an agent, the agent should be able to recognize its limitations and ask for help when it risks making...

Robert Cohn, Michael Maxim, Edmund H. Durfee, Sati...

COGSR
2011

71 views

Psychological models of human and optimal performance in bandit problems

14 years 10 months ago

Download www.socsci.uci.edu

In bandit problems, a decision-maker must choose between a set of alternatives, each of which has a fixed but unknown rate of reward, to maximize their total number of rewards ov...

Michael D. Lee, Shunan Zhang, Miles Munro, Mark St...
