Search Sciweavers | Sciweavers

1684 search results - page 164 / 337

» The lexicographic decision function

120

click to vote

IROS
2009
IEEE

150views Robotics» more IROS 2009»

Learning locomotion over rough terrain using terrain templates

15 years 10 months ago

Download www-clmc.usc.edu

— We address the problem of foothold selection in robotic legged locomotion over very rough terrain. The difﬁculty of the problem we address here is comparable to that of human...

Mrinal Kalakrishnan, Jonas Buchli, Peter Pastor, S...

claim paper

Read More »

128

click to vote

CDC
2008
IEEE

115views Control Systems» more CDC 2008»

Oblivious equilibrium for large-scale stochastic games with unbounded costs

15 years 10 months ago

Download www.stanford.edu

— We study stochastic dynamic games with a large number of players, where players are coupled via their cost functions. A standard solution concept for stochastic games is Markov...

Sachin Adlakha, Ramesh Johari, Gabriel Y. Weintrau...

claim paper

Read More »

110

click to vote

ATAL
2005
Springer

140views Intelligent Agents» more ATAL 2005»

Modeling complex multi-issue negotiations using utility graphs

15 years 9 months ago

Download users.ecs.soton.ac.uk

This paper presents an agent strategy for complex bilateral negotiations over many issues with inter-dependent valuations. We use ideas inspired by graph theory and probabilistic ...

Valentin Robu, D. J. A. Somefun, Johannes A. La Po...

claim paper

Read More »

155

click to vote

ICML
1996
IEEE

196views Machine Learning» more ICML 1996»

A Convergent Reinforcement Learning Algorithm in the Continuous Case: The Finite-Element Reinforcement Learning

15 years 8 months ago

Download www.ri.cmu.edu

This paper presents a direct reinforcement learning algorithm, called Finite-Element Reinforcement Learning, in the continuous case, i.e. continuous state-space and time. The eval...

Rémi Munos

claim paper

Read More »

157

click to vote

AAAI
2007

117views Intelligent Agents» more AAAI 2007»

Authorial Idioms for Target Distributions in TTD-MDPs

15 years 6 months ago

Download www.cc.gatech.edu

In designing Markov Decision Processes (MDP), one must deﬁne the world, its dynamics, a set of actions, and a reward function. MDPs are often applied in situations where there i...

David L. Roberts, Sooraj Bhat, Kenneth St. Clair, ...

claim paper

Read More »

« Prev « First page 164 / 337 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers