Search Sciweavers | Sciweavers

1233 search results - page 243 / 247

» Feudal Reinforcement Learning

163

ATAL
2006
Springer

147views Intelligent Agents» more ATAL 2006»

Efficient agents for cliff-edge environments with a large set of decision options

15 years 9 months ago

Download www.umiacs.umd.edu

This paper proposes an efficient agent for competing in Cliff Edge (CE) environments, such as sealed-bid auctions, dynamic pricing and the ultimatum game. The agent competes in on...

Ron Katz, Sarit Kraus

151

AAAI
2010

140views Intelligent Agents» more AAAI 2010»

The Model-Based Approach to Autonomous Behavior: A Personal View

15 years 7 months ago

Download www.dtic.upf.edu

The selection of the action to do next is one of the central problems faced by autonomous agents. In AI, three approaches have been used to address this problem: the programming-b...

Hector Geffner

168

UAI
2003

172views Artificial Intelligence» more UAI 2003»

On the Convergence of Bound Optimization Algorithms

15 years 7 months ago

Download cs.nyu.edu

Many practitioners who use EM and related algorithms complain that they are sometimes slow. When does this happen, and what can be done about it? In this paper, we study the gener...

Ruslan Salakhutdinov, Sam T. Roweis, Zoubin Ghahra...

149

ATAL
2008
Springer

180views Intelligent Agents» more ATAL 2008»

On the usefulness of opponent modeling: the Kuhn Poker case study

15 years 7 months ago

Download www.ifaamas.org

The application of reinforcement learning algorithms to Partially Observable Stochastic Games (POSG) is challenging since each agent does not have access to the whole state inform...

Alessandro Lazaric, Mario Quaresimale, Marcello Re...

161

NIPS
1993

128views Information Technology» more NIPS 1993»

Convergence of Stochastic Iterative Dynamic Programming Algorithms

15 years 7 months ago

Download www.bitsavers.org

Recent developments in the area of reinforcement learning have yielded a number of new algorithms for the prediction and control of Markovian environments. These algorithms, includ...

Tommi Jaakkola, Michael I. Jordan, Satinder P. Sin...
