Search Sciweavers | Sciweavers

102 search results - page 17 / 21

» MDPs with Non-Deterministic Policies

155

click to vote

ICML
2003
IEEE

121views Machine Learning» more ICML 2003»

Planning in the Presence of Cost Functions Controlled by an Adversary

16 years 6 months ago

Download www.cs.cmu.edu

We investigate methods for planning in a Markov Decision Process where the cost function is chosen by an adversary after we fix our policy. As a running example, we consider a rob...

H. Brendan McMahan, Geoffrey J. Gordon, Avrim Blum

claim paper

Read More »

153

click to vote

WISE
2002
Springer

111views Internet Technology» more WISE 2002»

An MDP-based Peer-to-Peer Search Server Network

15 years 11 months ago

Download www.cse.ust.hk

A distributed search system consists of a large number of autonomous search servers logically connected in a peerto-peer network. Each search server maintains a local index of a c...

Yipeng Shen, Dik Lun Lee

claim paper

Read More »

147

click to vote

IJCAI
2003

118views Artificial Intelligence» more IJCAI 2003»

Simultaneous Adversarial Multi-Robot Learning

15 years 7 months ago

Download www.cs.cmu.edu

Multi-robot learning faces all of the challenges of robot learning with all of the challenges of multiagent learning. There has been a great deal of recent research on multiagent ...

Michael H. Bowling, Manuela M. Veloso

claim paper

Read More »

164

Voted

JAIR
2006

101views more JAIR 2006»

Resource Allocation Among Agents with MDP-Induced Preferences

15 years 6 months ago

Download www.jair.org

Allocating scarce resources among agents to maximize global utility is, in general, computationally challenging. We focus on problems where resources enable agents to execute acti...

Dmitri A. Dolgov, Edmund H. Durfee

claim paper

Read More »

188

click to vote

ICML
1999
IEEE

168views Machine Learning» more ICML 1999»

Least-Squares Temporal Difference Learning

16 years 6 months ago

Download www.research.rutgers.edu

Excerpted from: Boyan, Justin. Learning Evaluation Functions for Global Optimization. Ph.D. thesis, Carnegie Mellon University, August 1998. (Available as Technical Report CMU-CS-...

Justin A. Boyan

claim paper

Read More »

« Prev « First page 17 / 21 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers