Search Sciweavers | Sciweavers

252 search results - page 19 / 51

» Learning Partially Observable Action Models: Efficient Algor...

148

click to vote

HASE
2008
IEEE

132views Control Systems» more HASE 2008»

Small Logs for Transactional Services: Distinction is Much More Accurate than (Positive) Discrimination

16 years 29 days ago

Download www.labri.fr

For complex services, logging is an integral part of many middleware aspects, especially, transactions and monitoring. In the event of a failure, the log allows us to deduce the c...

Debmalya Biswas, Thomas Gazagnaire, Blaise Genest

claim paper

Read More »

184

click to vote

ATAL
2006
Springer

127views Intelligent Agents» more ATAL 2006»

Learning to commit in repeated games

15 years 10 months ago

Download staff.science.uva.nl

Learning to converge to an efficient, i.e., Pareto-optimal Nash equilibrium of the repeated game is an open problem in multiagent learning. Our goal is to facilitate the learning ...

Stéphane Airiau, Sandip Sen

claim paper

Read More »

170

click to vote

ICML
2004
IEEE

142views Machine Learning» more ICML 2004»

Learning and discovery of predictive state representations in dynamical systems with reset

16 years 7 months ago

Download www.cc.gatech.edu

Predictive state representations (PSRs) are a recently proposed way of modeling controlled dynamical systems. PSR-based models use predictions of observable outcomes of tests that...

Michael R. James, Satinder P. Singh

claim paper

Read More »

183

click to vote

ALT
2005
Springer

137views Machine Learning» more ALT 2005»

Defensive Universal Learning with Experts

16 years 3 months ago

Download www.idsia.ch

This paper shows how universal learning can be achieved with expert advice. To this aim, we specify an experts algorithm with the following characteristics: (a) it uses only feedba...

Jan Poland, Marcus Hutter

claim paper

Read More »

179

click to vote

IJCAI
2003

142views Artificial Intelligence» more IJCAI 2003»

Taming Decentralized POMDPs: Towards Efficient Policy Computation for Multiagent Settings

15 years 7 months ago

Download dli.iiit.ac.in

The problem of deriving joint policies for a group of agents that maximize some joint reward function can be modeled as a decentralized partially observable Markov decision proces...

Ranjit Nair, Milind Tambe, Makoto Yokoo, David V. ...

claim paper

Read More »

« Prev « First page 19 / 51 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers