Search Sciweavers | Sciweavers

128 search results - page 23 / 26

» Hierarchically Optimal Average Reward Reinforcement Learning

CORR
2010
Springer

143 views, Education

The Non-Bayesian Restless Multi-Armed Bandit: a Case of Near-Logarithmic Regret

13 years 4 months ago

Download www.ece.ucdavis.edu

In the classic Bayesian restless multi-armed bandit (RMAB) problem, there are N arms, with rewards on all arms evolving at each time as Markov chains with known parameters. A play...

Wenhan Dai, Yi Gai, Bhaskar Krishnamachari, Qing Z...

NIPS
2004

138 views, Information Technology

New Criteria and a New Algorithm for Learning in Multi-Agent Systems

13 years 9 months ago

Download books.nips.cc

We propose a new set of criteria for learning algorithms in multi-agent systems, one that is more stringent and (we argue) better justified than previous proposed criteria. Our cr...

Rob Powers, Yoav Shoham

IJCAI
2007

201 views, Artificial Intelligence

Using Linear Programming for Bayesian Exploration in Markov Decision Processes

13 years 9 months ago

Download www.cs.mcgill.ca

A key problem in reinforcement learning is finding a good balance between the need to explore the environment and the need to gain rewards by exploiting existing knowledge. Much ...

Pablo Samuel Castro, Doina Precup

AUSAI
2005
Springer

166 views, Artificial Intelligence

Adaptive Utility-Based Scheduling in Resource-Constrained Systems

14 years 1 month ago

Download labs.oracle.com

This paper addresses the problem of scheduling jobs in soft real-time systems, where the utility of completing each job decreases over time. We present a utility-based framework fo...

David Vengerov

IJRR
2008

139 views

Learning to Control in Operational Space

13 years 7 months ago

Download www.kyb.tuebingen.mpg.de

One of the most general frameworks for phrasing control problems for complex, redundant robots is operational space control. However, while this framework is of essential importan...

Jan Peters, Stefan Schaal
