Search Sciweavers | Sciweavers

374 search results - page 34 / 75

» Multiagent Reinforcement Learning: Theoretical Framework and...

PKDD
2010
Springer

Gaussian Processes for Sample Efficient Reinforcement Learning with RMAX-Like Exploration

15 years 29 days ago

Download www.cs.utexas.edu

Abstract. We present an implementation of model-based online reinforcement learning (RL) for continuous domains with deterministic transitions that is specifically designed to achi...

Tobias Jung, Peter Stone

ATAL
2008
Springer

Sequential decision making with untrustworthy service providers

15 years 5 months ago

Download www.aamas-conference.org

In this paper, we deal with the sequential decision making problem of agents operating in computational economies, where there is uncertainty regarding the trustworthiness of serv...

W. T. Luke Teacy, Georgios Chalkiadakis, Alex Roge...

AAAI
2010

Towards Multiagent Meta-level Control

15 years 4 months ago

Download coitweb.uncc.edu

Embedded systems consisting of collaborating agents capable of interacting with their environment are becoming ubiquitous. It is crucial for these systems to be able to adapt to t...

Shanjun Cheng, Anita Raja, Victor R. Lesser

NIPS
2007

A Game-Theoretic Approach to Apprenticeship Learning

15 years 4 months ago

Download books.nips.cc

We study the problem of an apprentice learning to behave in an environment with an unknown reward function by observing the behavior of an expert. We follow on the work of Abbeel ...

Umar Syed, Robert E. Schapire

ICONIP
2007

Finding Exploratory Rewards by Embodied Evolution and Constrained Reinforcement Learning in the Cyber Rodents

15 years 4 months ago

Download www.nc.irp.oist.jp

The aim of the Cyber Rodent project [1] is to elucidate the origin of our reward and affective systems by building artificial agents that share the natural biological constraints...

Eiji Uchibe, Kenji Doya
