Search Sciweavers | Sciweavers

4544 search results - page 27 / 909

» Reinforcement Learning with Time

ICML
2008
IEEE

135 views, Machine Learning

Reinforcement learning with limited reinforcement: using Bayes risk for active learning in POMDPs

16 years 6 months ago

Download mapleleaf.csail.mit.edu

Partially Observable Markov Decision Processes (POMDPs) have succeeded in planning domains that require balancing actions that increase an agent's knowledge and actions that ...

Finale Doshi, Joelle Pineau, Nicholas Roy

AR
2008

118 views

Efficient Behavior Learning Based on State Value Estimation of Self and Others

15 years 6 months ago

Download www.er.ams.eng.osaka-u.ac.jp

The existing reinforcement learning methods have been seriously suffering from the curse of dimension problem especially when they are applied to multiagent dynamic environments. ...

Yasutake Takahashi, Kentarou Noma, Minoru Asada

PERCOM
2009
ACM

101 views, Computer Networks

Proactive and Adaptive Fuzzy Profile Control for Mobile Phones

16 years 6 months ago

Download www.students.tut.fi

In this paper we describe a context-sensitive way to change an active mobile phone profile. We present a method to create a proactive and adaptive phone profile control system that...

Miika Valtonen, Antti-Matti Vainio, Jukka Vanhala

ECAI
2010
Springer

211 views, Artificial Intelligence

Case-Based Multiagent Reinforcement Learning: Cases as Heuristics for Selection of Actions

15 years 7 months ago

Download www.iiia.csic.es

This work presents a new approach that allows the use of cases in a case base as heuristics to speed up Multiagent Reinforcement Learning algorithms, combining Case-Based Reasoning...

Reinaldo A. C. Bianchi, Ramon López de M...

IJCAI
2001

163 views, Artificial Intelligence

Exploiting Multiple Secondary Reinforcers in Policy Gradient Reinforcement Learning

15 years 7 months ago

Download www.cs.colorado.edu

Most formulations of Reinforcement Learning depend on a single reinforcement reward value to guide the search for the optimal policy solution. If observation of this reward is rar...

Gregory Z. Grudic, Lyle H. Ungar
