Search Sciweavers | Sciweavers

651 search results

» Algorithms for Inverse Reinforcement Learning

AAAI
1993

Intelligent Agents

Complexity Analysis of Real-Time Reinforcement Learning

15 years 7 months ago

Download www.ri.cmu.edu

This paper analyzes the complexity of on-line reinforcement learning algorithms, namely asynchronous realtime versions of Q-learning and value-iteration, applied to the problem of...

Sven Koenig, Reid G. Simmons

ICML
2009
IEEE

Machine Learning

The adaptive k-meteorologists problem and its application to structure learning and feature selection in reinforcement learning

16 years 6 months ago

Download www.research.rutgers.edu

The purpose of this paper is three-fold. First, we formalize and study a problem of learning probabilistic concepts in the recently proposed KWIK framework. We give details of an ...

Carlos Diuk, Lihong Li, Bethany R. Leffler

ICAART
2010
INSTICC

Intelligent Agents

Complexity of Stochastic Branch and Bound Methods for Belief Tree Search in Bayesian Reinforcement Learning

16 years 3 months ago

Download arxiv.org

There has been a lot of recent work on Bayesian methods for reinforcement learning exhibiting near-optimal online performance. The main obstacle facing such methods is that in most...

Christos Dimitrakakis

posted by olethros

IJCAI
2007

Artificial Intelligence

Deictic Option Schemas

15 years 7 months ago

Download www.ijcai.org

Deictic representation is a representational paradigm, based on selective attention and pointers, that allows an agent to learn and reason about rich complex environments. In this...

Balaraman Ravindran, Andrew G. Barto, Vimal Mathew

NETWORKING
2007

Computer Networks

Reinforcement Learning-Based Load Shared Sequential Routing

15 years 7 months ago

Download www.ece.mcgill.ca

We consider event dependent routing algorithms for on-line explicit source routing in MPLS networks. The proposed methods are based on load shared sequential routing in which load ...

Fariba Heidari, Shie Mannor, Lorne Mason
