Search Sciweavers | Sciweavers

109 search results - page 16 / 22

» Policy teaching through reward function learning

140

click to vote

CORR
2008
Springer

64views Education» more CORR 2008»

Linearly Parameterized Bandits

15 years 6 months ago

Download legacy.orie.cornell.edu

We consider bandit problems involving a large (possibly infinite) collection of arms, in which the expected reward of each arm is a linear function of an r-dimensional random vect...

Paat Rusmevichientong, John N. Tsitsiklis

claim paper

Read More »

209

click to vote

AIPS
2007

174views Artificial Intelligence» more AIPS 2007»

Learning to Plan Using Harmonic Analysis of Diffusion Models

15 years 9 months ago

Download www.cs.umass.edu

This paper summarizes research on a new emerging framework for learning to plan using the Markov decision process model (MDP). In this paradigm, two approaches to learning to plan...

Sridhar Mahadevan, Sarah Osentoski, Jeffrey Johns,...

claim paper

Read More »

199

Voted

NIPS
2007

158views Information Technology» more NIPS 2007»

Reinforcement Learning in Continuous Action Spaces through Sequential Monte Carlo Methods

15 years 8 months ago

Download books.nips.cc

Learning in real-world domains often requires to deal with continuous state and action spaces. Although many solutions have been proposed to apply Reinforcement Learning algorithm...

Alessandro Lazaric, Marcello Restelli, Andrea Bona...

claim paper

Read More »

205

Voted

IAT
2005
IEEE

180views Intelligent Agents» more IAT 2005»

Self-Organizing Cognitive Agents and Reinforcement Learning in Multi-Agent Environment

16 years 10 days ago

Download www3.ntu.edu.sg

This paper presents a self-organizing cognitive architecture, known as TD-FALCON, that learns to function through its interaction with the environment. TD-FALCON learns the value ...

Ah-Hwee Tan, Dan Xiao

claim paper

Read More »

158

Voted

NIPS
2007

133views Information Technology» more NIPS 2007»

Managing Power Consumption and Performance of Computing Systems Using Reinforcement Learning

15 years 8 months ago

Download books.nips.cc

Electrical power management in large-scale IT systems such as commercial datacenters is an application area of rapidly growing interest from both an economic and ecological perspe...

Gerald Tesauro, Rajarshi Das, Hoi Chan, Jeffrey O....

claim paper

Read More »

« Prev « First page 16 / 22 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers