Search Sciweavers | Sciweavers

61 search results - page 4 / 13

» Market-Based Reinforcement Learning in Partially Observable ...

137

click to vote

AAAI
2010

171views Intelligent Agents» more AAAI 2010»

Reinforcement Learning via AIXI Approximation

15 years 5 months ago

Download jveness.info

This paper introduces a principled approach for the design of a scalable general reinforcement learning agent. This approach is based on a direct approximation of AIXI, a Bayesian...

Joel Veness, Kee Siong Ng, Marcus Hutter, David Si...

claim paper

Read More »

156

click to vote

IROS
2009
IEEE

206views Robotics» more IROS 2009»

Bayesian reinforcement learning in continuous POMDPs with gaussian processes

15 years 10 months ago

Download www.cs.cmu.edu

— Partially Observable Markov Decision Processes (POMDPs) provide a rich mathematical model to handle realworld sequential decision processes but require a known model to be solv...

Patrick Dallaire, Camille Besse, Stéphane R...

claim paper

Read More »

click to vote

ANOR
2010

85views more ANOR 2010»

Inventory management with partially observed nonstationary demand

15 years 4 months ago

Download www.pstat.ucsb.edu

Abstract. We consider a continuous-time model for inventory management with Markov modulated non-stationary demands. We introduce active learning by assuming that the state of the ...

Erhan Bayraktar, Michael Ludkovski

claim paper

Read More »

210

click to vote

CSL
2012
Springer

311views Automated Reasoning» more CSL 2012»

Reinforcement learning for parameter estimation in statistical spoken dialogue systems

13 years 12 months ago

Download mi.eng.cam.ac.uk

Reinforcement techniques have been successfully used to maximise the expected cumulative reward of statistical dialogue systems. Typically, reinforcement learning is used to estim...

Filip Jurcícek, Blaise Thomson, Steve Young

claim paper

Read More »

114

click to vote

ATAL
2009
Springer

198views Intelligent Agents» more ATAL 2009»

SarsaLandmark: an algorithm for learning in POMDPs with landmarks

15 years 10 months ago

Download www.aamas-conference.org

Reinforcement learning algorithms that use eligibility traces, such as Sarsa(λ), have been empirically shown to be effective in learning good estimated-state-based policies in pa...

Michael R. James, Satinder P. Singh

claim paper

Read More »

« Prev « First page 4 / 13 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers