Search Sciweavers | Sciweavers

473 search results - page 83 / 95

» Optimal policy switching algorithms for reinforcement learni...

click to vote

ICML
2006
IEEE

144views Machine Learning» more ICML 2006»

Probabilistic inference for solving discrete and continuous state Markov Decision Processes

14 years 8 months ago

Download eprints.pascal-network.org

Inference in Markov Decision Processes has recently received interest as a means to infer goals of an observed action, policy recognition, and also as a tool to compute policies. ...

Marc Toussaint, Amos J. Storkey

claim paper

Read More »

click to vote

NIPS
2008

130views Information Technology» more NIPS 2008»

Fitted Q-iteration by Advantage Weighted Regression

13 years 9 months ago

Download www.kyb.mpg.de

Recently, fitted Q-iteration (FQI) based methods have become more popular due to their increased sample efficiency, a more stable learning process and the higher quality of the re...

Gerhard Neumann, Jan Peters

claim paper

Read More »

click to vote

CVPR
2011
IEEE

499views Computer Vision» more CVPR 2011»

Learning Context for Collective Activity Recognition

13 years 3 months ago

Download www.eecs.umich.edu

In this paper we present a framework for the recognition of collective human activities. A collective activity is deﬁned or reinforced by the existence of coherent behavior of i...

Wongun Choi, Silvio Savarese, Khuram Shahid

claim paper

Read More »

click to vote

ICRA
2007
IEEE

126views Robotics» more ICRA 2007»

A formal framework for robot learning and control under model uncertainty

14 years 2 months ago

Download www.cs.mcgill.ca

— While the Partially Observable Markov Decision Process (POMDP) provides a formal framework for the problem of robot control under uncertainty, it typically assumes a known and ...

Robin Jaulmes, Joelle Pineau, Doina Precup

claim paper

Read More »

click to vote

JAIR
2008

121views more JAIR 2008»

A Constraint Programming Approach for Solving a Queueing Control Problem

13 years 7 months ago

Download www.jair.org

In a facility with front room and back room operations, it is useful to switch workers between the rooms in order to cope with changing customer demand. Assuming stochastic custom...

Daria Terekhov, J. Christopher Beck

claim paper

Read More »

« Prev « First page 83 / 95 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers