Search Sciweavers | Sciweavers

135 search results - page 23 / 27

» Bounded Parameter Markov Decision Processes

click to vote

QEST
2006
IEEE

143views Modeling and Simulation» more QEST 2006»

Compositional Performability Evaluation for STATEMATE

14 years 1 months ago

Download ftp.inrialpes.fr

Abstract— This paper reports on our efforts to link an industrial state-of-the-art modelling tool to academic state-of-the-art analysis algorithms. In a nutshell, we enable timed...

Eckard Böde, Marc Herbstritt, Holger Hermanns...

claim paper

Read More »

click to vote

AAAI
2008

123views Intelligent Agents» more AAAI 2008»

Towards Faster Planning with Continuous Resources in Stochastic Domains

13 years 10 months ago

Download www.aaai.org

Agents often have to construct plans that obey resource limits for continuous resources whose consumption can only be characterized by probability distributions. While Markov Deci...

Janusz Marecki, Milind Tambe

claim paper

Read More »

click to vote

NIPS
2001

144views Information Technology» more NIPS 2001»

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

13 years 9 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

claim paper

Read More »

click to vote

ENTCS
2008

110views more ENTCS 2008»

Game-Based Probabilistic Predicate Abstraction in PRISM

13 years 7 months ago

Download qav.comlab.ox.ac.uk

ion in PRISM1 Mark Kattenbelt Marta Kwiatkowska Gethin Norman David Parker Oxford University Computing Laboratory, Oxford, UK Modelling and verification of systems such as communi...

Mark Kattenbelt, Marta Z. Kwiatkowska, Gethin Norm...

claim paper

Read More »

click to vote

HRI
2007
ACM

133views Human Computer Interaction» more HRI 2007»

Efficient model learning for dialog management

13 years 11 months ago

Download www.eecs.ucf.edu

Intelligent planning algorithms such as the Partially Observable Markov Decision Process (POMDP) have succeeded in dialog management applications [10, 11, 12] because of their rob...

Finale Doshi, Nicholas Roy

claim paper

Read More »

« Prev « First page 23 / 27 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers