Search Sciweavers | Sciweavers

802 search results - page 48 / 161

» Experts in a Markov Decision Process

150

click to vote

HICSS
2005
IEEE

188views Biometrics» more HICSS 2005»

Developing Group Decision Support Systems for Deception Detection

15 years 11 months ago

Download csdl2.computer.org

Achieving information assurance and security is a complex and challenging task, which is crucial from national and personal security point of views. Research in detecting deceptiv...

Amit V. Deokar, Therani Madhusudan

claim paper

Read More »

188

click to vote

CORR
2012
Springer

235views Education» more CORR 2012»

An Incremental Sampling-based Algorithm for Stochastic Optimal Control

14 years 1 months ago

Download www.mit.edu

Abstract— In this paper, we consider a class of continuoustime, continuous-space stochastic optimal control problems. Building upon recent advances in Markov chain approximation ...

Vu Anh Huynh, Sertac Karaman, Emilio Frazzoli

claim paper

Read More »

150

click to vote

NIPS
2004

125views Information Technology» more NIPS 2004»

VDCBPI: an Approximate Scalable Algorithm for Large POMDPs

15 years 7 months ago

Download books.nips.cc

Existing algorithms for discrete partially observable Markov decision processes can at best solve problems of a few thousand states due to two important sources of intractability:...

Pascal Poupart, Craig Boutilier

claim paper

Read More »

150

click to vote

ICML
2005
IEEE

157views Machine Learning» more ICML 2005»

A causal approach to hierarchical decomposition of factored MDPs

16 years 6 months ago

Download www-anw.cs.umass.edu

We present Variable Influence Structure Analysis, an algorithm that dynamically performs hierarchical decomposition of factored Markov decision processes. Our algorithm determines...

Anders Jonsson, Andrew G. Barto

claim paper

Read More »

168

click to vote

ICANN
2001
Springer

123views Neural Networks» more ICANN 2001»

Market-Based Reinforcement Learning in Partially Observable Worlds

15 years 10 months ago

Download www.hutter1.net

Unlike traditional reinforcement learning (RL), market-based RL is in principle applicable to worlds described by partially observable Markov Decision Processes (POMDPs), where an ...

Ivo Kwee, Marcus Hutter, Jürgen Schmidhuber

claim paper

Read More »

« Prev « First page 48 / 161 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers