Search Sciweavers | Sciweavers

23 search results - page 1 / 5

» Stochastic Local Search for POMDP Controllers

171

click to vote

AAAI
2004

103views Intelligent Agents» more AAAI 2004»

Stochastic Local Search for POMDP Controllers

15 years 8 months ago

Download www.cs.utoronto.ca

The search for finite-state controllers for partially observable Markov decision processes (POMDPs) is often based on approaches like gradient ascent, attractive because of their ...

Darius Braziunas, Craig Boutilier

claim paper

Read More »

173

click to vote

JAIR
2008

130views more JAIR 2008»

Online Planning Algorithms for POMDPs

15 years 6 months ago

Download www.jair.org

Partially Observable Markov Decision Processes (POMDPs) provide a rich framework for sequential decision-making under uncertainty in stochastic domains. However, solving a POMDP i...

Stéphane Ross, Joelle Pineau, Sébast...

claim paper

Read More »

188

click to vote

ICML
1994
IEEE

151views Machine Learning» more ICML 1994»

Learning Without State-Estimation in Partially Observable Markovian Decision Processes

15 years 10 months ago

Download www.eecs.umich.edu

Reinforcement learning (RL) algorithms provide a sound theoretical basis for building learning control architectures for embedded agents. Unfortunately all of the theory and much ...

Satinder P. Singh, Tommi Jaakkola, Michael I. Jord...

claim paper

Read More »

163

click to vote

ICML
2000
IEEE

126views Machine Learning» more ICML 2000»

Reinforcement Learning in POMDP's via Direct Gradient Ascent

16 years 7 months ago

Download reference.kfupm.edu.sa

This paper discusses theoretical and experimental aspects of gradient-based approaches to the direct optimization of policy performance in controlled ??? ?s. We introduce ??? ?, a...

Jonathan Baxter, Peter L. Bartlett

claim paper

Read More »

178

click to vote

IUI
2010
ACM

207views Software Engineering» more IUI 2010»

A POMDP approach to P300-based brain-computer interfaces

16 years 3 months ago

Download ailab.kaist.ac.kr

Most of the previous work on non-invasive brain-computer interfaces (BCIs) has been focused on feature extraction and classification algorithms to achieve high performance for the...

Jaeyoung Park, Kee-Eung Kim, Sungho Jo

claim paper

Read More »

« Prev « First page 1 / 5 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers