Search Sciweavers | Sciweavers

168 search results - page 33 / 34

» Reinforcement Learning Algorithm for Partially Observable Ma...


AAAI
2007

88 views | Intelligent Agents

Continuous State POMDPs for Object Manipulation Tasks

15 years 4 months ago

Download www.aaai.org

My research focus is on using continuous state partially observable Markov decision processes (POMDPs) to perform object manipulation tasks using a robotic arm. During object mani...

Emma Brunskill



AAAI
2006

146 views | Intelligent Agents

Incremental Least Squares Policy Iteration for POMDPs

15 years 3 months ago

Download www.aaai.org

We present a new algorithm, called incremental least squares policy iteration (ILSPI), for finding the infinite-horizon stationary policy for partially observable Markov decision ...

Hui Li, Xuejun Liao, Lawrence Carin



ATAL
2008
Springer

180 views | Intelligent Agents

On the usefulness of opponent modeling: the Kuhn Poker case study

15 years 4 months ago

Download www.ifaamas.org

The application of reinforcement learning algorithms to Partially Observable Stochastic Games (POSG) is challenging since each agent does not have access to the whole state inform...

Alessandro Lazaric, Mario Quaresimale, Marcello Re...



ICRA
2008
IEEE

128 views | Robotics

A point-based POMDP planner for target tracking

15 years 8 months ago

Download www.comp.nus.edu.sg

Target tracking has two variants that are often studied independently with different approaches: target searching requires a robot to find a target initially not visible, and ...

David Hsu, Wee Sun Lee, Nan Rong



CDC
2008
IEEE

204 views | Control Systems

Dynamic ping optimization for surveillance in multistatic sonar buoy networks with energy constraints

15 years 9 months ago

Download www.cs.jhu.edu

In this paper we study the problem of dynamic optimization of ping schedule in an active sonar buoy network deployed to provide persistent surveillance of a littoral area throu...

Anshu Saksena, I-Jeng Wang

