Search Sciweavers | Sciweavers

107 search results - page 20 / 22

» Approximate Linear Programming for Constrained Partially Obs...


CDC
2008
IEEE

204 views, Control Systems

Dynamic ping optimization for surveillance in multistatic sonar buoy networks with energy constraints

15 years 9 months ago

Download www.cs.jhu.edu

In this paper we study the problem of dynamic optimization of ping schedule in an active sonar buoy network deployed to provide persistent surveillance of a littoral area throu...

Anshu Saksena, I-Jeng Wang


AI
2006
Springer

167 views, Artificial Intelligence

Belief Selection in Point-Based Planning Algorithms for POMDPs

15 years 7 months ago

Download www.cs.mcgill.ca

Abstract. Current point-based planning algorithms for solving partially observable Markov decision processes (POMDPs) have demonstrated that a good approximation of the value funct...

Masoumeh T. Izadi, Doina Precup, Danielle Azar


IJCAI
2003

111 views, Artificial Intelligence

Generalizing Plans to New Environments in Relational MDPs

15 years 4 months ago

Download select.cs.cmu.edu

A longstanding goal in planning research is the ability to generalize plans developed for some set of environments to a new but similar environment, with minimal or no replanning....

Carlos Guestrin, Daphne Koller, Chris Gearhart, Ne...


ICRA
2008
IEEE

128 views, Robotics

A point-based POMDP planner for target tracking

15 years 9 months ago

Download www.comp.nus.edu.sg

Target tracking has two variants that are often studied independently with different approaches: target searching requires a robot to find a target initially not visible, and ...

David Hsu, Wee Sun Lee, Nan Rong


NIPS
2001

144 views, Information Technology

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 4 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...
