Search Sciweavers | Sciweavers

185 search results - page 32 / 37

» Simulation-Based Optimization Algorithms for Finite-Horizon ...

ICDCS
2010
IEEE

Stochastic Steepest-Descent Optimization of Multiple-Objective Mobile Sensor Coverage

We propose a steepest descent method to compute optimal control parameters for balancing between multiple performance objectives in stateless stochastic scheduling, wherein the ...

Chris Y. T. Ma, David K. Y. Yau, Nung Kwan Yip, Na...

CPAIOR
2009
Springer

Optimal Interdiction of Unreactive Markovian Evaders

The interdiction problem arises in a variety of areas including military logistics, infectious disease control, and counter-terrorism. In the typical formulation of network interdi...

Alexander Gutfraind, Aric A. Hagberg, Feng Pan

FSR
2003
Springer

Planning under Uncertainty for Reliable Health Care Robotics

We describe a mobile robot system, designed to assist residents of a retirement facility. This system is being developed to respond to an aging population and a predicted shortage...

Nicholas Roy, Geoffrey J. Gordon, Sebastian Thrun

AAAI
2006

Incremental Least Squares Policy Iteration for POMDPs

We present a new algorithm, called incremental least squares policy iteration (ILSPI), for finding the infinite-horizon stationary policy for partially observable Markov decision ...

Hui Li, Xuejun Liao, Lawrence Carin

ICML
2006
IEEE

An analytic solution to discrete Bayesian reinforcement learning

Reinforcement learning (RL) was originally proposed as a framework to allow agents to learn in an online fashion as they interact with their environment. Existing RL algorithms co...

Pascal Poupart, Nikos A. Vlassis, Jesse Hoey, Kevi...
