Search Sciweavers | Sciweavers

185 search results - page 24 / 37

» Simulation-Based Optimization Algorithms for Finite-Horizon ...

120

Voted

EENERGY
2010

150views Computer Networks» more EENERGY 2010»

Optimal sleep patterns for serving delay-tolerant jobs

15 years 7 months ago

Download www.princeton.edu

Sleeping is an important method to reduce energy consumption in many information and communication systems. In this paper we focus on a typical server under dynamic load, where en...

Ioannis Kamitsos, Lachlan L. H. Andrew, Hongseok K...

claim paper

Read More »

116

Voted

CDC
2008
IEEE

204views Control Systems» more CDC 2008»

Dynamic ping optimization for surveillance in multistatic sonar buoy networks with energy constraints

15 years 10 months ago

Download www.cs.jhu.edu

— In this paper we study the problem of dynamic optimization of ping schedule in an active sonar buoy network deployed to provide persistent surveillance of a littoral area throu...

Anshu Saksena, I-Jeng Wang

claim paper

Read More »

150

Voted

AAAI
2007

102views Intelligent Agents» more AAAI 2007»

Thresholded Rewards: Acting Optimally in Timed, Zero-Sum Games

15 years 6 months ago

Download www.cs.cmu.edu

In timed, zero-sum games, the goal is to maximize the probability of winning, which is not necessarily the same as maximizing our expected reward. We consider cumulative intermedi...

Colin McMillen, Manuela M. Veloso

claim paper

Read More »

126

Voted

ICML
2003
IEEE

124views Machine Learning» more ICML 2003»

Exploration in Metric State Spaces

16 years 4 months ago

Download www.cis.upenn.edu

We present metric?? , a provably near-optimal algorithm for reinforcement learning in Markov decision processes in which there is a natural metric on the state space that allows t...

Sham Kakade, Michael J. Kearns, John Langford

claim paper

Read More »

122

Voted

AAAI
2010

136views Intelligent Agents» more AAAI 2010»

Robust Policy Computation in Reward-Uncertain MDPs Using Nondominated Policies

15 years 5 months ago

Download www.cs.toronto.edu

The precise specification of reward functions for Markov decision processes (MDPs) is often extremely difficult, motivating research into both reward elicitation and the robust so...

Kevin Regan, Craig Boutilier

claim paper

Read More »

« Prev « First page 24 / 37 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers