Search Sciweavers | Sciweavers

672 search results - page 55 / 135

» Policy Search by Dynamic Programming

click to vote

AAAI
2004

103views Intelligent Agents» more AAAI 2004»

Stochastic Local Search for POMDP Controllers

13 years 9 months ago

Download www.cs.utoronto.ca

The search for finite-state controllers for partially observable Markov decision processes (POMDPs) is often based on approaches like gradient ascent, attractive because of their ...

Darius Braziunas, Craig Boutilier

claim paper

Read More »

click to vote

INFOCOM
2007
IEEE

189views Communications» more INFOCOM 2007»

Cost and Collision Minimizing Forwarding Schemes for Wireless Sensor Networks

14 years 2 months ago

Download www.dei.unipd.it

—The paper presents a novel integrated MAC/routing scheme for wireless sensor networking. Our design objective is to elect the next hop for data forwarding by minimizing the numb...

Michele Rossi, Nicola Bui, Michele Zorzi

claim paper

Read More »

click to vote

SWAT
2004
Springer

120views Algorithms» more SWAT 2004»

Railway Delay Management: Exploring Its Algorithmic Complexity

14 years 1 months ago

Download www.inf.ethz.ch

We consider delay management in railway systems. Given delayed trains, we want to ﬁnd a waiting policy for the connecting trains minimizing the weighted total passenger delay. If...

Michael Gatto, Björn Glaus, Riko Jacob, Leon ...

claim paper

Read More »

click to vote

INFOCOM
2002
IEEE

73views Communications» more INFOCOM 2002»

Optimal Energy Allocation and Admission Control for Communications Satellites

14 years 23 days ago

Download www.mit.edu

—We address the issue of optimal energy allocation and admission control for communications satellites in earth orbit. Such satellites receive requests for transmission as they o...

Alvin Fu, Eytan Modiano, John N. Tsitsiklis

claim paper

Read More »

click to vote

ICML
2000
IEEE

165views Machine Learning» more ICML 2000»

A Bayesian Framework for Reinforcement Learning

14 years 6 days ago

Download www.ece.uvic.ca

The reinforcement learning problem can be decomposed into two parallel types of inference: (i) estimating the parameters of a model for the underlying process; (ii) determining be...

Malcolm J. A. Strens

claim paper

Read More »

« Prev « First page 55 / 135 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers