Search Sciweavers | Sciweavers

176 search results - page 19 / 36

» Approximation and Exactness in Finite State Optimality Theor...

174

click to vote

PKDD
2009
Springer

152views Data Mining» more PKDD 2009»

Feature Selection for Value Function Approximation Using Bayesian Model Selection

16 years 21 days ago

Download userweb.cs.utexas.edu

Abstract. Feature selection in reinforcement learning (RL), i.e. choosing basis functions such that useful approximations of the unkown value function can be obtained, is one of th...

Tobias Jung, Peter Stone

claim paper

Read More »

155

click to vote

AAMAS
2007
Springer

161views Intelligent Agents» more AAMAS 2007»

Optimal Control in Large Stochastic Multi-agent Systems

16 years 11 days ago

Download www.snn.ru.nl

Abstract. We study optimal control in large stochastic multi-agent systems in continuous space and time. We consider multi-agent systems where agents have independent dynamics with...

Bart van den Broek, Wim Wiegerinck, Bert Kappen

claim paper

Read More »

185

click to vote

AAAI
2007

102views Intelligent Agents» more AAAI 2007»

Thresholded Rewards: Acting Optimally in Timed, Zero-Sum Games

15 years 8 months ago

Download www.cs.cmu.edu

In timed, zero-sum games, the goal is to maximize the probability of winning, which is not necessarily the same as maximizing our expected reward. We consider cumulative intermedi...

Colin McMillen, Manuela M. Veloso

claim paper

Read More »

166

click to vote

INFOCOM
2003
IEEE

145views Communications» more INFOCOM 2003»

Time-Optimal Network Queue Control: The Case of a Single Congested Node

15 years 11 months ago

Download www.ieee-infocom.org

-We solve the problem of time-optimal network queue control: what are the input data rates that make network queue sizes converge to their ideal size in the least possible time aft...

Mahadevan Iyer, Wei Kang Tsai

claim paper

Read More »

156

click to vote

DM
2010

107views more DM 2010»

An analytic approach to stability

15 years 6 months ago

Download www.math.cmu.edu

The stability method is very useful for obtaining exact solutions of many extremal graph problems. Its key step is to establish the stability property which, roughly speaking, sta...

Oleg Pikhurko

claim paper

Read More »

« Prev « First page 19 / 36 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers