Search Sciweavers | Sciweavers

1912 search results - page 209 / 383

» Optimizing interconnection policies


AAAI
2008

99 views, Intelligent Agents

Adaptive Treatment of Epilepsy via Batch-mode Reinforcement Learning

15 years 6 months ago

Download www.cs.mcgill.ca

This paper highlights the crucial role that modern machine learning techniques can play in the optimization of treatment strategies for patients with chronic disorders. In particu...

Arthur Guez, Robert D. Vincent, Massimo Avoli, Joe...



AAAI
2010

185 views, Intelligent Agents

Symbolic Dynamic Programming for First-order POMDPs

15 years 5 months ago

Download www-kd.iai.uni-bonn.de

Partially-observable Markov decision processes (POMDPs) provide a powerful model for sequential decision-making problems with partially-observed state and are known to have (appro...

Scott Sanner, Kristian Kersting



CORR
2011
Springer

209 views, Education

Close the Gaps: A Learning-while-Doing Algorithm for a Class of Single-Product Revenue Management Problems

14 years 8 months ago

Download www.stanford.edu

In this work, we consider a retailer selling a single product with limited on-hand inventory over a finite selling season. Customer demand arrives according to a Poisson process,...

Zizhuo Wang, Shiming Deng, Yinyu Ye



ATAL
2006
Springer

107 views, Intelligent Agents

Winning back the CUP for distributed POMDPs: planning over continuous belief spaces

15 years 7 months ago

Download teamcore.usc.edu

Distributed Partially Observable Markov Decision Problems (Distributed POMDPs) are evolving as a popular approach for modeling multiagent systems, and many different algorithms ha...

Pradeep Varakantham, Ranjit Nair, Milind Tambe, Ma...



ICML
2003
IEEE

157 views, Machine Learning

Action Elimination and Stopping Conditions for Reinforcement Learning

16 years 5 months ago

Download www.hpl.hp.com

We consider incorporating action elimination procedures in reinforcement learning algorithms. We suggest a framework that is based on learning an upper and a lower estimates of th...

Eyal Even-Dar, Shie Mannor, Yishay Mansour

