Search Sciweavers | Sciweavers

4544 search results - page 19 / 909

» Reinforcement Learning with Time

185

click to vote

CEC
2007
IEEE

126views Artificial Intelligence» more CEC 2007»

Double-deck elevator systems using Genetic Network Programming with reinforcement learning

15 years 11 months ago

Download www.cs.york.ac.uk

Abstract-- In order to increase the transportation capability of elevator group systems in high-rise buildings without adding elevator installation space, double-deck elevator syst...

Jin Zhou, Lu Yu, Shingo Mabu, Kotaro Hirasawa, Jin...

claim paper

Read More »

185

click to vote

NN
2006
Springer

127views Neural Networks» more NN 2006»

The asymptotic equipartition property in reinforcement learning and its relation to return maximization

15 years 7 months ago

Download www.ece.uvic.ca

We discuss an important property called the asymptotic equipartition property on empirical sequences in reinforcement learning. This states that the typical set of empirical seque...

Kazunori Iwata, Kazushi Ikeda, Hideaki Sakai

claim paper

Read More »

229

click to vote

PKDD
2010
Springer

129views Data Mining» more PKDD 2010»

Smarter Sampling in Model-Based Bayesian Reinforcement Learning

15 years 5 months ago

Download www.cs.mcgill.ca

Abstract. Bayesian reinforcement learning (RL) is aimed at making more efﬁcient use of data samples, but typically uses signiﬁcantly more computation. For discrete Markov Decis...

Pablo Samuel Castro, Doina Precup

claim paper

Read More »

197

Voted

ATAL
2007
Springer

146views Intelligent Agents» more ATAL 2007»

Transfer via inter-task mappings in policy search reinforcement learning

16 years 1 months ago

Download userweb.cs.utexas.edu

The ambitious goal of transfer learning is to accelerate learning on a target task after training on a different, but related, source task. While many past transfer methods have f...

Matthew E. Taylor, Shimon Whiteson, Peter Stone

claim paper

Read More »

174

Voted

AAAI
2004

135views Intelligent Agents» more AAAI 2004»

Performance Bounded Reinforcement Learning in Strategic Interactions

15 years 9 months ago

Download www.aaai.org

Despite increasing deployment of agent technologies in several business and industry domains, user confidence in fully automated agent driven applications is noticeably lacking. T...

Bikramjit Banerjee, Jing Peng

claim paper

Read More »

« Prev « First page 19 / 909 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers