Search Sciweavers | Sciweavers

458 search results - page 49 / 92

» Q-Decomposition for Reinforcement Learning Agents

152

click to vote

AAAI
2008

99views Intelligent Agents» more AAAI 2008»

Adaptive Treatment of Epilepsy via Batch-mode Reinforcement Learning

15 years 8 months ago

Download www.cs.mcgill.ca

This paper highlights the crucial role that modern machine learning techniques can play in the optimization of treatment strategies for patients with chronic disorders. In particu...

Arthur Guez, Robert D. Vincent, Massimo Avoli, Joe...

claim paper

Read More »

146

click to vote

AAAI
2010

134views Intelligent Agents» more AAAI 2010»

Reinforcement Learning Via Practice and Critique Advice

15 years 7 months ago

Download web.engr.oregonstate.edu

We consider the problem of incorporating end-user advice into reinforcement learning (RL). In our setting, the learner alternates between practicing, where learning is based on ac...

Kshitij Judah, Saikat Roy, Alan Fern, Thomas G. Di...

claim paper

Read More »

169

click to vote

ICANN
2001
Springer

123views Neural Networks» more ICANN 2001»

Market-Based Reinforcement Learning in Partially Observable Worlds

15 years 10 months ago

Download www.hutter1.net

Unlike traditional reinforcement learning (RL), market-based RL is in principle applicable to worlds described by partially observable Markov Decision Processes (POMDPs), where an ...

Ivo Kwee, Marcus Hutter, Jürgen Schmidhuber

claim paper

Read More »

172

click to vote

ATAL
2007
Springer

151views Intelligent Agents» more ATAL 2007»

Batch reinforcement learning in a complex domain

16 years 4 days ago

Download userweb.cs.utexas.edu

Temporal diﬀerence reinforcement learning algorithms are perfectly suited to autonomous agents because they learn directly from an agent’s experience based on sequential actio...

Shivaram Kalyanakrishnan, Peter Stone

claim paper

Read More »

143

click to vote

AAAI
2007

117views Intelligent Agents» more AAAI 2007»

Optimizing Anthrax Outbreak Detection Using Reinforcement Learning

15 years 8 months ago

Download www.aaai.org

The potentially catastrophic impact of a bioterrorist attack makes developing effective detection methods essential for public health. In the case of anthrax attack, a delay of ho...

Masoumeh T. Izadi, David L. Buckeridge

claim paper

Read More »

« Prev « First page 49 / 92 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers