Search Sciweavers | Sciweavers

458 search results - page 85 / 92

» Q-Decomposition for Reinforcement Learning Agents

150

click to vote

FLAIRS
2004

188views Artificial Intelligence» more FLAIRS 2004»

Intelligent Control of Closed-Loop Sedation in Simulated ICU Patients

15 years 7 months ago

Download www.aaai.org

The intensive care unit is a challenging environment to both patient and caregiver. Continued shortages in staffing, principally in nursing, increase risk to patient and healthcar...

Brett L. Moore, Eric D. Sinzinger, Todd M. Quasny,...

claim paper

Read More »

135

click to vote

JAIR
2011

187views more JAIR 2011»

A Monte-Carlo AIXI Approximation

15 years 20 days ago

Download www.hutter1.net

This paper describes a computationally feasible approximation to the AIXI agent, a universal reinforcement learning agent for arbitrary environments. AIXI is scaled down in two ke...

Joel Veness, Kee Siong Ng, Marcus Hutter, William ...

claim paper

Read More »

150

Voted

GECCO
2009
Springer

200views Optimization» more GECCO 2009»

Apply ant colony optimization to Tetris

16 years 8 days ago

Download cs.nju.edu.cn

Tetris is a falling block game where the player’s objective is to arrange a sequence of diﬀerent shaped tetrominoes smoothly in order to survive. In the intelligence games, ag...

Xingguo Chen, Hao Wang, Weiwei Wang, Yinghuan Shi,...

claim paper

Read More »

159

Voted

ATAL
2008
Springer

155views Intelligent Agents» more ATAL 2008»

Approximate predictive state representations

15 years 7 months ago

Download www.aamas-conference.org

Predictive state representations (PSRs) are models that represent the state of a dynamical system as a set of predictions about future events. The existing work with PSRs focuses ...

Britton Wolfe, Michael R. James, Satinder P. Singh

claim paper

Read More »

198

click to vote

GLOBECOM
2008
IEEE

169views Communications» more GLOBECOM 2008»

Autonomous Network Management Using Cooperative Learning for Network-Wide Load Balancing in Heterogeneous Networks

15 years 5 months ago

Download sierra.ece.ucdavis.edu

Traditional hop-by-hop dynamic routing makes inefficient use of network resources as it forwards packets along already congested shortest paths while uncongested longer paths may b...

Minsoo Lee, Xiaohui Ye, Dan Marconett, Samuel John...

claim paper

Read More »

« Prev « First page 85 / 92 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers