Sciweavers

1176 search results - page 132 / 236
» Sparse reward processes
CHI
2006
ACM
16 years 4 months ago
Tensions in designing capture technologies for an evidence-based care community
Evidence-based care is an increasingly popular process for long term diagnosis and monitoring of education and healthcare disabilities. Because this evidence must also be collecte...
Gillian R. Hayes, Gregory D. Abowd
CDC
2008
IEEE
138 views, Control Systems
15 years 11 months ago
Modeling and analysis of dynamic decision making in sequential two-choice tasks
The focus of the work in this paper is the construction and analysis of a dynamical system model for human decision making in sequential two-choice tasks. In these tasks, a huma...
Linh Vu, Kristi A. Morgansen
CDC
2008
IEEE
197 views, Control Systems
15 years 11 months ago
Dynamic spectrum access policies for cognitive radio
We study the problem of dynamic spectrum sensing and access in cognitive radio systems as a partially observed Markov decision process (POMDP). A group of cognitive users cooper...
Jayakrishnan Unnikrishnan, Venugopal V. Veeravalli
ICRA
2008
IEEE
173 views, Robotics
15 years 11 months ago
Bayesian reinforcement learning in continuous POMDPs with application to robot navigation
We consider the problem of optimal control in continuous and partially observable environments when the parameters of the model are not known exactly. Partially Observable Mark...
Stéphane Ross, Brahim Chaib-draa, Joelle Pi...
AIIA
2007
Springer
15 years 10 months ago
Reinforcement Learning in Complex Environments Through Multiple Adaptive Partitions
The application of Reinforcement Learning (RL) algorithms to learn tasks for robots is often limited by the large dimension of the state space, which may make prohibitive its appli...
Andrea Bonarini, Alessandro Lazaric, Marcello Rest...