Sciweavers

» Markov Decision Processes with Arbitrary Reward Processes
ECAI
2000
Springer
15 years 7 months ago
Efficient Asymptotic Approximation in Temporal Difference Learning
Abstract. TD(
Frédérick Garcia, Florent Serre
GLOBECOM
2010
IEEE
15 years 2 months ago
Cooperative Relay Scheduling under Partial State Information in Energy Harvesting Sensor Networks
Abstract. Sensors equipped with energy harvesting and cooperative communication capabilities are a viable solution to the power limitations of Wireless Sensor Networks (WSNs) assoc...
Huijiang Li, Neeraj Jaggi, Biplab Sikdar
AAAI
2008
15 years 6 months ago
Unknown Rewards in Finite-Horizon Domains
"Human computation" is a recent approach that extracts information from large numbers of Web users. reCAPTCHA is a human computation project that improves the process of...
Colin McMillen, Manuela M. Veloso
ATAL
2004
Springer
15 years 9 months ago
Learning User Preferences for Wireless Services Provisioning
The problem of interest is how to dynamically allocate wireless access services in a competitive market which implements a take-it-or-leave-it allocation mechanism. In this paper ...
George Lee, Steven Bauer, Peyman Faratin, John Wro...
CORR
2008
Springer
15 years 4 months ago
Significant Diagnostic Counterexamples in Probabilistic Model Checking
Abstract. This paper presents a novel technique for counterexample generation in probabilistic model checking of Markov chains and Markov Decision Processes. (Finite) paths in coun...
Miguel E. Andrés, Pedro R. D'Argenio, Peter...