Sciweavers

802 search results - page 38 / 161
Experts in a Markov Decision Process
ICML
2010
IEEE
13 years 8 months ago
Convergence of Least Squares Temporal Difference Methods Under General Conditions
We consider approximate policy evaluation for finite state and action Markov decision processes (MDP) in the off-policy learning context and with the simulation-based least square...
Huizhen Yu
UAI
2003
13 years 9 months ago
Implementation and Comparison of Solution Methods for Decision Processes with Non-Markovian Rewards
This paper examines a number of solution methods for decision processes with non-Markovian rewards (NMRDPs). They all exploit a temporal logic specification of the reward functio...
Charles Gretton, David Price, Sylvie Thiéba...
ICRA
2010
IEEE
13 years 6 months ago
Exploiting domain knowledge in planning for uncertain robot systems modeled as POMDPs
We propose a planning algorithm that allows user-supplied domain knowledge to be exploited in the synthesis of information feedback policies for systems modeled as parti...
Salvatore Candido, James C. Davidson, Seth Hutchin...
ICTAI
2007
IEEE
14 years 2 months ago
Multi-criteria Decision Making for Local Coordination in Multi-agent Systems
Unlike mono-agent systems, multi-agent planning addresses the problem of resolving conflicts between individual and group interests. In this paper, we use a Decentralized Ve...
Matthieu Boussard, Maroua Bouzid, Abdel-Illah Moua...
IAAI
1993
13 years 9 months ago
Dodger, a Diagnostic Expert System for the Evaluation of Nondestructive Test Data
Identification and characterization of indications in eddy current (ET) signals can be highly subjective in nature, with varying diagnoses made by different analysts or by a singl...
Arthur J. Levy, Jane E. Oppenlander, David M. Brud...