Sciweavers

771 search results - page 51 / 155
» Markov Decision Processes with Arbitrary Reward Processes
ICRA
2008
IEEE
15 years 10 months ago
Bayesian reinforcement learning in continuous POMDPs with application to robot navigation
We consider the problem of optimal control in continuous and partially observable environments when the parameters of the model are not known exactly. Partially Observable Mark...
Stéphane Ross, Brahim Chaib-draa, Joelle Pi...
ICAC
2005
IEEE
15 years 9 months ago
Self-Optimizing Architecture for QoS Provisioning in Differentiated Services
This paper presents a scalable and self-optimizing architecture for Quality-of-Service (QoS) provisioning in the Differentiated Services (DiffServ) framework. The proposed archite...
Daniel Yagan, Chen-Khong Tham
NIPS
2008
15 years 5 months ago
Biasing Approximate Dynamic Programming with a Lower Discount Factor
Most algorithms for solving Markov decision processes rely on a discount factor, which ensures their convergence. It is generally assumed that using an artificially low discount f...
Marek Petrik, Bruno Scherrer
ICASSP
2008
IEEE
15 years 10 months ago
Distributed multi-dimensional hidden Markov models for image and trajectory-based video classifications
In this paper, we propose a novel multi-dimensional distributed hidden Markov model (DHMM) framework. We first extend the theory of 2D hidden Markov models (HMMs) to arbitrary ca...
Xiang Ma, Dan Schonfeld, Ashfaq A. Khokhar
ICTAI
2007
IEEE
15 years 10 months ago
Multi-criteria Decision Making for Local Coordination in Multi-agent Systems
Unlike mono-agent systems, multi-agent planning addresses the problem of resolving conflicts between individual and group interests. In this paper, we are using a Decentralized Ve...
Matthieu Boussard, Maroua Bouzid, Abdel-Illah Moua...