Sciweavers

682 search results - page 57 / 137
» One-Counter Markov Decision Processes
ATAL
2009
Springer
15 years 10 months ago
Online exploration in least-squares policy iteration
One of the key problems in reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large or even continuous Markov decision processes (...
Lihong Li, Michael L. Littman, Christopher R. Mans...
STACS
1997
Springer
15 years 8 months ago
Methods and Applications of (MAX, +) Linear Algebra
Exotic semirings such as the “(max, +) semiring” (R ∪ {−∞}, max, +), or the “tropical semiring” (N ∪ {+∞}, min, +), have been invented and reinvented many times s...
Stephane Gaubert, Max Plus
AI
2006
Springer
15 years 7 months ago
An Efficient Resource Allocation Approach in Real-Time Stochastic Environment
We are interested in contributing to solving effectively a particular type of real-time stochastic resource allocation problem. Firstly, one distinction is that certain tasks may c...
Pierrick Plamondon, Brahim Chaib-draa, Abder Rezak...
SOCO
2010
Springer
14 years 10 months ago
Using evolution strategies to solve DEC-POMDP problems
The decentralized partially observable Markov decision process (DEC-POMDP) is an approach to modeling multi-robot decision-making problems under uncertainty. Since it is NEXP-complete the...
Baris Eker, H. Levent Akin
ISCI
2000
15 years 3 months ago
Quantum decision-maker
A quantum device simulating the human decision-making process is introduced. It consists of quantum recurrent nets generating stochastic processes which represent the motor dynamics, ...
Michail Zak