Search Sciweavers | Sciweavers

682 search results - page 57 / 137

» One-Counter Markov Decision Processes

ATAL
2009
Springer

146 views, Intelligent Agents

Online exploration in least-squares policy iteration

15 years 10 months ago

Download www.aamas-conference.org

One of the key problems in reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large or even continuous Markov decision processes (...

Lihong Li, Michael L. Littman, Christopher R. Mans...

STACS
1997
Springer

137 views, Theoretical Computer Science

Methods and Applications of (MAX, +) Linear Algebra

15 years 8 months ago

Download www-rocq.inria.fr

Exotic semirings such as the “(max, +) semiring” (R ∪ {−∞}, max, +), or the “tropical semiring” (N ∪ {+∞}, min, +), have been invented and reinvented many times s...

Stephane Gaubert, Max Plus
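
As a rough illustration of the (max, +) semiring named in the abstract above, the sketch below treats max as the "addition" and ordinary + as the "multiplication" over R ∪ {−∞}, and uses them for a matrix product. The names `oplus`, `otimes`, and `maxplus_matmul` are hypothetical helpers chosen for this example, not taken from the paper.

```python
# Minimal sketch of (max, +) arithmetic; all names here are illustrative assumptions.
NEG_INF = float("-inf")  # the semiring "zero": neutral for max, absorbing for +


def oplus(a, b):
    """Semiring addition: the maximum of the two values."""
    return max(a, b)


def otimes(a, b):
    """Semiring multiplication: ordinary addition (its unit is 0)."""
    return a + b


def maxplus_matmul(A, B):
    """Matrix product where sums become max and products become +."""
    inner = len(B)
    cols = len(B[0])
    return [
        [max(otimes(row[k], B[k][j]) for k in range(inner)) for j in range(cols)]
        for row in A
    ]


A = [[0, 1], [NEG_INF, 2]]
B = [[3, NEG_INF], [0, 4]]
C = maxplus_matmul(A, B)
```

Read as edge weights, entry `C[i][j]` is the largest total weight over two-step paths from `i` to `j`, with −∞ marking a missing edge.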

AI
2006
Springer

110 views, Artificial Intelligence

An Efficient Resource Allocation Approach in Real-Time Stochastic Environment

15 years 7 months ago

Download www.damas.ift.ulaval.ca

We are interested in contributing to solving effectively a particular type of real-time stochastic resource allocation problem. Firstly, one distinction is that certain tasks may c...

Pierrick Plamondon, Brahim Chaib-draa, Abder Rezak...

SOCO
2010
Springer

148 views, Software Engineering

Using evolution strategies to solve DEC-POMDP problems

14 years 10 months ago

Download www.springerlink.com

Decentralized partially observable Markov decision process (DEC-POMDP) is an approach to model multi-robot decision making problems under uncertainty. Since it is NEXP-complete the...

Baris Eker, H. Levent Akin

ISCI
2000

98 views

Quantum decision-maker

15 years 3 months ago

Download www.cesar.ornl.gov

A quantum device simulating human decision making process is introduced. It consists of quantum recurrent nets generating stochastic processes which represent the motor dynamics, ...

Michail Zak
