Search Sciweavers | Sciweavers

115

ICTAI
2005
IEEE

117views Artificial Intelligence» more ICTAI 2005»

Planning with POMDPs Using a Compact, Logic-Based Representation

15 years 7 months ago

Partially Observable Markov Decision Processes (POMDPs) provide a general framework for AI planning, but they lack the structure for representing real world planning problems in a...

Chenggang Wang, James G. Schmolze

claim paper

Read More »

99

click to vote

ATAL
2008
Springer

104views Intelligent Agents» more ATAL 2008»

Expediting RL by using graphical structures

15 years 4 months ago

Download www.cs.washington.edu

The goal of Reinforcement learning (RL) is to maximize reward (minimize cost) in a Markov decision process (MDP) without knowing the underlying model a priori. RL algorithms tend ...

Peng Dai, Alexander L. Strehl, Judy Goldsmith

claim paper

Read More »

123

click to vote

IJCAI
2007

201views Artificial Intelligence» more IJCAI 2007»

Using Linear Programming for Bayesian Exploration in Markov Decision Processes

15 years 3 months ago

Download www.cs.mcgill.ca

A key problem in reinforcement learning is ﬁnding a good balance between the need to explore the environment and the need to gain rewards by exploiting existing knowledge. Much ...

Pablo Samuel Castro, Doina Precup

claim paper

Read More »

85

click to vote

ICANN
2007
Springer

122views Neural Networks» more ICANN 2007»

Biasing Neural Networks Towards Exploration or Exploitation Using Neuromodulation

15 years 8 months ago

Download www.parussel.com

Abstract. Taking neuromodulation as a mechanism underlying emotions, this paper investigates how such a mechanism can bias an artiﬁcial neural network towards exploration of new ...

Karla Parussel, Lola Cañamero

claim paper

Read More »

135

click to vote

FORMATS
2003
Springer

125views Formal Methods» more FORMATS 2003»

Performance Analysis of Probabilistic Timed Automata Using Digital Clocks

15 years 7 months ago

Download www.prismmodelchecker.org

Probabilistic timed automata, a variant of timed automata extended with discrete probability distributions, is a speciﬁcation formalism suitable for describing both nondeterminis...

Marta Z. Kwiatkowska, Gethin Norman, David Parker,...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers