Search Sciweavers | Sciweavers

109 search results - page 4 / 22

» Model Checking Markov Reward Models with Impulse Rewards

Publication

233 views

Sparse reward processes

14 years 4 months ago

Download arxiv.org

We introduce a class of learning problems where the agent is presented with a series of tasks. Intuitively, if there is a relation among those tasks, then the information gained duri...

Christos Dimitrakakis

posted by olethros

ECML 2005, Springer

120 views, Machine Learning

Using Rewards for Belief State Updates in Partially Observable Markov Decision Processes

15 years 11 months ago

Download www.cs.mcgill.ca

Partially Observable Markov Decision Processes (POMDP) provide a standard framework for sequential decision making in stochastic environments. In this setting, an agent takes actio...

Masoumeh T. Izadi, Doina Precup

CORR 2010, Springer

136 views, Education

The Highest Expected Reward Decoding for HMMs with Application to Recombination Detection

15 years 3 months ago

Download compbio.fmph.uniba.sk

Hidden Markov models are traditionally decoded by the Viterbi algorithm, which finds the highest probability state path in the model. In recent years, several limitations ...

Michal Nánási, Tomás Vinar, B...

MASCOTS 1996

99 views, Modeling And Simulation

Well-Defined Stochastic Petri Nets

15 years 7 months ago

Download www.cs.ucr.edu

Formalisms based on stochastic Petri Nets (SPNs) can employ structural analysis to ensure that the underlying stochastic process is fully determined. The focus is on the detection...

Gianfranco Ciardo, Robert Zijal

ICML 2006, IEEE

142 views, Machine Learning

An intrinsic reward mechanism for efficient exploration

16 years 6 months ago

Download www-anw.cs.umass.edu

How should a reinforcement learning agent act if its sole purpose is to efficiently learn an optimal policy for later use? In other words, how should it explore, to be able to exp...

Özgür Simsek, Andrew G. Barto
