Search Sciweavers | Sciweavers

109 search results - page 13 / 22

» Model Checking Markov Reward Models with Impulse Rewards

click to vote

AAAI
1997

133views Intelligent Agents» more AAAI 1997»

Incremental Methods for Computing Bounds in Partially Observable Markov Decision Processes

13 years 11 months ago

Download www.cs.pitt.edu

Partially observable Markov decision processes (POMDPs) allow one to model complex dynamic decision or control problems that include both action outcome uncertainty and imperfect ...

Milos Hauskrecht

claim paper

Read More »

click to vote

CDC
2009
IEEE

169views Control Systems» more CDC 2009»

Parametric regret in uncertain Markov decision processes

14 years 3 months ago

Download www.cim.mcgill.ca

— We consider decision making in a Markovian setup where the reward parameters are not known in advance. Our performance criterion is the gap between the performance of the best ...

Huan Xu, Shie Mannor

claim paper

Read More »

click to vote

COMPSAC
2009
IEEE

83views Software Engineering» more COMPSAC 2009»

Modeling and Predicting Software Failure Costs

14 years 3 months ago

Download www.grottke.de

—For software, the costs of failures are not clearly understood. Often, these costs disappear in the costs of testing, the general developments costs, or the operating expenses. ...

Michael Grottke, Christian A. Graf

claim paper

Read More »

click to vote

ILP
2007
Springer

283views Automated Reasoning» more ILP 2007»

Building Relational World Models for Reinforcement Learning

14 years 4 months ago

Download ftp.cs.wisc.edu

Abstract. Many reinforcement learning domains are highly relational. While traditional temporal-difference methods can be applied to these domains, they are limited in their capaci...

Trevor Walker, Lisa Torrey, Jude W. Shavlik, Richa...

claim paper

Read More »

click to vote

CORR
2007
Springer

143views Education» more CORR 2007»

On Myopic Sensing for Multi-Channel Opportunistic Access

13 years 10 months ago

Download www.ece.ucdavis.edu

We consider a multi-channel opportunistic communication system where the states of these channels evolve as independent and statistically identical Markov chains (the Gilbert-Elli...

Qing Zhao, Bhaskar Krishnamachari, Keqin Liu

claim paper

Read More »

« Prev « First page 13 / 22 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers