Search Sciweavers | Sciweavers

423 search results - page 27 / 85

» Multi-objective Model Checking of Markov Decision Processes

133

click to vote

CORR
2010
Springer

105views Education» more CORR 2010»

Optimism in Reinforcement Learning Based on Kullback-Leibler Divergence

15 years 1 months ago

Download hal.archives-ouvertes.fr

We consider model-based reinforcement learning in ﬁnite Markov Decision Processes (MDPs), focussing on so-called optimistic strategies. Optimism is usually implemented by carryin...

Sarah Filippi, Olivier Cappé, Aurelien Gari...

claim paper

Read More »

105

Voted

AAAI
2010

144views Intelligent Agents» more AAAI 2010»

Representation Discovery in Sequential Decision Making

15 years 4 months ago

Download www.cs.umass.edu

Automatically constructing novel representations of tasks from analysis of state spaces is a longstanding fundamental challenge in AI. I review recent progress on this problem for...

Sridhar Mahadevan

claim paper

Read More »

123

click to vote

CAV
2009
Springer

156views Hardware» more CAV 2009»

Towards Performance Prediction of Compositional Models in Industrial GALS Designs

15 years 10 months ago

Download ftp.inrialpes.fr

Systems and Networks on Chips (NoCs) are a prime design focus of many hardware manufacturers. In addition to functional veriﬁcation, which is a diﬃcult necessity, the chip desi...

Nicolas Coste, Holger Hermanns, Etienne Lantreibec...

claim paper

Read More »

162

click to vote

JSAC
2008

95views more JSAC 2008»

Cognitive Medium Access: Constraining Interference Based on Experimental Models

15 years 1 months ago

Download acsp.ece.cornell.edu

In this paper we design a cognitive radio that can coexist with multiple parallel WLAN channels while abiding by an interference constraint. The interaction between both systems is...

Stefan Geirhofer, Lang Tong, Brian M. Sadler

claim paper

Read More »

123

click to vote

PKDD
2009
Springer

129views Data Mining» more PKDD 2009»

Considering Unseen States as Impossible in Factored Reinforcement Learning

15 years 9 months ago

Download www-desir.lip6.fr

Abstract. The Factored Markov Decision Process (FMDP) framework is a standard representation for sequential decision problems under uncertainty where the state is represented as a ...

Olga Kozlova, Olivier Sigaud, Pierre-Henri Wuillem...

claim paper

Read More »

« Prev « First page 27 / 85 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers