Search Sciweavers | Sciweavers

185 search results - page 14 / 37

» Simulation-Based Optimization Algorithms for Finite-Horizon ...

129

Voted

ATAL
2005
Springer

117views Intelligent Agents» more ATAL 2005»

Modeling task allocation using a decision theoretic model

15 years 9 months ago

Download dis.cs.umass.edu

Mediation is the process of decomposing a task into subtasks, ﬁnding agents suitable for these subtasks and negotiating with agents to obtain commitments to execute these subtas...

Sherief Abdallah, Victor R. Lesser

claim paper

Read More »

111

Voted

ICML
2006
IEEE

101views Machine Learning» more ICML 2006»

Qualitative reinforcement learning

16 years 4 months ago

Download www.cs.uiuc.edu

When the transition probabilities and rewards of a Markov Decision Process are specified exactly, the problem can be solved without any interaction with the environment. When no s...

Arkady Epshteyn, Gerald DeJong

claim paper

Read More »

133

Voted

CORR
2008
Springer

189views Education» more CORR 2008»

Algorithms for Dynamic Spectrum Access with Learning for Cognitive Radio

15 years 3 months ago

Download www.ifp.illinois.edu

We study the problem of dynamic spectrum sensing and access in cognitive radio systems as a partially observed Markov decision process (POMDP). A group of cognitive users cooperati...

Jayakrishnan Unnikrishnan, Venugopal V. Veeravalli

claim paper

Read More »

117

Voted

GLOBECOM
2006
IEEE

99views Communications» more GLOBECOM 2006»

Optimal Routing Between Alternate Paths With Different Network Transit Delays

15 years 9 months ago

Download www.cs.ucr.edu

— We consider the path-determination problem in Internet core routers that distribute ﬂows across alternate paths leading to the same destination. We assume that the remainder ...

Essia Hamouda Elhafsi, Mart Molle

claim paper

Read More »

117

Voted

ICML
2006
IEEE

142views Machine Learning» more ICML 2006»

An intrinsic reward mechanism for efficient exploration

16 years 4 months ago

Download www-anw.cs.umass.edu

How should a reinforcement learning agent act if its sole purpose is to efficiently learn an optimal policy for later use? In other words, how should it explore, to be able to exp...

Özgür Simsek, Andrew G. Barto

claim paper

Read More »

« Prev « First page 14 / 37 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers