Search Sciweavers | Sciweavers

332 search results - page 57 / 67

» Ranking policies in discrete Markov decision processes

click to vote

ATAL
2011
Springer

169views Intelligent Agents» more ATAL 2011»

Towards a unifying characterization for quantifying weak coupling in dec-POMDPs

12 years 8 months ago

Download ai.eecs.umich.edu

Researchers in the ﬁeld of multiagent sequential decision making have commonly used the terms “weakly-coupled” and “loosely-coupled” to qualitatively classify problems i...

Stefan J. Witwicki, Edmund H. Durfee

claim paper

Read More »

click to vote

CN
2004

138views more CN 2004»

Performance evaluation of prioritized scheduling with buffer management for differentiated services architectures

13 years 7 months ago

Download trl.cs.queensu.ca

Differentiated services (DiffServ) is an architecture for the Internet in which various applications are supported using a simple classification scheme. Packets entering the DiffS...

Ahmed E. Kamal, Hossam S. Hassanein

claim paper

Read More »

click to vote

LION
2007
Springer

192views Optimization» more LION 2007»

Learning While Optimizing an Unknown Fitness Surface

14 years 2 months ago

Download www.science.unitn.it

This paper is about Reinforcement Learning (RL) applied to online parameter tuning in Stochastic Local Search (SLS) methods. In particular a novel application of RL is considered i...

Roberto Battiti, Mauro Brunato, Paolo Campigotto

claim paper

Read More »

click to vote

NIPS
1998

137views Information Technology» more NIPS 1998»

Risk Sensitive Reinforcement Learning

13 years 9 months ago

Download www.cs.cmu.edu

In this paper, we consider Markov Decision Processes (MDPs) with error states. Error states are those states entering which is undesirable or dangerous. We define the risk with re...

Ralph Neuneier, Oliver Mihatsch

claim paper

Read More »

click to vote

AAAI
2012

220views Intelligent Agents» more AAAI 2012»

A Dynamic Rationalization of Distance Rationalizability

11 years 10 months ago

Download www.cs.cmu.edu

Distance rationalizability is an intuitive paradigm for developing and studying voting rules: given a notion of consensus and a distance function on preference proﬁles, a ration...

Craig Boutilier, Ariel D. Procaccia

claim paper

Read More »

« Prev « First page 57 / 67 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers