Search Sciweavers | Sciweavers

332 search results - page 38 / 67

» Ranking policies in discrete Markov decision processes


AAAI
1996


Rewarding Behaviors


Markov decision processes (MDPs) are a very popular tool for decision theoretic planning (DTP), partly because of the well-developed, expressive theory that includes effective solu...

Fahiem Bacchus, Craig Boutilier, Adam J. Grove


ECAI
2008
Springer


A hybrid approach to multi-agent decision-making


In the aftermath of a large-scale disaster, agents’ decisions derive from self-interested (e.g. survival), common-good (e.g. victims’ rescue) and teamwork (e.g. fire...

Paulo Trigo, Helder Coelho


GLOBECOM
2007
IEEE


Cross-Layer Call Admission Control for a CDMA Uplink Employing a Base-Station Antenna Array


A novel cross-layer call admission control policy is proposed for a general CDMA beamforming system. In contrast to previously proposed call admission control (CAC) policies wh...

Wei Sheng, Steven D. Blostein


NECO
2007


Reinforcement Learning, Spike-Time-Dependent Plasticity, and the BCM Rule


Learning agents, whether natural or artificial, must update their internal parameters in order to improve their behavior over time. In reinforcement learning, this plasticity is ...

Dorit Baras, Ron Meir


CSL
2012
Springer


Reinforcement learning for parameter estimation in statistical spoken dialogue systems


Reinforcement techniques have been successfully used to maximise the expected cumulative reward of statistical dialogue systems. Typically, reinforcement learning is used to estim...

Filip Jurčíček, Blaise Thomson, Steve Young
