Search Sciweavers | Sciweavers

374 search results - page 29 / 75

» Multiagent Reinforcement Learning: Theoretical Framework and...

click to vote

ICASSP
2011
IEEE

204views Signal Processing» more ICASSP 2011»

Bayesian reinforcement learning for POMDP-based dialogue systems

12 years 11 months ago

Download mirlab.org

Spoken dialogue systems are gaining popularity with improvements in speech recognition technologies. Dialogue systems can be modeled effectively using POMDPs, achieving improvemen...

ShaoWei Png, Joelle Pineau

claim paper

Read More »

click to vote

ICML
2000
IEEE

126views Machine Learning» more ICML 2000»

Reinforcement Learning in POMDP's via Direct Gradient Ascent

14 years 8 months ago

Download reference.kfupm.edu.sa

This paper discusses theoretical and experimental aspects of gradient-based approaches to the direct optimization of policy performance in controlled ??? ?s. We introduce ??? ?, a...

Jonathan Baxter, Peter L. Bartlett

claim paper

Read More »

click to vote

NIPS
2004

138views Information Technology» more NIPS 2004»

New Criteria and a New Algorithm for Learning in Multi-Agent Systems

13 years 9 months ago

Download books.nips.cc

We propose a new set of criteria for learning algorithms in multi-agent systems, one that is more stringent and (we argue) better justified than previous proposed criteria. Our cr...

Rob Powers, Yoav Shoham

claim paper

Read More »

click to vote

IAT
2005
IEEE

138views Intelligent Agents» more IAT 2005»

Multiagent Reputation Management to Achieve Robust Software Using Redundancy

14 years 1 months ago

Download www.cse.sc.edu

This paper explains the building of robust software using multiagent reputation. One of the major goals of software engineering is to achieve robust software. Our hypothesis is th...

Rajesh Turlapati, Michael N. Huhns

claim paper

Read More »

click to vote

ICAC
2009
IEEE

226views Applied Computing» more ICAC 2009»

Using distributed w-learning for multi-policy optimization in decentralized autonomic systems

13 years 5 months ago

Download www.scss.tcd.ie

Distributed W-Learning (DWL) is a reinforcement learningbased algorithm for multi-policy optimization in agent-based systems. In this poster we propose the use of DWL for decentra...

Ivana Dusparic, Vinny Cahill

claim paper

Read More »

« Prev « First page 29 / 75 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers