Search Sciweavers | Sciweavers

69 search results - page 12 / 14

» PAC-Bayesian Policy Evaluation for Reinforcement Learning

177

Voted

ATAL
2008
Springer

160views Intelligent Agents» more ATAL 2008»

Sequential decision making in repeated coalition formation under uncertainty

15 years 7 months ago

Download www.aamas-conference.org

The problem of coalition formation when agents are uncertain about the types or capabilities of their potential partners is a critical one. In [3] a Bayesian reinforcement learnin...

Georgios Chalkiadakis, Craig Boutilier

claim paper

Read More »

136

click to vote

AAAI
2008

103views Intelligent Agents» more AAAI 2008»

Reinforcement Learning for Vulnerability Assessment in Peer-to-Peer Networks

15 years 8 months ago

Download web.engr.oregonstate.edu

Proactive assessment of computer-network vulnerability to unknown future attacks is an important but unsolved computer security problem where AI techniques have significant impact...

Scott Dejmal, Alan Fern, Thinh Nguyen

claim paper

Read More »

148

Voted

ICML
2010
IEEE

167views Machine Learning» more ICML 2010»

Finite-Sample Analysis of LSTD

15 years 6 months ago

Download hal.inria.fr

In this paper we consider the problem of policy evaluation in reinforcement learning, i.e., learning the value function of a fixed policy, using the least-squares temporal-differe...

Alessandro Lazaric, Mohammad Ghavamzadeh, Ré...

claim paper

Read More »

177

click to vote

NIPS
2008

110views Information Technology» more NIPS 2008»

Signal-to-Noise Ratio Analysis of Policy Gradient Algorithms

15 years 7 months ago

Download groups.csail.mit.edu

Policy gradient (PG) reinforcement learning algorithms have strong (local) convergence guarantees, but their learning performance is typically limited by a large variance in the e...

John W. Roberts, Russ Tedrake

claim paper

Read More »

218

click to vote

SIGDIAL
2010

137views Natural Language Processing» more SIGDIAL 2010»

Modeling Spoken Decision Making Dialogue and Optimization of its Dialogue Strategy

15 years 3 months ago

Download mastarpj.nict.go.jp

This paper presents a spoken dialogue framework that helps users in making decisions. Users often do not have a definite goal or criteria for selecting from a list of alternatives...

Teruhisa Misu, Komei Sugiura, Kiyonori Ohtake, Chi...

claim paper

Read More »

« Prev « First page 12 / 14 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers