Search Sciweavers | Sciweavers

74 search results - page 9 / 15

» Comparing User Simulation Models For Dialog Strategy Learnin...

271

click to vote

CSL
2012
Springer

311views Automated Reasoning» more CSL 2012»

Reinforcement learning for parameter estimation in statistical spoken dialogue systems

14 years 2 months ago

Download mi.eng.cam.ac.uk

Reinforcement techniques have been successfully used to maximise the expected cumulative reward of statistical dialogue systems. Typically, reinforcement learning is used to estim...

Filip Jurcícek, Blaise Thomson, Steve Young

claim paper

Read More »

183

click to vote

CORR
2010
Springer

105views Education» more CORR 2010»

Optimism in Reinforcement Learning Based on Kullback-Leibler Divergence

15 years 5 months ago

Download hal.archives-ouvertes.fr

We consider model-based reinforcement learning in ﬁnite Markov Decision Processes (MDPs), focussing on so-called optimistic strategies. Optimism is usually implemented by carryin...

Sarah Filippi, Olivier Cappé, Aurelien Gari...

claim paper

Read More »

192

click to vote

GLOBECOM
2009
IEEE

122views Communications» more GLOBECOM 2009»

Conjectural Equilibrium in Water-Filling Games

15 years 10 months ago

Download medianetlab.ee.ucla.edu

—This paper considers a non-cooperative game in which competing users sharing a frequency-selective interference channel selfishly optimize their power allocation in order to imp...

Yi Su, Mihaela van der Schaar

claim paper

Read More »

207

click to vote

ICDCSW
2005
IEEE

161views Computer Networks» more ICDCSW 2005»

QoS Oriented Dynamic Replica Cost Model for P2P Computing

16 years 13 days ago

Download grid.hust.edu.cn

Replication on multiple nodes is an effective way to improve the availability in the P2P or grid environment. It is difficult to determine how many replicas can fulfill the user r...

Feng Mao, Hai Jin, Deqing Zou, Baoli Chen, Li Qi

claim paper

Read More »

187

click to vote

SDM
2007
SIAM

198views Data Mining» more SDM 2007»

Learning from Time-Changing Data with Adaptive Windowing

15 years 8 months ago

Download siam.org

We present a new approach for dealing with distribution change and concept drift when learning from data sequences that may vary with time. We use sliding windows whose size, inst...

Albert Bifet, Ricard Gavaldà

claim paper

Read More »

« Prev « First page 9 / 15 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers