Search Sciweavers | Sciweavers

1262 search results - page 246 / 253

» Reinforcement Learning: An Introduction

295

Voted

KDD
2010
ACM

289views Data Mining» more KDD 2010»

Exploitation and exploration in a performance based contextual advertising system

15 years 5 months ago

Download www.cs.umass.edu

The dynamic marketplace in online advertising calls for ranking systems that are optimized to consistently promote and capitalize better performing ads. The streaming nature of on...

Wei Li 0010, Xuerui Wang, Ruofei Zhang, Ying Cui, ...

claim paper

Read More »

249

Voted

EMNLP
2011

164views Natural Language Processing» more EMNLP 2011»

Watermarking the Outputs of Structured Prediction with an application in Statistical Machine Translation

14 years 7 months ago

Download cs.jhu.edu

We propose a general method to watermark and probabilistically identify the structured outputs of machine learning algorithms. Our method is robust to local editing operations and...

Ashish Venugopal, Jakob Uszkoreit, David Talbot, F...

claim paper

Read More »

213

Voted

AGENTS
2000
Springer

119views Security Privacy» more AGENTS 2000»

Adaptivity in agent-based routing for data networks

15 years 11 months ago

Download web.engr.oregonstate.edu

Adaptivity, both of the individual agents and of the interaction structure among the agents, seems indispensable for scaling up multi-agent systems MAS's in noisy environme...

David Wolpert, Sergey Kirshner, Christopher J. Mer...

claim paper

Read More »

191

Voted

ATAL
2008
Springer

180views Intelligent Agents» more ATAL 2008»

On the usefulness of opponent modeling: the Kuhn Poker case study

15 years 9 months ago

Download www.ifaamas.org

The application of reinforcement learning algorithms to Partially Observable Stochastic Games (POSG) is challenging since each agent does not have access to the whole state inform...

Alessandro Lazaric, Mario Quaresimale, Marcello Re...

claim paper

Read More »

196

Voted

NIPS
1993

128views Information Technology» more NIPS 1993»

Convergence of Stochastic Iterative Dynamic Programming Algorithms

15 years 8 months ago

Download www.bitsavers.org

Recent developments in the area of reinforcement learning have yielded a number of new algorithms for the prediction and control of Markovian environments. These algorithms,includ...

Tommi Jaakkola, Michael I. Jordan, Satinder P. Sin...

claim paper

Read More »

« Prev « First page 246 / 253 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers