Search Sciweavers | Sciweavers

1235 search results - page 241 / 247

» ABC Reinforcement Learning

181

click to vote

AAAI
2010

140views Intelligent Agents» more AAAI 2010»

The Model-Based Approach to Autonomous Behavior: A Personal View

15 years 8 months ago

Download www.dtic.upf.edu

The selection of the action to do next is one of the central problems faced by autonomous agents. In AI, three approaches have been used to address this problem: the programming-b...

Hector Geffner

claim paper

Read More »

195

click to vote

UAI
2003

172views Artificial Intelligence» more UAI 2003»

On the Convergence of Bound Optimization Algorithms

15 years 8 months ago

Download cs.nyu.edu

Many practitioners who use EM and related algorithms complain that they are sometimes slow. When does this happen, and what can be done about it? In this paper, we study the gener...

Ruslan Salakhutdinov, Sam T. Roweis, Zoubin Ghahra...

claim paper

Read More »

265

click to vote

KDD
2010
ACM

289views Data Mining» more KDD 2010»

Exploitation and exploration in a performance based contextual advertising system

15 years 4 months ago

Download www.cs.umass.edu

The dynamic marketplace in online advertising calls for ranking systems that are optimized to consistently promote and capitalize better performing ads. The streaming nature of on...

Wei Li 0010, Xuerui Wang, Ruofei Zhang, Ying Cui, ...

claim paper

Read More »

189

click to vote

AGENTS
2000
Springer

119views Security Privacy» more AGENTS 2000»

Adaptivity in agent-based routing for data networks

15 years 11 months ago

Download web.engr.oregonstate.edu

Adaptivity, both of the individual agents and of the interaction structure among the agents, seems indispensable for scaling up multi-agent systems MAS's in noisy environme...

David Wolpert, Sergey Kirshner, Christopher J. Mer...

claim paper

Read More »

177

click to vote

ATAL
2008
Springer

180views Intelligent Agents» more ATAL 2008»

On the usefulness of opponent modeling: the Kuhn Poker case study

15 years 8 months ago

Download www.ifaamas.org

The application of reinforcement learning algorithms to Partially Observable Stochastic Games (POSG) is challenging since each agent does not have access to the whole state inform...

Alessandro Lazaric, Mario Quaresimale, Marcello Re...

claim paper

Read More »

« Prev « First page 241 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers