Search Sciweavers | Sciweavers

252 search results - page 12 / 51

» Optimal Sequential Exploration: A Binary Learning Model

click to vote

Publication

151views

Robust Bayesian reinforcement learning through tight lower bounds

12 years 6 months ago

Download arxiv.org

In the Bayesian approach to sequential decision making, exact calculation of the (subjective) utility is intractable. This extends to most special cases of interest, such as reinfo...

Christos Dimitrakakis

posted by olethros

Read More »

click to vote

PKDD
2010
Springer

179views Data Mining» more PKDD 2010»

Gaussian Processes for Sample Efficient Reinforcement Learning with RMAX-Like Exploration

13 years 5 months ago

Download www.cs.utexas.edu

Abstract. We present an implementation of model-based online reinforcement learning (RL) for continuous domains with deterministic transitions that is specifically designed to achi...

Tobias Jung, Peter Stone

claim paper

Read More »

click to vote

COLT
2008
Springer

132views Machine Learning» more COLT 2008»

When Random Play is Optimal Against an Adversary

13 years 9 months ago

Download www.cs.berkeley.edu

We analyze a sequential game between a Gambler and a Casino. The Gambler allocates bets from a limited budget over a fixed menu of gambling events that are offered at equal time i...

Jacob Abernethy, Manfred K. Warmuth, Joel Yellin

claim paper

Read More »

click to vote

ASPDAC
2004
ACM

120views Hardware» more ASPDAC 2004»

Compiler based exploration of DSP energy savings by SIMD operations

14 years 1 months ago

Download ls12-www.cs.tu-dortmund.de

— The growing use of digital signal processors (DSPs) in embedded systems necessitates the use of optimizing compilers supporting their special architecture features. Beside the ...

Markus Lorenz, Peter Marwedel, Thorsten Dräge...

claim paper

Read More »

click to vote

ICML
2010
IEEE

219views Machine Learning» more ICML 2010»

Convergence, Targeted Optimality, and Safety in Multiagent Learning

13 years 8 months ago

Download www.cs.utexas.edu

This paper introduces a novel multiagent learning algorithm, Convergence with Model Learning and Safety (or CMLeS in short), which achieves convergence, targeted optimality agains...

Doran Chakraborty, Peter Stone

claim paper

Read More »

« Prev « First page 12 / 51 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers