Search Sciweavers | Sciweavers

1227 search results - page 184 / 246

» Learning Rates for Q-Learning

179

click to vote

ICML
2010
IEEE

228views Machine Learning» more ICML 2010»

Clustering processes

15 years 8 months ago

Download daniil.ryabko.net

The problem of clustering is considered, for the case when each data point is a sample generated by a stationary ergodic process. We propose a very natural asymptotic notion of co...

Daniil Ryabko

claim paper

Read More »

256

click to vote

ICML
2010
IEEE

204views Machine Learning» more ICML 2010»

Gaussian Process Optimization in the Bandit Setting: No Regret and Experimental Design

15 years 8 months ago

Download www.its.caltech.edu

Many applications require optimizing an unknown, noisy function that is expensive to evaluate. We formalize this task as a multiarmed bandit problem, where the payoff function is ...

Niranjan Srinivas, Andreas Krause, Sham Kakade, Ma...

claim paper

Read More »

186

click to vote

AAAI
2010

193views Intelligent Agents» more AAAI 2010»

The Boosting Effect of Exploratory Behaviors

15 years 7 months ago

Download home.engineering.iastate.edu

Active object exploration is one of the hallmarks of human and animal intelligence. Research in psychology has shown that the use of multiple exploratory behaviors is crucial for ...

Jivko Sinapov, Alexander Stoytchev

claim paper

Read More »

178

click to vote

ICANN
2010
Springer

164views Neural Networks» more ICANN 2010»

Multi-Dimensional Deep Memory Atari-Go Players for Parameter Exploring Policy Gradients

15 years 7 months ago

Download www.idsia.ch

Abstract. Developing superior artificial board-game players is a widelystudied area of Artificial Intelligence. Among the most challenging games is the Asian game of Go, which, des...

Mandy Grüttner, Frank Sehnke, Tom Schaul, J&u...

claim paper

Read More »

174

click to vote

CORR
2010
Springer

64views Education» more CORR 2010»

Selfish Response to Epidemic Propagation

15 years 7 months ago

Download infoscience.epfl.ch

An epidemic spreading in a network calls for a decision on the part of the network members: They should decide whether to protect themselves or not. Their decision depends on the ...

George Theodorakopoulos, Jean-Yves Le Boudec, John...

claim paper

Read More »

« Prev « First page 184 / 246 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers