Search Sciweavers | Sciweavers

718 search results - page 103 / 144

AAAI
2006

190 views, Intelligent Agents

Action Selection in Bayesian Reinforcement Learning

15 years 4 months ago

Download www.aaai.org

My research attempts to address on-line action selection in reinforcement learning from a Bayesian perspective. The idea is to develop more effective action selection techniques b...

Tao Wang

NIPS
2007

127 views, Information Technology

On higher-order perceptron algorithms

15 years 4 months ago

Download books.nips.cc

A new algorithm for on-line learning linear-threshold functions is proposed which efficiently combines second-order statistics about the data with the "logarithmic behavior" ...

Claudio Gentile, Fabio Vitale, Cristian Brotto

ICML
2007
IEEE

141 views, Machine Learning

Exponentiated gradient algorithms for log-linear structured prediction

16 years 4 months ago

Download www.machinelearning.org

Conditional log-linear models are a commonly used method for structured prediction. Efficient learning of parameters in these models is therefore an important problem. This paper ...

Amir Globerson, Terry Koo, Xavier Carreras, Michae...

JMLR
2010

189 views

Adaptive Step-size Policy Gradients with Average Reward Metric

14 years 10 months ago

Download jmlr.csail.mit.edu

In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...

Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...

HPDC
2000
IEEE

202 views, Distributed And Parallel Com...

Creating Large Scale Database Servers

15 years 7 months ago

Download slac.stanford.edu

The BaBar experiment at the Stanford Linear Accelerator Center (SLAC) is designed to perform a high precision investigation of the decays of the B-meson produced from electron-pos...

Jacek Becla, Andrew Hanushevsky
