Search Sciweavers | Sciweavers

147 search results - page 20 / 30

» Policy Gradient in Continuous Time

ML
2006
ACM

99 views, Machine Learning

Universal parameter optimisation in games based on SPSA

15 years 3 months ago

Download www.jhuapl.edu

Most game programs have a large number of parameters that are crucial for their performance. While tuning these parameters by hand is rather difficult, efficient and easy to use ge...

Levente Kocsis, Csaba Szepesvári

ICASSP
2009
IEEE

124 views, Signal Processing

Extended VTS for noise-robust speech recognition

15 years 10 months ago

Download mi.eng.cam.ac.uk

Model compensation is a standard way of improving the robustness of speech recognition systems to noise. A number of popular schemes are based on vector Taylor series (VTS) compen...

Rogier C. van Dalen, Mark J. F. Gales

CVPR
2010
IEEE

275 views, Computer Vision

Discontinuous Seam-Carving for Video Retargeting

15 years 6 months ago

Download www.cc.gatech.edu

We introduce a new algorithm for video retargeting that uses discontinuous seam-carving in both space and time for resizing videos. Our algorithm relies on a novel appearance-base...

Matthias Grundmann, Vivek Kwatra, Mei Han, Irfan E...

ICRA
2006
IEEE

161 views, Robotics

Quadruped Robot Obstacle Negotiation via Reinforcement Learning

15 years 9 months ago

Download www.stanford.edu

Legged robots can, in principle, traverse a large variety of obstacles and terrains. In this paper, we describe a successful application of reinforcement learning to the proble...

Honglak Lee, Yirong Shen, Chih-Han Yu, Gurjeet Sin...

NIPS
2003

105 views, Information Technology

Gaussian Processes in Reinforcement Learning

15 years 4 months ago

Download books.nips.cc

We exploit some useful properties of Gaussian process (GP) regression models for reinforcement learning in continuous state spaces and discrete time. We demonstrate how the GP mod...

Carl Edward Rasmussen, Malte Kuss
