Search Sciweavers | Sciweavers

1210 search results - page 216 / 242

» Newton's method and its use in optimization

click to vote

PKDD
2010
Springer

179views Data Mining» more PKDD 2010»

Gaussian Processes for Sample Efficient Reinforcement Learning with RMAX-Like Exploration

13 years 5 months ago

Download www.cs.utexas.edu

Abstract. We present an implementation of model-based online reinforcement learning (RL) for continuous domains with deterministic transitions that is specifically designed to achi...

Tobias Jung, Peter Stone

claim paper

Read More »

click to vote

ICMCS
2009
IEEE

98views Multimedia» more ICMCS 2009»

Scalability of HTTP pacing with intelligent bursting

13 years 5 months ago

Download www.cs.unh.edu

While streaming protocols like RTSP/RTP have continued to evolved, HTTP has remained a primary method for Web-based video retrieval. The ubiquity and simplicity of HTTP makes it a...

Kevin J. Ma, Radim Bartos, Swapnil Bhatia

claim paper

Read More »

click to vote

CDC
2010
IEEE

112views Control Systems» more CDC 2010»

Online Convex Programming and regularization in adaptive control

13 years 2 months ago

Download www.mast.queensu.ca

Online Convex Programming (OCP) is a recently developed model of sequential decision-making in the presence of time-varying uncertainty. In this framework, a decisionmaker selects ...

Maxim Raginsky, Alexander Rakhlin, Serdar Yük...

claim paper

Read More »

click to vote

NECO
2011

245views Computer Networks» more NECO 2011»

Least-Squares Independent Component Analysis

13 years 2 months ago

Download sugiyama-www.cs.titech.ac.jp

Accurately evaluating statistical independence among random variables is a key element of Independent Component Analysis (ICA). In this paper, we employ a squared-loss variant of ...

Taiji Suzuki, Masashi Sugiyama

claim paper

Read More »

click to vote

CSL
2010
Springer

238views Automated Reasoning» more CSL 2010»

Bayesian update of dialogue state: A POMDP framework for spoken dialogue systems

13 years 7 months ago

Download mi.eng.cam.ac.uk

This paper describes a statistically motivated framework for performing real-time dialogue state updates and policy learning in a spoken dialogue system. The framework is based on...

Blaise Thomson, Steve Young

claim paper

Read More »

« Prev « First page 216 / 242 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers