Search Sciweavers | Sciweavers

113 search results - page 21 / 23

» Model Approximation for HEXQ Hierarchical Reinforcement Lear...

232

Voted

TSP
2011

230views more TSP 2011»

Bayesian Nonparametric Inference of Switching Dynamic Linear Models

15 years 2 months ago

Download web.mit.edu

—Many complex dynamical phenomena can be effectively modeled by a system that switches among a set of conditionally linear dynamical modes. We consider two such models: the switc...

Emily B. Fox, Erik B. Sudderth, Michael I. Jordan,...

claim paper

Read More »

227

click to vote

ICML
1999
IEEE

168views Machine Learning» more ICML 1999»

Least-Squares Temporal Difference Learning

16 years 8 months ago

Download www.research.rutgers.edu

Excerpted from: Boyan, Justin. Learning Evaluation Functions for Global Optimization. Ph.D. thesis, Carnegie Mellon University, August 1998. (Available as Technical Report CMU-CS-...

Justin A. Boyan

claim paper

Read More »

205

Voted

NECO
2007

127views more NECO 2007»

Visual Recognition and Inference Using Dynamic Overcomplete Sparse Learning

15 years 7 months ago

Download dsp.ucsd.edu

We present a hierarchical architecture and learning algorithm for visual recognition and other visual inference tasks such as imagination, reconstruction of occluded images, and e...

Joseph F. Murray, Kenneth Kreutz-Delgado

claim paper

Read More »

195

Voted

NIPS
2001

101views Information Technology» more NIPS 2001»

The Emergence of Multiple Movement Units in the Presence of Noise and Feedback Delay

15 years 8 months ago

Download books.nips.cc

Tangential hand velocity profiles of rapid human arm movements often appear as sequences of several bell-shaped acceleration-deceleration phases called submovements or movement un...

Michael Kositsky, Andrew G. Barto

claim paper

Read More »

233

click to vote

CSL
2010
Springer

238views Automated Reasoning» more CSL 2010»

Bayesian update of dialogue state: A POMDP framework for spoken dialogue systems

15 years 7 months ago

Download mi.eng.cam.ac.uk

This paper describes a statistically motivated framework for performing real-time dialogue state updates and policy learning in a spoken dialogue system. The framework is based on...

Blaise Thomson, Steve Young

claim paper

Read More »

« Prev « First page 21 / 23 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers