Search Sciweavers | Sciweavers

286 search results - page 53 / 58

» Using inaccurate models in reinforcement learning

233

click to vote

CSL
2010
Springer

238views Automated Reasoning» more CSL 2010»

Bayesian update of dialogue state: A POMDP framework for spoken dialogue systems

15 years 7 months ago

Download mi.eng.cam.ac.uk

This paper describes a statistically motivated framework for performing real-time dialogue state updates and policy learning in a spoken dialogue system. The framework is based on...

Blaise Thomson, Steve Young

claim paper

Read More »

206

Voted

ICML
1998
IEEE

179views Machine Learning» more ICML 1998»

Value Function Based Production Scheduling

16 years 8 months ago

Download www.ri.cmu.edu

Production scheduling, the problem of sequentially con guring a factory to meet forecasted demands, is a critical problem throughout the manufacturing industry. The requirement of...

Jeff G. Schneider, Justin A. Boyan, Andrew W. Moor...

claim paper

Read More »

209

click to vote

IROS
2007
IEEE

172views Robotics» more IROS 2007»

Motor control optimization of compliant one-legged locomotion in rough terrain

16 years 1 months ago

Download groups.csail.mit.edu

— While underactuated robotic systems are capable of energy efﬁcient and rapid dynamic behavior, we still do not fully understand how body dynamics can be actively used for ada...

Fumiya Iida, Russ Tedrake

claim paper

Read More »

182

click to vote

NETCOOP
2007
Springer

130views Computer Networks» more NETCOOP 2007»

Load Shared Sequential Routing in MPLS Networks: System and User Optimal Solutions

16 years 1 months ago

Download www.tsp.ece.mcgill.ca

Recently Gerald Ash has shown through case studies that event dependent routing is attractive in large scale multi-service MPLS networks. In this paper, we consider the application...

Gilles Brunet, Fariba Heidari, Lorne Mason

claim paper

Read More »

195

click to vote

NIPS
2001

101views Information Technology» more NIPS 2001»

The Emergence of Multiple Movement Units in the Presence of Noise and Feedback Delay

15 years 8 months ago

Download books.nips.cc

Tangential hand velocity profiles of rapid human arm movements often appear as sequences of several bell-shaped acceleration-deceleration phases called submovements or movement un...

Michael Kositsky, Andrew G. Barto

claim paper

Read More »

« Prev « First page 53 / 58 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers