Search Sciweavers | Sciweavers

147 search results - page 10 / 30

» Policy Gradient in Continuous Time

160

click to vote

ICRA
2008
IEEE

129views Robotics» more ICRA 2008»

Compliant manipulation for peg-in-hole: Is passive compliance a key to learn contact motion?

16 years 1 months ago

Download groups.csail.mit.edu

— We examine the usefulness of passive compliance in a manipulator that learns contact motion. Based on the notice that humans outperforms robots with the contact motion, we foll...

Seung-kook Yun

claim paper

Read More »

148

click to vote

IJCNN
2000
IEEE

145views Neural Networks» more IJCNN 2000»

The Inefficiency of Batch Training for Large Training Sets

15 years 11 months ago

Download axon.cs.byu.edu

Multilayer perceptrons are often trained using error backpropagation (BP). BP training can be done in either a batch or continuous manner. Claims have frequently been made that bat...

D. Randall Wilson, Tony R. Martinez

claim paper

Read More »

213

click to vote

MST
2011

200views Hardware» more MST 2011»

Performance of Scheduling Policies in Adversarial Networks with Non-synchronized Clocks

15 years 1 months ago

Download www.irisa.fr

In this paper we generalize the Continuous Adversarial Queuing Theory (CAQT) model [5] by considering the possibility that the router clocks in the network are not synchronized. W...

Antonio Fernández Anta, José Luis L&...

claim paper

Read More »

169

click to vote

MM
1994
ACM

90views Multimedia» more MM 1994»

Scheduling Policies for an On-Demand Video Server with Batching

15 years 10 months ago

Download www.cs.sjsu.edu

In an on-demand video server environment, clients make requests for movies to a centralized video server. Due to the stringent response time requirements, continuous delivery of a...

Asit Dan, Dinkar Sitaram, Perwez Shahabuddin

claim paper

Read More »

164

Voted

UAI
2000

133views Artificial Intelligence» more UAI 2000»

PEGASUS: A policy search method for large MDPs and POMDPs

15 years 8 months ago

Download ai.stanford.edu

We propose a new approach to the problem of searching a space of policies for a Markov decision process (MDP) or a partially observable Markov decision process (POMDP), given a mo...

Andrew Y. Ng, Michael I. Jordan

claim paper

Read More »

« Prev « First page 10 / 30 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers