Search Sciweavers | Sciweavers

1166 search results - page 158 / 234

» Negotiating Using Rewards

ICML
1994
IEEE

152 views, Machine Learning

A Modular Q-Learning Architecture for Manipulator Task Decomposition

15 years 10 months ago

Compositional Q-Learning (CQ-L) (Singh 1992) is a modular approach to learning to perform composite tasks made up of several elemental tasks by reinforcement learning. Skills acqui...

Chen K. Tham, Richard W. Prager

EWRL
2008

104 views, Machine Learning

Optimistic Planning of Deterministic Systems

15 years 9 months ago

If one possesses a model of a controlled deterministic system, then from any state, one may consider the set of all possible reachable states starting from that state and using any...

Jean-François Hren, Rémi Munos

NIPS
2008

159 views, Information Technology

Policy Search for Motor Primitives in Robotics

15 years 8 months ago

Many motor skills in humanoid robotics can be learned using parametrized motor primitives as done in imitation learning. However, most interesting motor learning problems are high...

Jens Kober, Jan Peters

ANLP
1994

80 views

Improving Language Models by Clustering Training Sentences

15 years 8 months ago

Many of the kinds of language model used in speech understanding suffer from imperfect modeling of intra-sentential contextual influences. I argue that this problem can be address...

David M. Carter

ATAL
2010
Springer

146 views, Intelligent Agents

PAC-MDP learning with knowledge-based admissible models

15 years 7 months ago

PAC-MDP algorithms approach the exploration-exploitation problem of reinforcement learning agents in an effective way which guarantees that with high probability, the algorithm pe...

Marek Grzes, Daniel Kudenko
