Search Sciweavers | Sciweavers

ICML
2007
IEEE

141 views | Machine Learning

Reinforcement learning by reward-weighted regression for operational space control

16 years 4 months ago

Download www.machinelearning.org

Many robot control problems of practical importance, including operational space control, can be reformulated as immediate reward reinforcement learning problems. However, few of ...

Jan Peters, Stefan Schaal

ECML
2005
Springer

101 views | Machine Learning

Model-Based Online Learning of POMDPs

15 years 9 months ago

Download www.cs.bgu.ac.il

Learning to act in an unknown partially observable domain is a difficult variant of the reinforcement learning paradigm. Research in the area has focused on model-free m...

Guy Shani, Ronen I. Brafman, Solomon Eyal Shimony

NIPS
2008

129 views | Information Technology

Structure Learning in Human Sequential Decision-Making

15 years 5 months ago

Download www-users.cs.umn.edu

We use graphical models and structure learning to explore how people learn policies in sequential decision making tasks. Studies of sequential decision-making in humans frequently...

Daniel Acuña, Paul R. Schrater

JFR
2006

108 views

Learning in a hierarchical control system: 4D/RCS in the DARPA LAGR program

15 years 3 months ago

Download www.isd.mel.nist.gov

The Defense Advanced Research Projects Agency (DARPA) Learning Applied to Ground Robots (LAGR) program aims to develop algorithms for autonomous vehicle navigation that learn how...

James S. Albus, Roger Bostelman, Tommy Chang, Tsai...

ECML
2003
Springer

149 views | Machine Learning

Could Active Perception Aid Navigation of Partially Observable Grid Worlds?

15 years 9 months ago

Download homepages.inf.ed.ac.uk

Due to the unavoidable fact that a robot’s sensors will be limited in some manner, it is entirely possible that it can find itself unable to distinguish between differing state...

Paul A. Crook, Gillian Hayes
