Search Sciweavers | Sciweavers

380 search results - page 71 / 76

» Learning Recursive Control Programs from Problem Solving

224

click to vote

GECCO
2007
Springer

173views Optimization» more GECCO 2007»

UCSpv: principled voting in UCS rule populations

16 years 23 days ago

Download www.cs.man.ac.uk

Credit assignment is a fundamental issue for the Learning Classiﬁer Systems literature. We engage in a detailed investigation of credit assignment in one recent system called UC...

Gavin Brown, Tim Kovacs, James A. R. Marshall

claim paper

Read More »

191

click to vote

ATAL
2005
Springer

197views Intelligent Agents» more ATAL 2005»

Coordinating multiple rovers with interdependent science objectives

16 years 4 days ago

Download www-aig.jpl.nasa.gov

This paper describes an integrated system for coordinating multiple rover behavior with the overall goal of collecting planetary surface data. The MISUS system combines techniques...

Tara A. Estlin, Daniel M. Gaines, Forest Fisher, R...

claim paper

Read More »

213

click to vote

CORR
2008
Springer

118views Education» more CORR 2008»

Distributed Constrained Optimization with Semicoordinate Transformations

15 years 6 months ago

Download ti.arc.nasa.gov

Recent work has shown how information theory extends conventional full-rationality game theory to allow bounded rational agents. The associated mathematical framework can be used ...

William G. Macready, David Wolpert

claim paper

Read More »

169

click to vote

AAAI
1996

125views Intelligent Agents» more AAAI 1996»

The NASA Personnel Security Processing Expert System

15 years 8 months ago

Download www.aaai.org

The NASA Personnel Security Processing Expert System is a tool that automatically determines the appropriate personnel background investigation required for a civil servant or con...

David Silberberg, Robert Thomas

claim paper

Read More »

202

click to vote

JMLR
2006

124views more JMLR 2006»

Policy Gradient in Continuous Time

15 years 6 months ago

Download hal.inria.fr

Policy search is a method for approximately solving an optimal control problem by performing a parametric optimization search in a given class of parameterized policies. In order ...

Rémi Munos

claim paper

Read More »

« Prev « First page 71 / 76 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers