Search Sciweavers | Sciweavers

2045 search results - page 18 / 409

» Learning programming with Erlang

163

ESANN
2006

114 views, Neural Networks

Reducing policy degradation in neuro-dynamic programming

15 years 8 months ago

Download ml.informatik.uni-freiburg.de

We focus on neuro-dynamic programming methods to learn state-action value functions and outline some of the inherent problems to be faced, when performing reinforcement learning in...

Thomas Gabel, Martin Riedmiller

193

ICML
2008
IEEE

119 views, Machine Learning

Message-passing for graph-structured linear programs: proximal projections, convergence and rounding schemes

16 years 7 months ago

Download www.stat.berkeley.edu

Pradeep D. Ravikumar, Alekh Agarwal, Martin J. Wai...

143

ICML
2005
IEEE

121 views, Machine Learning

Integer linear programming inference for conditional random fields

16 years 7 months ago

Download l2r.cs.uiuc.edu

Dan Roth, Wen-tau Yih

143

COLT
1992
Springer

99 views, Machine Learning

PAC-Learnability of Determinate Logic Programs

15 years 10 months ago

Download www.doc.ic.ac.uk

Saso Dzeroski, Stephen Muggleton, Stuart J. Russel...

289

GECCO
2011
Springer

276 views, Optimization

Evolution of reward functions for reinforcement learning

14 years 10 months ago

Download hampshire.edu

The reward functions that drive reinforcement learning systems are generally derived directly from the descriptions of the problems that the systems are being used to solve. In so...

Scott Niekum, Lee Spector, Andrew G. Barto
