Search Sciweavers | Sciweavers

2011 search results - page 335 / 403

» Universal Reinforcement Learning

186

click to vote

ATAL
2008
Springer

180views Intelligent Agents» more ATAL 2008»

On the usefulness of opponent modeling: the Kuhn Poker case study

15 years 9 months ago

Download www.ifaamas.org

The application of reinforcement learning algorithms to Partially Observable Stochastic Games (POSG) is challenging since each agent does not have access to the whole state inform...

Alessandro Lazaric, Mario Quaresimale, Marcello Re...

claim paper

Read More »

193

click to vote

NIPS
1993

128views Information Technology» more NIPS 1993»

Convergence of Stochastic Iterative Dynamic Programming Algorithms

15 years 8 months ago

Download www.bitsavers.org

Recent developments in the area of reinforcement learning have yielded a number of new algorithms for the prediction and control of Markovian environments. These algorithms,includ...

Tommi Jaakkola, Michael I. Jordan, Satinder P. Sin...

claim paper

Read More »

203

click to vote

ICML
2010
IEEE

247views Machine Learning» more ICML 2010»

Inverse Optimal Control with Linearly-Solvable MDPs

15 years 8 months ago

Download www.cs.washington.edu

We present new algorithms for inverse optimal control (or inverse reinforcement learning, IRL) within the framework of linearlysolvable MDPs (LMDPs). Unlike most prior IRL algorit...

Dvijotham Krishnamurthy, Emanuel Todorov

claim paper

Read More »

204

click to vote

PEPM
2011
ACM

210views Software Engineering» more PEPM 2011»

Adaptation-based programming in java

14 years 10 months ago

Download web.engr.oregonstate.edu

Writing deterministic programs is often difﬁcult for problems whose optimal solutions depend on unpredictable properties of the programs’ inputs. Difﬁculty is also encounter...

Tim Bauer, Martin Erwig, Alan Fern, Jervis Pinto

claim paper

Read More »

208

click to vote

ALT
2004
Springer

95views Machine Learning» more ALT 2004»

New Revision Algorithms

16 years 4 months ago

Download www.cs.uky.edu

A revision algorithm is a learning algorithm that identiﬁes the target concept, starting from an initial concept. Such an algorithm is considered eﬃcient if its complexity (in ...

Judy Goldsmith, Robert H. Sloan, Balázs Sz&...

claim paper

Read More »

« Prev « First page 335 / 403 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers