Search Sciweavers | Sciweavers

19 search results - page 4 / 4

» Strong Controllability of Disjunctive Temporal Problems with...

NIPS
2001

206 views, Information Technology

Model-Free Least-Squares Policy Iteration

15 years 8 months ago

Download www.cs.duke.edu

We propose a new approach to reinforcement learning which combines least squares function approximation with policy iteration. Our method is model-free and completely off policy. ...

Michail G. Lagoudakis, Ronald Parr

ICRA
2006
IEEE

131 views, Robotics

Using Reinforcement Learning to Improve Exploration Trajectories for Error Minimization

16 years 1 month ago

Download mapleleaf.csail.mit.edu

The mapping and localization problems have received considerable attention in robotics recently. The exploration problem that drives mapping has started to generate sim...

Thomas Kollar, Nicholas Roy

AAAI
2007

156 views, Intelligent Agents

Automated Online Mechanism Design and Prophet Inequalities

15 years 9 months ago

Download www.cs.cmu.edu

Recent work on online auctions for digital goods has explored the role of optimal stopping theory — particularly secretary problems — in the design of approximately optimal on...

Mohammad Taghi Hajiaghayi, Robert D. Kleinberg, Tu...

RTSS
2007
IEEE

84 views, Control Systems

A UML-Based Design Framework for Time-Triggered Applications

16 years 1 month ago

Download www.comp.nus.edu.sg

Time-triggered architectures (TTAs) are strong candidate platforms for safety-critical real-time applications. A typical time-triggered architecture is constituted by one or more ...

Kathy Dang Nguyen, P. S. Thiagarajan, Weng-Fai Won...
