Search Sciweavers | Sciweavers

185 search results - page 13 / 37

» Simulation-Based Optimization Algorithms for Finite-Horizon ...

115

click to vote

ICRA
2007
IEEE

126views Robotics» more ICRA 2007»

A formal framework for robot learning and control under model uncertainty

15 years 10 months ago

Download www.cs.mcgill.ca

— While the Partially Observable Markov Decision Process (POMDP) provides a formal framework for the problem of robot control under uncertainty, it typically assumes a known and ...

Robin Jaulmes, Joelle Pineau, Doina Precup

claim paper

Read More »

134

Voted

IJCAI
2007

154views Artificial Intelligence» more IJCAI 2007»

A Hybridized Planner for Stochastic Domains

15 years 5 months ago

Download www.ijcai.org

Markov Decision Processes are a powerful framework for planning under uncertainty, but current algorithms have difﬁculties scaling to large problems. We present a novel probabil...

Mausam, Piergiorgio Bertoli, Daniel S. Weld

claim paper

Read More »

127

Voted

AAAI
2004

103views Intelligent Agents» more AAAI 2004»

Stochastic Local Search for POMDP Controllers

15 years 5 months ago

Download www.cs.utoronto.ca

The search for finite-state controllers for partially observable Markov decision processes (POMDPs) is often based on approaches like gradient ascent, attractive because of their ...

Darius Braziunas, Craig Boutilier

claim paper

Read More »

124

Voted

ICML
2006
IEEE

131views Machine Learning» more ICML 2006»

PAC model-free reinforcement learning

16 years 4 months ago

Download cseweb.ucsd.edu

For a Markov Decision Process with finite state (size S) and action spaces (size A per state), we propose a new algorithm--Delayed Q-Learning. We prove it is PAC, achieving near o...

Alexander L. Strehl, Lihong Li, Eric Wiewiora, Joh...

claim paper

Read More »

138

Voted

CCE
2004

162views Software Engineering» more CCE 2004»

An algorithmic framework for improving heuristic solutions: Part II. A new version of the stochastic traveling salesman problem

15 years 3 months ago

Download www.che.gatech.edu

The algorithmic framework developed for improving heuristic solutions of the new version of deterministic TSP [Choi et al., 2002] is extended to the stochastic case. To verify the...

Jaein Choi, Jay H. Lee, Matthew J. Realff

claim paper

Read More »

« Prev « First page 13 / 37 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers