Search Sciweavers | Sciweavers

332 search results - page 28 / 67

» Ranking policies in discrete Markov decision processes

UAI
1997

108 views, Artificial Intelligence

13 years 9 months ago

Correlated Action Effects in Decision Theoretic Regression

Much recent research in decision theoretic planning has adopted Markov decision processes (MDPs) as the model of choice, and has attempted to make their solution more tractable by...

Craig Boutilier

ICML
1996
IEEE

196 views, Machine Learning

A Convergent Reinforcement Learning Algorithm in the Continuous Case: The Finite-Element Reinforcement Learning

14 years 5 days ago

Download www.ri.cmu.edu

This paper presents a direct reinforcement learning algorithm, called Finite-Element Reinforcement Learning, in the continuous case, i.e. continuous state-space and time. The eval...

Rémi Munos

CDC
2008
IEEE

145 views, Control Systems

Necessary and sufficient conditions for success of the nuclear norm heuristic for rank minimization

13 years 8 months ago

Download www.ist.caltech.edu

Minimizing the rank of a matrix subject to constraints is a challenging problem that arises in many applications in control theory, machine learning, and discrete geometry. This c...

Benjamin Recht, Weiyu Xu, Babak Hassibi

MP
2011

191 views, Intelligent Agents

Null space conditions and thresholds for rank minimization

13 years 3 months ago

Download pages.cs.wisc.edu

Minimizing the rank of a matrix subject to constraints is a challenging problem that arises in many applications in machine learning, control theory, and discrete geometry. This c...

Benjamin Recht, Weiyu Xu, Babak Hassibi

FCCM
2006
IEEE

106 views, VLSI

Scalable Hardware Architecture for Real-Time Dynamic Programming Applications

14 years 2 months ago

Download www.ece.utk.edu

This paper introduces a novel architecture for performing the core computations required by dynamic programming (DP) techniques. The latter pertain to a vast range of a...

Brad Matthews, Itamar Elhanany
