Search Sciweavers | Sciweavers

252 search results - page 31 / 51

» Learning Partially Observable Action Models: Efficient Algor...

135

Voted

ICMLA
2004

114views Machine Learning» more ICMLA 2004»

Planning with predictive state representations

15 years 4 months ago

Download www.eecs.umich.edu

Predictive state representation (PSR) models for controlled dynamical systems have recently been proposed as an alternative to traditional models such as partially observable Mark...

Michael R. James, Satinder P. Singh, Michael L. Li...

claim paper

Read More »

114

click to vote

ICRA
2010
IEEE

163views Robotics» more ICRA 2010»

Exploiting domain knowledge in planning for uncertain robot systems modeled as POMDPs

15 years 1 months ago

Download robotics.ai.uiuc.edu

Abstract— We propose a planning algorithm that allows usersupplied domain knowledge to be exploited in the synthesis of information feedback policies for systems modeled as parti...

Salvatore Candido, James C. Davidson, Seth Hutchin...

claim paper

Read More »

142

Voted

ICML
2009
IEEE

190views Machine Learning» more ICML 2009»

A least squares formulation for a class of generalized eigenvalue problems in machine learning

16 years 3 months ago

Download www.public.asu.edu

Many machine learning algorithms can be formulated as a generalized eigenvalue problem. One major limitation of such formulation is that the generalized eigenvalue problem is comp...

Liang Sun, Shuiwang Ji, Jieping Ye

claim paper

Read More »

117

click to vote

ICML
2006
IEEE

108views Machine Learning» more ICML 2006»

Experience-efficient learning in associative bandit problems

16 years 3 months ago

Download paul.rutgers.edu

We formalize the associative bandit problem framework introduced by Kaelbling as a learning-theory problem. The learning environment is modeled as a k-armed bandit where arm payof...

Alexander L. Strehl, Chris Mesterharm, Michael L. ...

claim paper

Read More »

127

Voted

AAAI
2010

185views Intelligent Agents» more AAAI 2010»

Symbolic Dynamic Programming for First-order POMDPs

15 years 4 months ago

Download www-kd.iai.uni-bonn.de

Partially-observable Markov decision processes (POMDPs) provide a powerful model for sequential decision-making problems with partially-observed state and are known to have (appro...

Scott Sanner, Kristian Kersting

claim paper

Read More »

« Prev « First page 31 / 51 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers