Search Sciweavers | Sciweavers

28 search results - page 2 / 6

» Hierarchical POMDP Controller Optimization by Likelihood Max...

174

click to vote

ICML
2009
IEEE

148views Machine Learning» more ICML 2009»

Predictive representations for policy gradient in POMDPs

16 years 7 months ago

Download damas.ift.ulaval.ca

We consider the problem of estimating the policy gradient in Partially Observable Markov Decision Processes (POMDPs) with a special class of policies that are based on Predictive ...

Abdeslam Boularias, Brahim Chaib-draa

claim paper

Read More »

190

click to vote

PERCOM
2007
ACM

189views Computer Networks» more PERCOM 2007»

Sensor Scheduling for Optimal Observability Using Estimation Entropy

16 years 6 months ago

Download people.eng.unimelb.edu.au

We consider sensor scheduling as the optimal observability problem for partially observable Markov decision processes (POMDP). This model fits to the cases where a Markov process ...

Mohammad Rezaeian

claim paper

Read More »

167

click to vote

AICT
2006
IEEE

129views Communications» more AICT 2006»

Stochastic Thresholding: An approach to Estimator Optimization via Fisher Information Maximization

15 years 10 months ago

Download www.ncc.org.in

In stochastic thresholding, the threshold for quantization of a signal is randomized. An estimator based on quantized signal data can be optimized through stochastic thresholding. ...

Samudra Dasgupta

claim paper

Read More »

189

click to vote

JMLR
2008

110views more JMLR 2008»

Cross-Validation Optimization for Large Scale Structured Classification Kernel Methods

15 years 6 months ago

Download jmlr.csail.mit.edu

We propose a highly efficient framework for penalized likelihood kernel methods applied to multiclass models with a large, structured set of classes. As opposed to many previous a...

Matthias W. Seeger

claim paper

Read More »

200

click to vote

FOCS
2007
IEEE

157views Theoretical Computer Science» more FOCS 2007»

Approximation Algorithms for Partial-Information Based Stochastic Control with Markovian Rewards

16 years 1 months ago

Download www.cis.upenn.edu

We consider a variant of the classic multi-armed bandit problem (MAB), which we call FEEDBACK MAB, where the reward obtained by playing each of n independent arms varies according...

Sudipto Guha, Kamesh Munagala

claim paper

Read More »

« Prev « First page 2 / 6 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers