Search Sciweavers | Sciweavers

508 search results - page 26 / 102

» Learning for stochastic dynamic programming

click to vote

SAC
2009
ACM

108views Applied Computing» more SAC 2009»

Modular implementation of adaptive decisions in stochastic simulations

15 years 9 months ago

Download people.cs.vt.edu

We present a modular approach to implement adaptive decisions with existing scientiﬁc codes. Using a sophisticated system software tool based on the function call interception t...

Pilsung Kang 0002, Yang Cao, Naren Ramakrishnan, C...

claim paper

Read More »

102

click to vote

NIPS
2007

137views Information Technology» more NIPS 2007»

Sequential Hypothesis Testing under Stochastic Deadlines

15 years 3 months ago

Download www.cogsci.ucsd.edu

Most models of decision-making in neuroscience assume an inﬁnite horizon, which yields an optimal solution that integrates evidence up to a ﬁxed decision threshold; however, u...

Peter Frazier, Angela Yu

claim paper

Read More »

116

click to vote

AAAI
2004

103views Intelligent Agents» more AAAI 2004»

Stochastic Local Search for POMDP Controllers

15 years 3 months ago

Download www.cs.utoronto.ca

The search for finite-state controllers for partially observable Markov decision processes (POMDPs) is often based on approaches like gradient ascent, attractive because of their ...

Darius Braziunas, Craig Boutilier

claim paper

Read More »

119

click to vote

CDC
2009
IEEE

172views Control Systems» more CDC 2009»

Approximate dynamic programming using fluid and diffusion approximations with applications to power management

15 years 7 months ago

Download www.cs.caltech.edu

—TD learning and its reﬁnements are powerful tools for approximating the solution to dynamic programming problems. However, the techniques provide the approximate solution only...

Wei Chen, Dayu Huang, Ankur A. Kulkarni, Jayakrish...

claim paper

Read More »

142

click to vote

JCP
2007

143views more JCP 2007»

Noisy K Best-Paths for Approximate Dynamic Programming with Application to Portfolio Optimization

15 years 2 months ago

Download www.academypublisher.com

Abstract— We describe a general method to transform a non-Markovian sequential decision problem into a supervised learning problem using a K-bestpaths algorithm. We consider an a...

Nicolas Chapados, Yoshua Bengio

claim paper

Read More »

« Prev « First page 26 / 102 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers