Search Sciweavers | Sciweavers

260 search results - page 45 / 52

» Quasi-Deterministic Partially Observable Markov Decision Pro...

131

click to vote

NIPS
2001

144views Information Technology» more NIPS 2001»

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 5 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

claim paper

Read More »

147

click to vote

DSN
2009
IEEE

131views Computer Networks» more DSN 2009»

RRE: A game-theoretic intrusion Response and Recovery Engine

15 years 2 months ago

Download netfiles.uiuc.edu

Preserving the availability and integrity of networked computing systems in the face of fast-spreading intrusions requires advances not only in detection algorithms, but also in a...

Saman A. Zonouz, Himanshu Khurana, William H. Sand...

claim paper

Read More »

144

click to vote

APNOMS
2006
Springer

103views Computer Networks» more APNOMS 2006»

Network-Adaptive QoS Routing Using Local Information

15 years 8 months ago

Download www.apnoms.org

In this paper, we propose the localized adaptive QoS routing scheme using POMDP(partially observable Markov Decision Processes) and Exploration Bonus. In order to deal with POMDP p...

Jeongsoo Han

claim paper

Read More »

108

click to vote

ANOR
2010

102views more ANOR 2010»

Optimal control of dosage decisions in controlled ovarian hyperstimulation

15 years 4 months ago

Download www.castlelab.princeton.edu

Abstract In the controlled ovary hyperstimulation (COH) cycle of the in vitro fertilization-embryo transfer (IVFET) therapy, the clinicians observe the patients' responses to ...

Miao He, Lei Zhao, Warren B. Powell

claim paper

Read More »

141

click to vote

FGR
2006
IEEE

205views Biometrics» more FGR 2006»

Tracking Using Dynamic Programming for Appearance-Based Sign Language Recognition

15 years 10 months ago

Download www-i6.informatik.rwth-aachen.de

We present a novel tracking algorithm that uses dynamic programming to determine the path of target objects and that is able to track an arbitrary number of different objects. The...

Philippe Dreuw, Thomas Deselaers, David Rybach, Da...

claim paper

Read More »

« Prev « First page 45 / 52 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers