Search Sciweavers | Sciweavers

85 search results - page 15 / 17

» Approximate Policy Iteration with a Policy Language Bias

AAAI
2006

199 views, Intelligent Agents

Decision Making in Uncertain Real-World Domains Using DT-Golog

13 years 8 months ago

DTGolog, a decision-theoretic agent programming language based on the situation calculus, was proposed to ease some of the computational difficulties associated with Markov Decisi...

Mikhail Soutchanski, Huy Pham, John Mylopoulos

QUESTA
2010

112 views

Admission control for a multi-server queue with abandonment

13 years 5 months ago

In an M/M/N+M queue, when there are many customers waiting, it may be preferable to reject a new arrival rather than risk that arrival later abandoning without receiving service. O...

Yasar Levent Koçaga, Amy R. Ward

ESOP
2007
Springer

94 views, Programming Languages

Small Witnesses for Abstract Interpretation-Based Proofs

14 years 1 months ago

Small Witnesses for Abstract Interpretation-based Proofs Frédéric Besson, Thomas Jensen, and Tiphaine Turpin IRISA/{Inria, CNRS, Université de Rennes 1} Campus de Beaulieu, F-35042 R...

Frédéric Besson, Thomas P. Jensen, T...

NIPS
1996

192 views, Information Technology

Multidimensional Triangulation and Interpolation for Reinforcement Learning

13 years 8 months ago

Dynamic Programming, Q-learning and other discrete Markov Decision Process solvers can be applied to continuous d-dimensional state-spaces by quantizing the state space into an arr...

Scott Davies

AAAI
2007

102 views, Intelligent Agents

Thresholded Rewards: Acting Optimally in Timed, Zero-Sum Games

13 years 9 months ago

In timed, zero-sum games, the goal is to maximize the probability of winning, which is not necessarily the same as maximizing our expected reward. We consider cumulative intermedi...

Colin McMillen, Manuela M. Veloso
