Search Sciweavers | Sciweavers

580 search results - page 40 / 116

» Approximation Algorithms for Temporal Reasoning


SAGT
2010
Springer

164 views, Game Theory

On Nash-Equilibria of Approximation-Stable Games

15 years 25 days ago

Download www.cs.cmu.edu

Abstract. One reason for wanting to compute an (approximate) Nash equilibrium of a game is to predict how players will play. However, if the game has multiple equilibria that are f...

Pranjal Awasthi, Maria-Florina Balcan, Avrim Blum,...



ICML
2010
IEEE

231 views, Machine Learning

Toward Off-Policy Learning Control with Function Approximation

15 years 3 months ago

Download www.sztaki.hu

We present the first temporal-difference learning algorithm for off-policy control with unrestricted linear function approximation whose per-time-step complexity is linear in the ...

Hamid Reza Maei, Csaba Szepesvári, Shalabh ...



ATAL
2010
Springer

172 views, Intelligent Agents

The multi variable multi constrained distributed constraint optimization framework

15 years 3 months ago

Download www-scf.usc.edu

Service coordination in domains involving temporal constraints and duration uncertainty has previously been solved with a greedy algorithm that attempts to satisfy service requests...

Christopher Portway, Edmund H. Durfee



TIME
2009
IEEE

267 views, Automated Reasoning

Fixing the Semantics for Dynamic Controllability and Providing a More Practical Characterization of Dynamic Execution Strategies

15 years 9 months ago

Download www.cs.vassar.edu

Morris, Muscettola and Vidal (MMV) presented an algorithm for checking the dynamic controllability (DC) of temporal networks in which certain temporal durations are beyond the con...

Luke Hunsberger



CORR
2010
Springer

152 views, Education

Neuroevolutionary optimization

15 years 2 months ago

Download jmlr.csail.mit.edu

Temporal difference methods are theoretically grounded and empirically effective methods for addressing reinforcement learning problems. In most real-world reinforcement learning ...

Eva Volná
