Search Sciweavers | Sciweavers

312 search results - page 45 / 63

» Learning Partially Observable Deterministic Action Models

CVPR
1997
IEEE

181 views, Computer Vision

Learning bilinear models for two-factor problems in vision

15 years 6 months ago

Download www.merl.com

In many vision problems, we want to infer two (or more) hidden factors which interact to produce our observations. We may want to disentangle illuminant and object colors in color...

William T. Freeman, Joshua B. Tenenbaum

IJCAI
2001

151 views, Artificial Intelligence

R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning

15 years 3 months ago

Download jmlr.csail.mit.edu

R-max is a very simple model-based reinforcement learning algorithm which can attain near-optimal average reward in polynomial time. In R-max, the agent always maintains a complet...

Ronen I. Brafman, Moshe Tennenholtz

ATAL
2009
Springer

135 views, Intelligent Agents

An empirical analysis of value function-based and policy search reinforcement learning

15 years 9 months ago

Download userweb.cs.utexas.edu

In several agent-oriented scenarios in the real world, an autonomous agent that is situated in an unknown environment must learn through a process of trial and error to take actio...

Shivaram Kalyanakrishnan, Peter Stone

ICALP
2009
Springer

132 views, Programming Languages

Qualitative Concurrent Stochastic Games with Imperfect Information

15 years 9 months ago

Download hal.archives-ouvertes.fr

We study a model of games that combines concurrency, imperfect information and stochastic aspects. These are finite-state games in which, at each round, the two players...

Vincent Gripon, Olivier Serre

ATAL
2007
Springer

129 views, Intelligent Agents

Subjective approximate solutions for decentralized POMDPs

15 years 8 months ago

Download www.cs.cmu.edu

The problem of planning for cooperative teams under uncertainty is a crucial one in multiagent systems. Decentralized partially observable Markov decision processes (DEC-POMDPs) prov...

Anton Chechetka, Katia P. Sycara
