Search Sciweavers | Sciweavers

1138 search results - page 168 / 228

» Feature Markov Decision Processes

149

click to vote

AAAI
2007

117views Intelligent Agents» more AAAI 2007»

Authorial Idioms for Target Distributions in TTD-MDPs

15 years 5 months ago

Download www.cc.gatech.edu

In designing Markov Decision Processes (MDP), one must deﬁne the world, its dynamics, a set of actions, and a reward function. MDPs are often applied in situations where there i...

David L. Roberts, Sooraj Bhat, Kenneth St. Clair, ...

claim paper

Read More »

110

click to vote

AAAI
2007

100views Intelligent Agents» more AAAI 2007»

Compact Spectral Bases for Value Function Approximation Using Kronecker Factorization

15 years 5 months ago

Download www.cs.umass.edu

A new spectral approach to value function approximation has recently been proposed to automatically construct basis functions from samples. Global basis functions called proto-val...

Jeffrey Johns, Sridhar Mahadevan, Chang Wang

claim paper

Read More »

149

Voted

AAAI
2007

102views Intelligent Agents» more AAAI 2007»

Thresholded Rewards: Acting Optimally in Timed, Zero-Sum Games

15 years 5 months ago

Download www.cs.cmu.edu

In timed, zero-sum games, the goal is to maximize the probability of winning, which is not necessarily the same as maximizing our expected reward. We consider cumulative intermedi...

Colin McMillen, Manuela M. Veloso

claim paper

Read More »

113

Voted

ACMACE
2008
ACM

106views Human Computer Interaction» more ACMACE 2008»

AIRSF: a new entertainment adaptive framework for stress free air travels

15 years 5 months ago

Download www.idemployee.id.tue.nl

In this paper, we present a new entertainment adaptive framework AIRSF for stress free air travels. Based on the passenger's current and target comfort states, user entertain...

Hao Liu, Jun Hu, Matthias Rauterberg

claim paper

Read More »

144

click to vote

ATAL
2008
Springer

134views Intelligent Agents» more ATAL 2008»

MB-AIM-FSI: a model based framework for exploiting gradient ascent multiagent learners in strategic interactions

15 years 5 months ago

Download www.cs.utexas.edu

Future agent applications will increasingly represent human users autonomously or semi-autonomously in strategic interactions with similar entities. Hence, there is a growing need...

Doran Chakraborty, Sandip Sen

claim paper

Read More »

« Prev « First page 168 / 228 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers