Search Sciweavers | Sciweavers

451 search results - page 3 / 91

» Performance evaluation with temporal rewards

242

click to vote

JAIR
2006

157views more JAIR 2006»

Decision-Theoretic Planning with non-Markovian Rewards

15 years 7 months ago

Download www.jair.org

A decision process in which rewards depend on history rather than merely on the current state is called a decision process with non-Markovian rewards (NMRDP). In decisiontheoretic...

Sylvie Thiébaux, Charles Gretton, John K. S...

claim paper

Read More »

229

click to vote

VALUETOOLS
2006
ACM

164views Hardware» more VALUETOOLS 2006»

Analysis of Markov reward models using zero-suppressed multi-terminal BDDs

16 years 1 months ago

Download www.unibw.de

High-level stochastic description methods such as stochastic Petri nets, stochastic UML statecharts etc., together with speciﬁcations of performance variables (PVs), enable a co...

Kai Lampka, Markus Siegle

claim paper

Read More »

207

click to vote

AAAI
2006

128views Intelligent Agents» more AAAI 2006»

QUICR-Learning for Multi-Agent Coordination

15 years 8 months ago

Download www.aaai.org

Coordinating multiple agents that need to perform a sequence of actions to maximize a system level reward requires solving two distinct credit assignment problems. First, credit m...

Adrian K. Agogino, Kagan Tumer

claim paper

Read More »

213

click to vote

CHI
2010
ACM

190views Human Computer Interaction» more CHI 2010»

Physical activity motivating games: virtual rewards for real activity

16 years 1 months ago

Download brochures.austria.info

Contemporary lifestyle has become increasingly sedentary: little physical (sports, exercises) and much sedentary (TV, computers) activity. The nature of sedentary activity is self...

Shlomo Berkovsky, Mac Coombe, Jill Freyne, Dipak B...

claim paper

Read More »

219

click to vote

AAAI
2006

127views Intelligent Agents» more AAAI 2006»

Reinforcement Learning with Human Teachers: Evidence of Feedback and Guidance with Implications for Learning Performance

15 years 8 months ago

Download robotic.media.mit.edu

As robots become a mass consumer product, they will need to learn new skills by interacting with typical human users. Past approaches have adapted reinforcement learning (RL) to a...

Andrea Lockerd Thomaz, Cynthia Breazeal

claim paper

Read More »

« Prev « First page 3 / 91 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers