Sciweavers

1235 search results - page 168 / 247
» Reinforcement learning in a nutshell
Sort
View
ATAL
2008
Springer
14 years 4 days ago
Expediting RL by using graphical structures
The goal of Reinforcement learning (RL) is to maximize reward (minimize cost) in a Markov decision process (MDP) without knowing the underlying model a priori. RL algorithms tend ...
Peng Dai, Alexander L. Strehl, Judy Goldsmith
AAAI
2006
13 years 11 months ago
Modeling Human Decision Making in Cliff-Edge Environments
In this paper we propose a model for human learning and decision making in environments of repeated Cliff-Edge (CE) interactions. In CE environments, which include common daily in...
Ron Katz, Sarit Kraus
ICML
2010
IEEE
13 years 8 months ago
Temporal Difference Bayesian Model Averaging: A Bayesian Perspective on Adapting Lambda
Temporal difference (TD) algorithms are attractive for reinforcement learning due to their ease-of-implementation and use of "bootstrapped" return estimates to make effi...
Carlton Downey, Scott Sanner
ICALT
2005
IEEE
14 years 3 months ago
Can Collaborative Technologies Improve Management Education?
This paper explores the potential impact of collaborative technologies on improving management education. The first goal is to expose students to tools and practices that not only...
Marie-Noëlle Bessagnet, Lee Schlenker, Robert...
IVA
2005
Springer
14 years 3 months ago
Teaching Virtual Characters How to Use Body Language
Abstract. Non-verbal communication, or “body language”, is a critical component in constructing believable virtual characters. Most often, body language is implemented by a set...
Doron A. Friedman, Marco Gillies