Sciweavers

50 search results - page 6 / 10
» Learning Continuous Action Models in a Real-Time Strategy En...
Sort
View
COLT
2008
Springer
13 years 8 months ago
Adapting to a Changing Environment: the Brownian Restless Bandits
In the multi-armed bandit (MAB) problem there are k distributions associated with the rewards of playing each of k strategies (slot machine arms). The reward distributions are ini...
Aleksandrs Slivkins, Eli Upfal
ICML
1990
IEEE
13 years 10 months ago
Explanations of Empirically Derived Reactive Plans
Given an adequate simulation model of the task environment and payoff function that measures the quality of partially successful plans, competition-based heuristics such as geneti...
Diana F. Gordon, John J. Grefenstette
AIIDE
2009
13 years 7 months ago
Learning Character Behaviors Using Agent Modeling in Games
Our goal is to provide learning mechanisms to game agents so they are capable of adapting to new behaviors based on the actions of other agents. We introduce a new on-line reinfor...
Richard Zhao, Duane Szafron
BICA
2010
13 years 1 months ago
Application Feedback in Guiding a Deep-Layered Perception Model
Deep-layer machine learning architectures continue to emerge as a promising biologically-inspired framework for achieving scalable perception in artificial agents. State inference ...
Itamar Arel, Shay Berant
AI
1998
Springer
13 years 6 months ago
Model-Based Average Reward Reinforcement Learning
Reinforcement Learning (RL) is the study of programs that improve their performance by receiving rewards and punishments from the environment. Most RL methods optimize the discoun...
Prasad Tadepalli, DoKyeong Ok