Sciweavers

» Dynamic Adaptive Pre-Tenuring
AAAI
1998
13 years 11 months ago
Opponent Modeling in Poker
Poker is an interesting test-bed for artificial intelligence research. It is a game of imperfect knowledge, where multiple competing agents must deal with risk management, agent m...
Darse Billings, Denis Papp, Jonathan Schaeffer, Du...
FLAIRS
1998
13 years 11 months ago
Learning to Race: Experiments with a Simulated Race Car
Our focus is on designing adaptable agents for highly dynamic environments. We have implemented a reinforcement learning architecture as the reactive component of a two-layer control ...
Larry D. Pyeatt, Adele E. Howe
AAAI
1996
13 years 11 months ago
Evolution-Based Discovery of Hierarchical Behaviors
Procedural representations of control policies have two advantages when facing the scale-up problem in learning tasks. First they are implicit, with potential for inductive genera...
Justinian P. Rosca, Dana H. Ballard
ICGA
1993
13 years 11 months ago
Genetic Programming of Minimal Neural Nets Using Occam's Razor
A genetic programming method is investigated for optimizing both the architecture and the connection weights of multilayer feedforward neural networks. The genotype of each networ...
Byoung-Tak Zhang, Heinz Mühlenbein
NIPS
1993
13 years 11 months ago
The Parti-Game Algorithm for Variable Resolution Reinforcement Learning in Multidimensional State-Spaces
Parti-game is a new algorithm for learning feasible trajectories to goal regions in high dimensional continuous state-spaces. In high dimensions it is essential that learning does not...
Andrew W. Moore