Search Sciweavers | Sciweavers

3876 search results - page 677 / 776

» Dynamic Adaptive Pre-Tenuring

210

click to vote

AAAI
1998

168views Intelligent Agents» more AAAI 1998»

Opponent Modeling in Poker

15 years 8 months ago

Download www.aaai.org

Poker is an interesting test-bed for artificial intelligence research. It is a game of imperfect knowledge, where multiple competing agents must deal with risk management, agent m...

Darse Billings, Denis Papp, Jonathan Schaeffer, Du...

claim paper

Read More »

187

click to vote

FLAIRS
1998

130views Artificial Intelligence» more FLAIRS 1998»

Learning to Race: Experiments with a Simulated Race Car

15 years 8 months ago

Download www.aaai.org

Our focus is on designing adaptable agents for highly dynamic environments. Wehave implementeda reinforcement learning architecture as the reactive componentof a twolayer control ...

Larry D. Pyeatt, Adele E. Howe

claim paper

Read More »

216

click to vote

AAAI
1996

191views Intelligent Agents» more AAAI 1996»

Evolution-Based Discovery of Hierarchical Behaviors

15 years 8 months ago

Download www.aaai.org

Procedural representations of control policies have two advantages when facing the scale-up problem in learning tasks. First they are implicit, with potential for inductive genera...

Justinian P. Rosca, Dana H. Ballard

claim paper

Read More »

224

click to vote

ICGA
1993

145views Optimization» more ICGA 1993»

Genetic Programming of Minimal Neural Nets Using Occam's Razor

15 years 8 months ago

Download bi.snu.ac.kr

A genetic programming method is investigated for optimizing both the architecture and the connection weights of multilayer feedforward neural networks. The genotype of each networ...

Byoung-Tak Zhang, Heinz Mühlenbein

claim paper

Read More »

195

Voted

NIPS
1993

92views Information Technology» more NIPS 1993»

The Parti-Game Algorithm for Variable Resolution Reinforcement Learning in Multidimensional State-Spaces

15 years 8 months ago

Download www.ri.cmu.edu

Parti-game is a new algorithm for learning feasible trajectories to goal regions in high dimensionalcontinuousstate-spaces. In high dimensions it is essential that learningdoes not...

Andrew W. Moore

claim paper

Read More »

« Prev « First page 677 / 776 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers