Sciweavers

335 search results - page 47 / 67
» Learning Simulation Control in General Game-Playing Agents
Sort
View
141
Voted
ATAL
2009
Springer
15 years 10 months ago
Online exploration in least-squares policy iteration
One of the key problems in reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large or even continuous Markov decision processes (...
Lihong Li, Michael L. Littman, Christopher R. Mans...
110
Voted
CDC
2009
IEEE
139views Control Systems» more  CDC 2009»
15 years 8 months ago
A bio-plausible design for visual attitude stabilization
— We consider the problem of attitude stabilization using exclusively visual sensory input, and we look for a solution which can satisfy the constraints of a “bio-plausible” ...
Andrea Censi, Shuo Han, Sawyer B. Fuller, Richard ...
138
Voted
AAAI
1994
15 years 4 months ago
GENET: A Connectionist Architecture for Solving Constraint Satisfaction Problems by Iterative Improvement
New approaches to solving constraint satisfaction problems using iterative improvement techniques have been found to be successful on certain, very large problems such as the mill...
Andrew J. Davenport, Edward P. K. Tsang, Chang J. ...
127
Voted
IROS
2006
IEEE
168views Robotics» more  IROS 2006»
15 years 9 months ago
Learning to Drive Among Obstacles
— This paper reports on an outdoor mobile robot that learns to avoid collisions by observing a human driver operate a vehicle equipped with sensors that continuously produce a ma...
Bradley Hamner, Sebastian Scherer, Sanjiv Singh
155
Voted
ITS
1992
Springer
152views Multimedia» more  ITS 1992»
15 years 7 months ago
People Power: A Human-Computer Collaborative Learning System
Abstract. This paper reports our research work in the new field of humancomputer collaborative learning (HCCL). The general architecture of an HCCL is defined. An HCCL system, call...
Pierre Dillenbourg, John A. Self