Sciweavers

1930 search results - page 133 / 386
» Learning to Solve QBF
Sort
View
GECCO
2005
Springer
155views Optimization» more  GECCO 2005»
15 years 9 months ago
Co-evolving recurrent neurons learn deep memory POMDPs
Recurrent neural networks are theoretically capable of learning complex temporal sequences, but training them through gradient-descent is too slow and unstable for practical use i...
Faustino J. Gomez, Jürgen Schmidhuber
ECP
1997
Springer
105views Robotics» more  ECP 1997»
15 years 8 months ago
Planning, Learning, and Executing in Autonomous Systems
Systems that act autonomously in the environment have to be able to integrate three basic behaviors: planning, execution, and learning. Planning involves describing a set of action...
Ramón García-Martínez, Daniel...
GECCO
2006
Springer
208views Optimization» more  GECCO 2006»
15 years 7 months ago
Comparing evolutionary and temporal difference methods in a reinforcement learning domain
Both genetic algorithms (GAs) and temporal difference (TD) methods have proven effective at solving reinforcement learning (RL) problems. However, since few rigorous empirical com...
Matthew E. Taylor, Shimon Whiteson, Peter Stone
IJCAI
2003
15 years 5 months ago
An Integrated Multilevel Learning Approach to Multiagent Coalition Formation
In this paper we describe an integrated multilevel learning approach to multiagent coalition formation in a real-time environment. In our domain, agents negotiate to form teams to...
Leen-Kiat Soh, Xin Li
135
Voted
AR
2007
105views more  AR 2007»
15 years 4 months ago
Reinforcement learning of a continuous motor sequence with hidden states
—Reinforcement learning is the scheme for unsupervised learning in which robots are expected to acquire behavior skills through self-explorations based on reward signals. There a...
Hiroaki Arie, Tetsuya Ogata, Jun Tani, Shigeki Sug...