Sciweavers

1101 search results - page 138 / 221
» heuristics 2007
Sort
View
NIPS
1994
13 years 11 months ago
Reinforcement Learning with Soft State Aggregation
It is widely accepted that the use of more compact representations than lookup tables is crucial to scaling reinforcement learning (RL) algorithms to real-world problems. Unfortun...
Satinder P. Singh, Tommi Jaakkola, Michael I. Jord...
IJCAI
1989
13 years 11 months ago
Ordering Problem Subgoals
Most past research work on problem subgoal ordering are of a heuristic nature and very little attempt has been made to reveal the inherent relationship between subgoal ordering co...
Jie Cheng, Keki B. Irani
IJCAI
1989
13 years 11 months ago
Constraint Satisfiability Algorithms for Interactive Student Scheduling
A constraint satisfiability problem consists of a set of variables, their associated domains (i.e., the set of values the variable can take) and a set of constraints on these vari...
Ronen Feldman, Martin Charles Golumbic
IJCAI
1989
13 years 11 months ago
The Reason for the Benefits of Minimax Search
based on an abstract concept of quiescence. In the following we sketch this and a related model, describe the design of our experiments, and present the results of our simulation s...
Anton Scheucher, Hermann Kaindl
IJCAI
1989
13 years 11 months ago
Selective Learning of Macro-operators with Perfect Causality
A macro-operator is an integrated operator consisting of plural primitive operators and enables a problem solver to solve more efficiently. However, if a learning system generates...
Seiji Yamada, Sabinro Tsuji