Sciweavers

494 search results - page 36 / 99
» Evaluating a Reinforcement Learning Algorithm with a General...
Sort
View
GECCO
2008
Springer
206views Optimization» more  GECCO 2008»
13 years 8 months ago
Improving accuracy of immune-inspired malware detectors by using intelligent features
In this paper, we show that a Bio-inspired classifier’s accuracy can be dramatically improved if it operates on intelligent features. We propose a novel set of intelligent feat...
M. Zubair Shafiq, Syed Ali Khayam, Muddassar Faroo...
ECML
2007
Springer
14 years 1 months ago
Policy Gradient Critics
We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...
Daan Wierstra, Jürgen Schmidhuber
ACL
2006
13 years 9 months ago
A Rote Extractor with Edit Distance-Based Generalisation and Multi-Corpora Precision Calculation
In this paper, we describe a rote extractor that learns patterns for finding semantic relationships in unrestricted text, with new procedures for pattern generalization and scorin...
Enrique Alfonseca, Pablo Castells, Manabu Okumura,...
ICML
1999
IEEE
14 years 8 months ago
Least-Squares Temporal Difference Learning
Excerpted from: Boyan, Justin. Learning Evaluation Functions for Global Optimization. Ph.D. thesis, Carnegie Mellon University, August 1998. (Available as Technical Report CMU-CS-...
Justin A. Boyan
AI
1998
Springer
13 years 12 months ago
Sequential Instance-Based Learning
This paper presents and evaluates sequential instance-based learning (SIBL), an approach to action selection based upon data gleaned from prior problem solving experiences. SIBL le...
Susan L. Epstein, Jenngang Shih