Sciweavers

892 search results - page 133 / 179
» Action respecting embedding
Sort
View
ICML
2010
IEEE
15 years 5 months ago
Toward Off-Policy Learning Control with Function Approximation
We present the first temporal-difference learning algorithm for off-policy control with unrestricted linear function approximation whose per-time-step complexity is linear in the ...
Hamid Reza Maei, Csaba Szepesvári, Shalabh ...
COMBINATORICS
2006
126views more  COMBINATORICS 2006»
15 years 4 months ago
Constructions of Representations of Rank Two Semisimple Lie Algebras with Distributive Lattices
We associate one or two posets (which we call "semistandard posets") to any given irreducible representation of a rank two semisimple Lie algebra over C. Elsewhere we ha...
L. Wyatt Alverson II, Robert G. Donnelly, Scott J....
IJCSS
2006
116views more  IJCSS 2006»
15 years 4 months ago
Extracting Motor Unit Firing Information by Independent Component Analysis of Surface Electromyogram: A Preliminary Study Using
Decomposition of electromyogram (EMG) provides a valuable means of obtaining motor unit recruitment and firing rate information. The feasibility of decomposing surface EMG signals...
Ping Zhou, M. M. Lowery, W. Zev Rymer
JFR
2007
150views more  JFR 2007»
15 years 4 months ago
Decisional autonomy of planetary rovers
To achieve the ever increasing demand for science return, planetary exploration rovers require more autonomy to successfully perform their missions. Indeed, the communication dela...
Félix Ingrand, Simon Lacroix, Solange Lemai...
AI
1998
Springer
15 years 4 months ago
Model-Based Average Reward Reinforcement Learning
Reinforcement Learning (RL) is the study of programs that improve their performance by receiving rewards and punishments from the environment. Most RL methods optimize the discoun...
Prasad Tadepalli, DoKyeong Ok