Sciweavers

700 search results - page 59 / 140
» Combinations of Stit and Actions
Sort
View
ICML
2003
IEEE
14 years 8 months ago
Hierarchical Policy Gradient Algorithms
Hierarchical reinforcement learning is a general framework which attempts to accelerate policy learning in large domains. On the other hand, policy gradient reinforcement learning...
Mohammad Ghavamzadeh, Sridhar Mahadevan
ALT
2005
Springer
14 years 4 months ago
Defensive Universal Learning with Experts
This paper shows how universal learning can be achieved with expert advice. To this aim, we specify an experts algorithm with the following characteristics: (a) it uses only feedba...
Jan Poland, Marcus Hutter
ICALP
2009
Springer
14 years 2 months ago
Qualitative Concurrent Stochastic Games with Imperfect Information
Abstract. We study a model of games that combines concurrency, imperfect information and stochastic aspects. Those are finite states games in which, at each round, the two players...
Vincent Gripon, Olivier Serre
SAGT
2009
Springer
102views Game Theory» more  SAGT 2009»
14 years 2 months ago
Free-Riding and Free-Labor in Combinatorial Agency
Abstract. This paper studies a setting where a principal needs to motivate teams of agents whose efforts lead to an outcome that stochastically depends on the combination of agent...
Moshe Babaioff, Michal Feldman, Noam Nisan
CIDM
2007
IEEE
14 years 2 months ago
Application of Neural Networks for Data Modeling of Power Systems with Time Varying Nonlinear Loads
— Nowadays power distribution systems typically operate with nonsinusoidal voltages and currents. Harmonic currents from nonlinear loads propagate through the system and cause ha...
Joy Mazumdar, Ganesh K. Venayagamoorthy, Ronald G....