Sciweavers

1578 search results - page 235 / 316
» Algorithmic randomness of continuous functions
Sort
View
ECML
2006
Springer
15 years 6 months ago
Approximate Policy Iteration for Closed-Loop Learning of Visual Tasks
Abstract. Approximate Policy Iteration (API) is a reinforcement learning paradigm that is able to solve high-dimensional, continuous control problems. We propose to exploit API for...
Sébastien Jodogne, Cyril Briquet, Justus H....
108
Voted
ATAL
2008
Springer
15 years 4 months ago
Artificial agents learning human fairness
Recent advances in technology allow multi-agent systems to be deployed in cooperation with or as a service for humans. Typically, those systems are designed assuming individually ...
Steven de Jong, Karl Tuyls, Katja Verbeeck
127
Voted
CGF
2008
129views more  CGF 2008»
15 years 2 months ago
Sequential Monte Carlo Adaptation in Low-Anisotropy Participating Media
This paper presents a novel method that effectively combines both control variates and importance sampling in a sequential Monte Carlo context. The radiance estimates computed dur...
Vincent Pegoraro, Ingo Wald, Steven G. Parker
EAAI
2008
128views more  EAAI 2008»
15 years 2 months ago
Dual heuristic programming based nonlinear optimal control for a synchronous generator
This paper presents the design of an infinite horizon nonlinear optimal neurocontroller that replaces the conventional automatic voltage regulator and the turbine governor (CONVC)...
Jung-Wook Park, Ronald G. Harley, Ganesh K. Venaya...
IAJIT
2007
146views more  IAJIT 2007»
15 years 2 months ago
Adaptive Optimizing of Hello Messages in Wireless Ad-Hoc Networks
: Routing is an important functional aspect in wireless ad-hoc networks that handles discovering and maintaining the paths between nodes within a network. Due to nodes mobility, th...
Essam Natsheh, Adznan B. Jantan, Sabira Khatun, Su...