Sciweavers

378 search results - page 71 / 76
» Reinforcement Learning for Online Control of Evolutionary Al...
NN
2010
Springer
13 years 6 months ago
Parameter-exploring policy gradients
We present a model-free reinforcement learning method for partially observable Markov decision problems. Our method estimates a likelihood gradient by sampling directly in paramet...
Frank Sehnke, Christian Osendorfer, Thomas Rü...
AAAI
2007
13 years 9 months ago
MasDISPO: A Multiagent Decision Support System for Steel Production and Control
In the majority of cases, steel production constitutes the inception of the Supply Chains they are involved just as in automotive clusters or aerospace. Steel manufacturing compan...
Sven Jacobi, Esteban León-Soto, Cristiá...
IJCAI
2007
13 years 9 months ago
Fast Image Alignment Using Anytime Algorithms
Image alignment refers to finding the best transformation from a fixed reference image to a new image of a scene. This process is often guided by similarity measures between ima...
Rupert Brooks, Tal Arbel, Doina Precup
ICDM
2002
IEEE
14 years 14 days ago
Progressive Modeling
Presently, inductive learning is still performed in a frustrating batch process. The user has little interaction with the system and no control over the final accuracy and traini...
Wei Fan, Haixun Wang, Philip S. Yu, Shaw-hwa Lo, S...
SIGIR
2011
ACM
12 years 10 months ago
Social context summarization
We study a novel problem of social context summarization for Web documents. Traditional summarization research has focused on extracting informative sentences from standard docume...
Zi Yang, Keke Cai, Jie Tang, Li Zhang, Zhong Su, J...