Search Sciweavers | Sciweavers

779 search results - page 58 / 156

» Reinforcement Using Supervised Learning for Policy Generaliz...

Voted

ICPR
2006
IEEE

260views computer vision» more ICPR 2006»

Control Double Inverted Pendulum by Reinforcement Learning with Double CMAC Network

16 years 3 months ago

Download ee2.chit.edu.tw

To accelerate the learning of reinforcement learning, many types of function approximation are used to represent state value. However function approximation reduces the accuracy o...

Siwei Luo, Yu Zheng, Ziang Lv

claim paper

Read More »

151

click to vote

ICDM
2010
IEEE

193views Data Mining» more ICDM 2010»

Supervised Link Prediction Using Multiple Sources

15 years 10 days ago

Download www.cs.utexas.edu

Link prediction is a fundamental problem in social network analysis and modern-day commercial applications such as Facebook and Myspace. Most existing research approaches this pro...

Zhengdong Lu, Berkant Savas, Wei Tang, Inderjit S....

claim paper

Read More »

104

Voted

AIPS
2006

141views Artificial Intelligence» more AIPS 2006»

Combining Stochastic Task Models with Reinforcement Learning for Dynamic Scheduling

15 years 3 months ago

Download www.aaai.org

We view dynamic scheduling as a sequential decision problem. Firstly, we introduce a generalized planning operator, the stochastic task model (STM), which predicts the effects of ...

Malcolm J. A. Strens

claim paper

Read More »

click to vote

ICANN
2007
Springer

95views Neural Networks» more ICANN 2007»

Solving Deep Memory POMDPs with Recurrent Policy Gradients

15 years 8 months ago

Download www.idsia.ch

Abstract. This paper presents Recurrent Policy Gradients, a modelfree reinforcement learning (RL) method creating limited-memory stochastic policies for partially observable Markov...

Daan Wierstra, Alexander Förster, Jan Peters,...

claim paper

Read More »

144

Voted

PKDD
2009
Springer

184views Data Mining» more PKDD 2009»

Boosting Active Learning to Optimality: A Tractable Monte-Carlo, Billiard-Based Algorithm

15 years 7 months ago

Download www.lri.fr

Abstract. This paper focuses on Active Learning with a limited number of queries; in application domains such as Numerical Engineering, the size of the training set might be limite...

Philippe Rolet, Michèle Sebag, Olivier Teyt...

claim paper

Read More »

« Prev « First page 58 / 156 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers