Search Sciweavers | Sciweavers

282 search results - page 36 / 57

» Online Learning of Approximate Dependency Parsing Algorithms


ECML
2007
Springer

167 views, Machine Learning

Efficient Continuous-Time Reinforcement Learning with Adaptive State Graphs

15 years 8 months ago

Download www.igi.tugraz.at

Abstract. We present a new reinforcement learning approach for deterministic continuous control problems in environments with unknown, arbitrary reward functions. The difficulty of...

Gerhard Neumann, Michael Pfeiffer, Wolfgang Maass


ECAI
2006
Springer

245 views, Artificial Intelligence

Least Squares SVM for Least Squares TD Learning

15 years 8 months ago

Download homepages.feis.herts.ac.uk

Abstract. We formulate the problem of least squares temporal difference learning (LSTD) in the framework of least squares SVM (LS-SVM). To cope with the large amount (and possible ...

Tobias Jung, Daniel Polani


IMC
2009
ACM

152 views, Internet Technology

Scalable proximity estimation and link prediction in online social networks

15 years 11 months ago

Download userweb.cs.utexas.edu

Proximity measures quantify the closeness or similarity between nodes in a social network and form the basis of a range of applications in social sciences, business, information t...

Han Hee Song, Tae Won Cho, Vacha Dave, Yin Zhang, ...


PKDD
2010
Springer

179 views, Data Mining

Gaussian Processes for Sample Efficient Reinforcement Learning with RMAX-Like Exploration

15 years 2 months ago

Download www.cs.utexas.edu

Abstract. We present an implementation of model-based online reinforcement learning (RL) for continuous domains with deterministic transitions that is specifically designed to achi...

Tobias Jung, Peter Stone


ESANN
2000

152 views, Neural Networks

An algorithm for the addition of time-delayed connections to recurrent neural networks

15 years 5 months ago

Download www.dice.ucl.ac.be

Recurrent neural networks possess interesting universal approximation capabilities, making them good candidates for time series modeling. Unfortunately, long term dependencies ar...

Romuald Boné, Michel Crucianu, Jean Pierre ...
