Search Sciweavers | Sciweavers

86 search results - page 8 / 18

» Estimation and Approximation Bounds for Gradient-Based Reinf...

260

click to vote

PKDD
2010
Springer

164views Data Mining» more PKDD 2010»

Efficient Planning in Large POMDPs through Policy Graph Based Factorized Approximations

15 years 4 months ago

Download users.ics.tkk.fi

Partially observable Markov decision processes (POMDPs) are widely used for planning under uncertainty. In many applications, the huge size of the POMDP state space makes straightf...

Joni Pajarinen, Jaakko Peltonen, Ari Hottinen, Mik...

claim paper

Read More »

208

click to vote

NIPS
2007

158views Information Technology» more NIPS 2007»

Reinforcement Learning in Continuous Action Spaces through Sequential Monte Carlo Methods

15 years 8 months ago

Download books.nips.cc

Learning in real-world domains often requires to deal with continuous state and action spaces. Although many solutions have been proposed to apply Reinforcement Learning algorithm...

Alessandro Lazaric, Marcello Restelli, Andrea Bona...

claim paper

Read More »

174

Voted

ECML
2004
Springer

77views Machine Learning» more ECML 2004»

Filtered Reinforcement Learning

16 years 12 days ago

Download eprints.pascal-network.org

Reinforcement learning (RL) algorithms attempt to assign the credit for rewards to the actions that contributed to the reward. Thus far, credit assignment has been done in one of t...

Douglas Aberdeen

claim paper

Read More »

260

click to vote

JMLR
2010

148views more JMLR 2010»

A Generalized Path Integral Control Approach to Reinforcement Learning

15 years 1 months ago

Download jmlr.csail.mit.edu

With the goal to generate more scalable algorithms with higher efficiency and fewer open parameters, reinforcement learning (RL) has recently moved towards combining classical tec...

Evangelos Theodorou, Jonas Buchli, Stefan Schaal

claim paper

Read More »

199

click to vote

ICML
2005
IEEE

136views Machine Learning» more ICML 2005»

Learning as search optimization: approximate large margin methods for structured prediction

16 years 7 months ago

Download www.isi.edu

Mappings to structured output spaces (strings, trees, partitions, etc.) are typically learned using extensions of classification algorithms to simple graphical structures (eg., li...

Daniel Marcu, Hal Daumé III

claim paper

Read More »

« Prev « First page 8 / 18 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers