Search Sciweavers | Sciweavers

473 search results - page 14 / 95

» Optimal policy switching algorithms for reinforcement learni...

151

ATAL
2009
Springer

137 views, Intelligent Agents

Generalized model learning for reinforcement learning in factored domains

16 years 18 days ago

Download userweb.cs.utexas.edu

Improving the sample efficiency of reinforcement learning algorithms to scale up to larger and more realistic domains is a current research challenge in machine learning. Model-ba...

Todd Hester, Peter Stone

198

ATAL
2005
Springer

181 views, Intelligent Agents

Improving reinforcement learning function approximators via neuroevolution

15 years 11 months ago

Download www.aaai.org

Reinforcement learning problems are commonly tackled with temporal difference methods, which use dynamic programming and statistical sampling to estimate the long-term value of ta...

Shimon Whiteson

173

SASO
2009
IEEE

172 views, Control Systems

Distributed W-Learning: Multi-Policy Optimization in Self-Organizing Systems

16 years 22 days ago

Download www.scss.tcd.ie

Large-scale agent-based systems are required to self-optimize towards multiple, potentially conflicting, policies of varying spatial and temporal scope. As a result, not all ag...

Ivana Dusparic, Vinny Cahill

148

ECML
2005
Springer

101 views, Machine Learning

Model-Based Online Learning of POMDPs

15 years 11 months ago

Download www.cs.bgu.ac.il

Abstract. Learning to act in an unknown partially observable domain is a difficult variant of the reinforcement learning paradigm. Research in the area has focused on model-free m...

Guy Shani, Ronen I. Brafman, Solomon Eyal Shimony

254

Publication

334 views

Rollout Sampling Approximate Policy Iteration

16 years 3 months ago

Download www.springerlink.com

Several researchers have recently investigated the connection between reinforcement learning and classification. We are motivated by proposals of approximate policy iteration schem...

Christos Dimitrakakis, Michail G. Lagoudakis

posted by olethros
