509 search results - page 14 / 102 » Compositional Models for Reinforcement Learning
A Novel Reinforcement Model of Birdsong Vocalization Learning
Kenji Doya, Terrence J. Sejnowski
NIPS 1994, Information Technology
papers.cnl.salk.edu
SMART (Stochastic Model Acquisition with ReinforcemenT) Learning Agents: A Preliminary Report
Christopher Child, Kostas Stathis
AAMAS 2005, Intelligent Agents
www.soi.city.ac.uk
Hierarchical Reinforcement Learning and Hidden Markov Models for Task-Oriented Natural Language Generation
Nina Dethlefs, Heriberto Cuayáhuitl
ACL 2011, Computational Linguistics
www.dfki.de
A Real-Time Model-Based Reinforcement Learning Architecture for Robot Control
Todd Hester, Michael Quinlan, Peter Stone
CoRR 2011, Education
www.cs.utexas.edu
Exploration and apprenticeship learning in reinforcement learning
Pieter Abbeel, Andrew Y. Ng
ICML 2005, Machine Learning
ai.stanford.edu
We consider reinforcement learning in systems with unknown dynamics. Algorithms such as E3 (Kearns and Singh, 2002) learn near-optimal policies by using "exploration policies...