1262 search results - page 118 / 253 » Reinforcement Learning: An Introduction
A Reinforcement Learning Approach for Multiagent Navigation
Francisco Martinez-Gil, Fernando Barber, Miguel Lo...
ICAART 2010, INSTICC, Intelligent Agents
136 views, 16 years 13 days ago
Download: scalab.uc3m.es

Exploiting Similarity Information in Reinforcement Learning - Similarity Models for Multi-Armed Bandits and MDPs
Ronald Ortner
ICAART 2010, INSTICC, Intelligent Agents
222 views, 16 years 13 days ago
Download: personal.unileoben.ac.at

A Cautious Approach to Generalization in Reinforcement Learning
Raphael Fonteneau, Susan A. Murphy, Louis Wehenkel...
ICAART 2010, INSTICC, Intelligent Agents
288 views, 16 years 13 days ago
Download: www.montefiore.ulg.ac.be

A Bayesian reinforcement learning approach for customizing human-robot interfaces
Amin Atrash, Joelle Pineau
IUI 2009, ACM, Software Engineering
110 views, 15 years 10 months ago
Download: www.cs.mcgill.ca

Postponed Updates for Temporal-Difference Reinforcement Learning
Harm van Seijen, Shimon Whiteson
ISDA 2009, IEEE, Operating System
144 views, 15 years 10 months ago
Download: www.science.uva.nl
This paper presents postponed updates, a new strategy for TD methods that can improve sample efficiency without incurring the computational and space requirements of model-based ...