Sciweavers
Explore
Publications
Books
Software
Tutorials
Presentations
Lectures Notes
Datasets
Labs
Conferences
Community
Upcoming
Conferences
Top Ranked Papers
Most Viewed Conferences
Conferences by Acronym
Conferences by Subject
Conferences by Year
Tools
Sci2ools
International Keyboard
Graphical Social Symbols
CSS3 Style Generator
OCR
Web Page to Image
Web Page to PDF
Merge PDF
Split PDF
Latex Equation Editor
Extract Images from PDF
Convert JPEG to PS
Convert Latex to Word
Convert Word to PDF
Image Converter
PDF Converter
Community
Sciweavers
About
Terms of Use
Privacy Policy
Cookies
1262
search results - page 118 / 253
»
Reinforcement Learning: An Introduction
Sort
relevance
views
votes
recent
update
View
thumb
title
17
click to vote
ICAART
2010
INSTICC
136
views
Intelligent Agents
»
more
ICAART 2010
»
A Reinforcement Learning Approach for Multiagent Navigation
14 years 5 months ago
Download
scalab.uc3m.es
Francisco Martinez-Gil, Fernando Barber, Miguel Lo...
claim paper
Read More »
29
click to vote
ICAART
2010
INSTICC
222
views
Intelligent Agents
»
more
ICAART 2010
»
Exploiting Similarity Information in Reinforcement Learning - Similarity Models for Multi-Armed Bandits and MDPs
14 years 5 months ago
Download
personal.unileoben.ac.at
Ronald Ortner
claim paper
Read More »
29
click to vote
ICAART
2010
INSTICC
288
views
Intelligent Agents
»
more
ICAART 2010
»
A Cautious Approach to Generalization in Reinforcement Learning
14 years 5 months ago
Download
www.montefiore.ulg.ac.be
Raphael Fonteneau, Susan A. Murphy, Louis Wehenkel...
claim paper
Read More »
24
click to vote
IUI
2009
ACM
110
views
Software Engineering
»
more
IUI 2009
»
A bayesian reinforcement learning approach for customizing human-robot interfaces
14 years 3 months ago
Download
www.cs.mcgill.ca
Amin Atrash, Joelle Pineau
claim paper
Read More »
24
click to vote
ISDA
2009
IEEE
144
views
Operating System
»
more
ISDA 2009
»
Postponed Updates for Temporal-Difference Reinforcement Learning
14 years 2 months ago
Download
www.science.uva.nl
This paper presents postponed updates, a new strategy for TD methods that can improve sample efficiency without incurring the computational and space requirements of model-based ...
Harm van Seijen, Shimon Whiteson
claim paper
Read More »
« Prev
« First
page 118 / 253
Last »
Next »