Sciweavers

91 search results - page 4 / 19
» Integrating Reinforcement Learning into a Programming Langua...
Sort
View
ECML
2007
Springer
13 years 9 months ago
Sequence Labeling with Reinforcement Learning and Ranking Algorithms
Many problems in areas such as Natural Language Processing, Information Retrieval, or Bioinformatic involve the generic task of sequence labeling. In many cases, the aim is to assi...
Francis Maes, Ludovic Denoyer, Patrick Gallinari
ICRA
2000
IEEE
101views Robotics» more  ICRA 2000»
14 years 1 days ago
Towards Programming Tools for Robots that Integrate Probabilistic Computation and Learning
This paper describes a programming language extension of C++, called CES, specifically targeted towards mobile robot control. CES’s design is motivated by a recent series of su...
Sebastian Thrun
ATAL
2007
Springer
14 years 1 months ago
Model-based function approximation in reinforcement learning
Reinforcement learning promises a generic method for adapting agents to arbitrary tasks in arbitrary stochastic environments, but applying it to new real-world problems remains di...
Nicholas K. Jong, Peter Stone
ACSE
2000
ACM
14 years 18 hour ago
The information environments program - a new design based IT degree
The University of Queensland has recently established a new design-focused, studio-based IT degree at a new “flexible-learning” campus. The Bachelor of Information Environment...
Michael Docherty, Peter Sutton, Margot Brereton, S...
JMLR
2012
11 years 10 months ago
Contextual Bandit Learning with Predictable Rewards
Contextual bandit learning is a reinforcement learning problem where the learner repeatedly receives a set of features (context), takes an action and receives a reward based on th...
Alekh Agarwal, Miroslav Dudík, Satyen Kale,...