Sciweavers

94 search results - page 14 / 19
» Sequential cost-sensitive decision making with reinforcement...
ROBOCUP
2005
Springer
14 years 29 days ago
Sequential Pattern Mining for Situation and Behavior Prediction in Simulated Robotic Soccer
Agents in dynamic environments have to deal with world representations that change over time. In or... (To appear in: RoboCup 2005: Robot Soccer World Cup IX, © Springer-Verlag, 2006.)
Andreas D. Lattner, Andrea Miene, Ubbo Visser, Ott...
IJCAI
2007
13 years 9 months ago
Using Linear Programming for Bayesian Exploration in Markov Decision Processes
A key problem in reinforcement learning is finding a good balance between the need to explore the environment and the need to gain rewards by exploiting existing knowledge. Much ...
Pablo Samuel Castro, Doina Precup
COLT
2006
Springer
13 years 11 months ago
Online Learning with Constraints
In this paper, we study a sequential decision making problem. The objective is to maximize the total reward while satisfying constraints, which are defined at every time step. The...
Shie Mannor, John N. Tsitsiklis
AAAI
2008
13 years 9 months ago
Online Learning with Expert Advice and Finite-Horizon Constraints
In this paper, we study a sequential decision making problem. The objective is to maximize the average reward accumulated over time subject to temporal cost constraints. The novel...
Branislav Kveton, Jia Yuan Yu, Georgios Theocharou...
ATAL
2005
Springer
14 years 1 month ago
An integrated framework for adaptive reasoning about conversation patterns
We present an integrated approach for reasoning about and learning conversation patterns in multiagent communication. The approach is based on the assumption that information abou...
Michael Rovatsos, Felix A. Fischer, Gerhard Weiß...