Search Sciweavers | Sciweavers

94 search results - page 14 / 19

» Sequential cost-sensitive decision making with reinforcement...

141

click to vote

ROBOCUP
2005
Springer

151views Robotics» more ROBOCUP 2005»

Sequential Pattern Mining for Situation and Behavior Prediction in Simulated Robotic Soccer

15 years 11 months ago

Download www.informatik.uni-frankfurt.de

Agents in dynamic environments have to deal with world rep- To appear in: RoboCup 2005: Robot Soccer World Cup IX, c Springer-Verlag, 2006 resentations that change over time. In or...

Andreas D. Lattner, Andrea Miene, Ubbo Visser, Ott...

claim paper

Read More »

169

click to vote

IJCAI
2007

201views Artificial Intelligence» more IJCAI 2007»

Using Linear Programming for Bayesian Exploration in Markov Decision Processes

15 years 7 months ago

Download www.cs.mcgill.ca

A key problem in reinforcement learning is ﬁnding a good balance between the need to explore the environment and the need to gain rewards by exploiting existing knowledge. Much ...

Pablo Samuel Castro, Doina Precup

claim paper

Read More »

171

click to vote

COLT
2006
Springer

63views Machine Learning» more COLT 2006»

Online Learning with Constraints

15 years 9 months ago

Download isaim2008.unl.edu

In this paper, we study a sequential decision making problem. The objective is to maximize the total reward while satisfying constraints, which are defined at every time step. The...

Shie Mannor, John N. Tsitsiklis

claim paper

Read More »

153

click to vote

AAAI
2008

141views Intelligent Agents» more AAAI 2008»

Online Learning with Expert Advice and Finite-Horizon Constraints

15 years 8 months ago

Download www.aaai.org

In this paper, we study a sequential decision making problem. The objective is to maximize the average reward accumulated over time subject to temporal cost constraints. The novel...

Branislav Kveton, Jia Yuan Yu, Georgios Theocharou...

claim paper

Read More »

185

click to vote

ATAL
2005
Springer

148views Intelligent Agents» more ATAL 2005»

An integrated framework for adaptive reasoning about conversation patterns

15 years 11 months ago

Download homepages.inf.ed.ac.uk

We present an integrated approach for reasoning about and learning conversation patterns in multiagent communication. The approach is based on the assumption that information abou...

Michael Rovatsos, Felix A. Fischer, Gerhard Wei&sz...

claim paper

Read More »

« Prev « First page 14 / 19 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers