Search Sciweavers | Sciweavers

36 search results - page 6 / 8

» Efficient On-the-Fly Algorithms for Partially Observable Tim...

click to vote

ALT
2005
Springer

137views Machine Learning» more ALT 2005»

Defensive Universal Learning with Experts

14 years 4 months ago

Download www.idsia.ch

This paper shows how universal learning can be achieved with expert advice. To this aim, we specify an experts algorithm with the following characteristics: (a) it uses only feedba...

Jan Poland, Marcus Hutter

claim paper

Read More »

click to vote

ICML
2006
IEEE

136views Machine Learning» more ICML 2006»

An analytic solution to discrete Bayesian reinforcement learning

14 years 8 months ago

Download www.cs.uwaterloo.ca

Reinforcement learning (RL) was originally proposed as a framework to allow agents to learn in an online fashion as they interact with their environment. Existing RL algorithms co...

Pascal Poupart, Nikos A. Vlassis, Jesse Hoey, Kevi...

claim paper

Read More »

click to vote

AAAI
2010

218views Intelligent Agents» more AAAI 2010»

Multi-Agent Plan Recognition: Formalization and Algorithms

13 years 9 months ago

Download orca.st.usm.edu

Multi-Agent Plan Recognition (MAPR) seeks to identify the dynamic team structures and team behaviors from the observations of the activity-sequences of a set of intelligent agents...

Bikramjit Banerjee, Landon Kraemer, Jeremy Lyle

claim paper

Read More »

click to vote

ATAL
2010
Springer

171views Intelligent Agents» more ATAL 2010»

Closing the learning-planning loop with predictive state representations

13 years 8 months ago

Download www.cs.cmu.edu

A central problem in artificial intelligence is to choose actions to maximize reward in a partially observable, uncertain environment. To do so, we must learn an accurate model of ...

Byron Boots, Sajid M. Siddiqi, Geoffrey J. Gordon

claim paper

Read More »

click to vote

ATAL
2008
Springer

99views Intelligent Agents» more ATAL 2008»

Not all agents are equal: scaling up distributed POMDPs for agent networks

13 years 9 months ago

Download teamcore.usc.edu

Many applications of networks of agents, including mobile sensor networks, unmanned air vehicles, autonomous underwater vehicles, involve 100s of agents acting collaboratively und...

Janusz Marecki, Tapana Gupta, Pradeep Varakantham,...

claim paper

Read More »

« Prev « First page 6 / 8 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers