Search Sciweavers | Sciweavers

802 search results - page 83 / 161

» Experts in a Markov Decision Process

AAAI
2006

Intelligent Agents

Using Homomorphisms to Transfer Options across Continuous Reinforcement Learning Domains

15 years 7 months ago

Download www.eecs.umich.edu

We examine the problem of Transfer in Reinforcement Learning and present a method to utilize knowledge acquired in one Markov Decision Process (MDP) to bootstrap learning in a mor...

Vishal Soni, Satinder P. Singh

IJCAI
2003

Artificial Intelligence

Taming Decentralized POMDPs: Towards Efficient Policy Computation for Multiagent Settings

15 years 7 months ago

Download dli.iiit.ac.in

The problem of deriving joint policies for a group of agents that maximize some joint reward function can be modeled as a decentralized partially observable Markov decision proces...

Ranjit Nair, Milind Tambe, Makoto Yokoo, David V. ...

EOR
2006

Optimal dynamic assignment of a flexible worker on an open production line with specialists

15 years 5 months ago

Download users.iems.northwestern.edu

This paper models and analyzes serial production lines with specialists at each station and a single, cross-trained floating worker who can work at any station. We formulate Marko...

Linn I. Sennott, Mark P. Van Oyen, Seyed M. R. Ira...

JAIR
2008

Planning with Durative Actions in Stochastic Domains

15 years 5 months ago

Download www.cs.washington.edu

Probabilistic planning problems are typically modeled as a Markov Decision Process (MDP). MDPs, while an otherwise expressive model, allow only for sequential, non-durative action...

Mausam, Daniel S. Weld

GLOBECOM
2009
IEEE

Communications

Dogfight in Spectrum: Jamming and Anti-Jamming in Multichannel Cognitive Radio Systems

15 years 3 months ago

Download web.eecs.utk.edu

Primary user emulation attack in multichannel cognitive radio systems is discussed. An attacker is assumed to be able to send primary-user-like signals during spectrum sensing peri...

Husheng Li, Zhu Han
