Sciweavers

98 search results - page 14 / 20
Using Rewards for Belief State Updates in Partially Observab...
NECO
2007
Reinforcement Learning, Spike-Time-Dependent Plasticity, and the BCM Rule
Learning agents, whether natural or artificial, must update their internal parameters in order to improve their behavior over time. In reinforcement learning, this plasticity is ...
Dorit Baras, Ron Meir
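The BCM rule mentioned in the title can be summarized as a weight update Δw = η·x·y·(y − θ), where the threshold θ slides with a running average of y². A minimal sketch, with all parameter values and names chosen for illustration rather than taken from the paper:

```python
# Minimal BCM plasticity sketch (illustrative values, not from the paper):
# Delta w = eta * x * y * (y - theta); in the full rule theta slides toward
# a running average of y^2, held fixed here for simplicity.

def bcm_update(w, x, eta=0.1, theta=1.0):
    """One BCM weight update for presynaptic rate x and weight w."""
    y = w * x                       # postsynaptic activity (linear neuron)
    return w + eta * x * y * (y - theta)

# Potentiation when postsynaptic activity exceeds the threshold,
# depression when it falls below it:
w_hi = bcm_update(1.0, 2.0)         # y = 2.0 > theta, weight grows
w_lo = bcm_update(1.0, 0.5)         # y = 0.5 < theta, weight shrinks
```

The sliding threshold is what stabilizes the rule: sustained high activity raises θ, preventing unbounded potentiation.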
AAAI
2006
Point-based Dynamic Programming for DEC-POMDPs
We introduce point-based dynamic programming (DP) for decentralized partially observable Markov decision processes (DEC-POMDPs), a new discrete DP algorithm for planning strategie...
Daniel Szer, François Charpillet
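For context, the single-agent point-based backup (as in PBVI) is the building block that point-based DP for DEC-POMDPs generalizes. A hedged sketch with made-up tables; the function and table names are illustrative assumptions, not the paper's notation:

```python
# Single-agent point-based backup sketch; all tables below are toy data.

def dot(u, v):
    return sum(x * y for x, y in zip(u, v))

def point_backup(b, Gamma, R, T, O, gamma=0.95):
    """Back up belief point b against alpha-vector set Gamma.

    R[s][a] = reward, T[s][a][sp] = transition, O[sp][o] = observation.
    Returns the best new alpha-vector at b."""
    nS, nA, nO = len(b), len(R[0]), len(O[0])
    best = None
    for a in range(nA):
        alpha = [R[s][a] for s in range(nS)]
        for o in range(nO):
            # pick the old vector that scores best at b for this (a, o)
            cand = max(Gamma, key=lambda g: sum(
                b[s] * sum(T[s][a][sp] * O[sp][o] * g[sp] for sp in range(nS))
                for s in range(nS)))
            for s in range(nS):
                alpha[s] += gamma * sum(
                    T[s][a][sp] * O[sp][o] * cand[sp] for sp in range(nS))
        if best is None or dot(alpha, b) > dot(best, b):
            best = alpha
    return best

# Toy check: with only a zero vector in Gamma, the backup reduces to
# picking the action with the best immediate reward at b.
T = [[[1.0, 0.0], [1.0, 0.0]], [[0.0, 1.0], [0.0, 1.0]]]
O = [[1.0, 0.0], [0.0, 1.0]]
R = [[1.0, 0.0], [1.0, 0.0]]      # action 0 pays 1 in every state
new_alpha = point_backup([0.5, 0.5], [[0.0, 0.0]], R, T, O)
```

The decentralized setting replaces single actions with joint policy trees, but the backup-at-a-point structure is the same idea.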
FLAIRS
2001
Probabilistic Planning for Behavior-Based Robots
Partially Observable Markov Decision Process models (POMDPs) have been applied to low-level robot control. We show how to use POMDPs differently, namely for sensor planning in the ...
Amin Atrash, Sven Koenig
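At the core of any POMDP controller is the Bayes-filter belief update b′(s′) ∝ O(o|s′) Σₛ T(s′|s,a) b(s). A minimal sketch; the two-state transition and observation tables are invented for illustration, not taken from the paper:

```python
# Standard POMDP belief update (Bayes filter); toy tables for illustration.

def belief_update(b, a, o, T, O):
    """b'(sp) proportional to O[sp][o] * sum_s T[s][a][sp] * b[s]."""
    n = len(b)
    bp = [O[sp][o] * sum(T[s][a][sp] * b[s] for s in range(n))
          for sp in range(n)]
    z = sum(bp)                     # probability of observing o
    return [p / z for p in bp]      # normalize to a distribution

# Two states, one action (a=0), two observations.
T = [[[0.9, 0.1]], [[0.2, 0.8]]]    # T[s][a][sp]
O = [[0.8, 0.2], [0.3, 0.7]]        # O[sp][o]
b1 = belief_update([0.5, 0.5], a=0, o=0, T=T, O=O)
```

Starting from a uniform belief, observing o=0 (more likely in state 0) shifts the belief toward state 0.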
ISIPTA
2005
IEEE
Decision making under incomplete data using the imprecise Dirichlet model
The paper presents an efficient solution to decision problems where direct partial information on the distribution of the states of nature is available, either by observations of ...
Lev V. Utkin, Thomas Augustin
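Under Walley's imprecise Dirichlet model, observed counts n_i out of N yield the probability interval [n_i/(N+s), (n_i+s)/(N+s)] for category i, where s is the prior-strength hyperparameter (commonly s = 1 or 2). A small sketch of just this interval computation, separate from the paper's decision procedure:

```python
# Probability interval for one category under Walley's imprecise
# Dirichlet model; the example counts are made up for illustration.

def idm_interval(n_i, N, s=2.0):
    """Lower/upper probability for a category observed n_i times out of N."""
    return n_i / (N + s), (n_i + s) / (N + s)

# 6 successes in 10 trials with s = 2:
p_lo, p_hi = idm_interval(6, 10)    # interval around the MLE 0.6
```

The interval width s/(N+s) shrinks as data accumulates, which is what makes the model usable for decisions under scarce or incomplete data.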
VTC
2008
IEEE
Opportunistic Spectrum Access for Energy-Constrained Cognitive Radios
This paper considers a scenario in which a secondary user makes opportunistic use of a channel allocated to some primary network. The primary network operates in a time-slotted ma...
Anh Tuan Hoang, Ying-Chang Liang, David Tung Chong...
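The slotted setting described above can be caricatured as a sense-then-transmit loop under an energy budget. A toy simulation; the energy costs, idle probability, and function name are all assumptions for illustration, not the paper's model:

```python
import random

# Toy slotted opportunistic-access loop (illustrative parameters only):
# each slot the secondary user spends e_sense to sense the channel and,
# if the primary is idle and energy allows, spends e_tx to transmit.

def run_slots(slots, energy, e_sense=1.0, e_tx=3.0, p_idle=0.6, seed=0):
    """Return the number of successful secondary transmissions."""
    rng = random.Random(seed)
    sent = 0
    for _ in range(slots):
        if energy < e_sense:        # battery exhausted
            break
        energy -= e_sense
        if rng.random() < p_idle and energy >= e_tx:
            energy -= e_tx
            sent += 1
    return sent

sent = run_slots(200, 10.0)         # at most 2 transmissions on 10 units
```

Even this toy version shows the trade-off the paper studies: sensing consumes energy whether or not a transmission opportunity follows.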