Search Sciweavers | Sciweavers

GECCO
2004
Springer

147 views, Optimization

A Demonstration of Neural Programming Applied to Non-Markovian Problems

14 years 4 months ago

Download cs.gmu.edu

Genetic programming may be seen as a recent incarnation of a long-held goal in evolutionary computation: to develop actual computational devices through evolutionary search. Geneti...

Gabriel Catalin Balan, Sean Luke

NIPS
2003

145 views, Information Technology

A Nonlinear Predictive State Representation

14 years 16 days ago

Download books.nips.cc

Predictive state representations (PSRs) use predictions of a set of tests to represent the state of controlled dynamical systems. One reason why this representation is exciting as...

Matthew R. Rudary, Satinder P. Singh

ICML
2009
IEEE

148 views, Machine Learning

Predictive representations for policy gradient in POMDPs

14 years 12 months ago

Download damas.ift.ulaval.ca

We consider the problem of estimating the policy gradient in Partially Observable Markov Decision Processes (POMDPs) with a special class of policies that are based on Predictive ...

Abdeslam Boularias, Brahim Chaib-draa

ICML
2006
IEEE

136 views, Machine Learning

An analytic solution to discrete Bayesian reinforcement learning

14 years 12 months ago

Download www.cs.uwaterloo.ca

Reinforcement learning (RL) was originally proposed as a framework to allow agents to learn in an online fashion as they interact with their environment. Existing RL algorithms co...

Pascal Poupart, Nikos A. Vlassis, Jesse Hoey, Kevi...

CDC
2008
IEEE

197 views, Control Systems

Dynamic spectrum access policies for cognitive radio

14 years 5 months ago

Download www.ifp.illinois.edu

We study the problem of dynamic spectrum sensing and access in cognitive radio systems as a partially observed Markov decision process (POMDP). A group of cognitive users cooper...

Jayakrishnan Unnikrishnan, Venugopal V. Veeravalli
