Search Sciweavers | Sciweavers

209 search results - page 30 / 42

» Optimization and Convergence of Observation Channels in Stoc...

IJCAI
2003

173 views, Artificial Intelligence

A Planning Algorithm for Predictive State Representations

15 years 7 months ago

Download dli.iiit.ac.in

We address the problem of optimally controlling stochastic environments that are partially observable. The standard method for tackling such problems is to define and solve a Part...

Masoumeh T. Izadi, Doina Precup

ICML
2000
IEEE

126 views, Machine Learning

Reinforcement Learning in POMDP's via Direct Gradient Ascent

16 years 6 months ago

Download reference.kfupm.edu.sa

This paper discusses theoretical and experimental aspects of gradient-based approaches to the direct optimization of policy performance in controlled POMDPs. We introduce GPOMDP, a...

Jonathan Baxter, Peter L. Bartlett

GECCO
2007
Springer

160 views, Optimization

An analysis of constructive crossover and selection pressure in genetic programming

15 years 12 months ago

Download www.cs.bham.ac.uk

A common problem in genetic programming search algorithms is destructive crossover, in which the offspring of good parents generally has worse performance than the parents. Design...

Huayang Xie, Mengjie Zhang, Peter Andreae

JSAC
2007

189 views

Non-Cooperative Power Control for Wireless Ad Hoc Networks with Repeated Games

15 years 5 months ago

Download www.cs.ust.hk

One of the distinctive features of a wireless ad hoc network is the lack of any central controller or single point of authority, in which each node/link then makes its own decision...

Chengnian Long, Qian Zhang, Bo Li, Huilong Yang, X...

CORR
2007
Springer

73 views, Education

Universal Reinforcement Learning

15 years 5 months ago

Download www.stanford.edu

We consider an agent interacting with an unmodeled environment. At each time, the agent makes an observation, takes an action, and incurs a cost. Its actions can influence futu...

Vivek F. Farias, Ciamac Cyrus Moallemi, Tsachy Wei...
