Search Sciweavers | Sciweavers

108

ICANN
2007
Springer

95views Neural Networks» more ICANN 2007»

Solving Deep Memory POMDPs with Recurrent Policy Gradients

15 years 10 months ago

Abstract. This paper presents Recurrent Policy Gradients, a modelfree reinforcement learning (RL) method creating limited-memory stochastic policies for partially observable Markov...

Daan Wierstra, Alexander Förster, Jan Peters,...

claim paper

Read More »

108

click to vote

WSC
2007

106views Modeling And Simulation» more WSC 2007»

Simulation of scheduled ordering policies in distribution supply chains

15 years 6 months ago

Download www.informs-sim.org

In this paper we study a decentralized distribution supply chain with one supplier and many newsvendor-type retailers that face exogenous end-customer demands. Using total supply ...

Lucy G. Chen, Srinagesh Gavirneni

claim paper

Read More »

128

click to vote

IJCAI
2007

135views Artificial Intelligence» more IJCAI 2007»

Using Learned Policies in Heuristic-Search Planning

15 years 5 months ago

Download www2.parc.com

Many current state-of-the-art planners rely on forward heuristic search. The success of such search typically depends on heuristic distance-to-the-goal estimates derived from the ...

Sung Wook Yoon, Alan Fern, Robert Givan

claim paper

Read More »

127

click to vote

POLICY
2007
Springer

90views Computer Networks» more POLICY 2007»

A Socio-cognitive Approach to Modeling Policies in Open Environments

15 years 10 months ago

Download www.isi.edu

The richness of today’s electronic communications mirrors physical world: activities such as shopping, business and scientific collaboration are conducted online. Current intera...

Tatyana Ryutov

claim paper

Read More »

123

click to vote

VTC
2007
IEEE

110views Communications» more VTC 2007»

Multi-Channel Radio Resource Distribution Policies in Heterogeneous Traffic Scenarios

15 years 10 months ago

Download www.uwicore.umh.es

—Multi-channel operation in wireless systems has been proposed to increase user throughput and reduce transmission delays. However, multi-channel operation requires adequate reso...

M. Carmen Lucas-Estan, Javier Gozálvez, Joa...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers