Search Sciweavers | Sciweavers


EOR
2006

81 views

Optimal and near-optimal policies for lost sales inventory models with at most one replenishment order outstanding

15 years 6 months ago

In this paper we use policy-iteration to explore the behaviour of optimal control policies for lost sales inventory models with the constraint that not more than one replenishment...

Roger M. Hill, Søren Glud Johansen


INFOCOM
2005
IEEE

173 views · Communications

Asymptotically optimal transmission policies for low-power wireless sensor networks

16 years 8 days ago

Download people.bu.edu

We consider wireless sensor networks with multiple gateways and multiple classes of traffic carrying data generated by different sensory inputs. The objective is to devise joi...

Ioannis Ch. Paschalidis, Wei Lai, David Starobinsk...


POLICY
2004
Springer

88 views · Computer Networks

Responding to Policies at Runtime in TrustBuilder

16 years 1 day ago

Download www.itr-rescue.org

Automated trust negotiation is the process of establishing trust between entities with no prior relationship through the iterative disclosure of digital credentials. One approach ...

Bryan Smith, Kent E. Seamons, Michael D. Jones


IJCAI
2001

163 views · Artificial Intelligence

Exploiting Multiple Secondary Reinforcers in Policy Gradient Reinforcement Learning

15 years 8 months ago

Download www.cs.colorado.edu

Most formulations of Reinforcement Learning depend on a single reinforcement reward value to guide the search for the optimal policy solution. If observation of this reward is rar...

Gregory Z. Grudic, Lyle H. Ungar


ICML
2009
IEEE

194 views · Machine Learning

Binary action search for learning continuous-action control policies

16 years 7 months ago

Download www.intelligence.tuc.gr

Reinforcement Learning methods for controlling stochastic processes typically assume a small and discrete action space. While continuous action spaces are quite common in real-wor...

Jason Pazis, Michail G. Lagoudakis
