In this paper we examine the problem of estimating the parameters of a multinomial distribution over a large number of discreteoutcomes,most of which do not appearin the training ...
We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...
We study the problem of agents attempting to find quality service providers in a distributed environment. While referrals from other agents can be used to locate high-quality prov...
Recent evidence suggests that some characteristics of computer and telecommunications systems may be well described using heavy tailed distributions — distributions whose tail d...
The use of net ecosystem exchange (NEE) data to estimate carbon transfer coefficients is investigated in the context of a deterministic compartmental carbon sequestration system. ...