A feature-oriented product line is a family of programs that share a common set of features. A feature implements a stakeholder's requirement and represents a design deci
We propose two algorithms for Q-learning that use the two-timescale stochastic approximation methodology. The first of these updates Q-values of all feasible state