Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

159

ICML
2008
IEEE

105views Machine Learning» more ICML 2008»

Learning all optimal policies with multiple criteria

16 years 8 months ago

Learning all optimal policies with multiple criteria

Download leon.barrettnexus.com

We describe an algorithm for learning in the presence of multiple criteria. Our technique generalizes previous approaches in that it can learn optimal policies for all linear preference assignments over the multiple reward criteria at once. The algorithm can be viewed as an extension to standard reinforcement learning for MDPs where instead of repeatedly backing up maximal expected rewards, we back up the set of expected rewards that are maximal for some set of linear preferences (given by a weight vector, -w ). We present the algorithm along with a proof of correctness showing that our solution gives the optimal policy for any linear preference function. The solution reduces to the standard value iteration algorithm for a specific weight vector, -w .

Leon Barrett, Srini Narayanan

Real-time Traffic

ICML 2008 | Linear Preference Assignments | Machine Learning | Maximal Expected Rewards | Value Iteration Algorithm |

claim paper

Related Content

» Multiagent learning using a variable learning rate

» Toward OffPolicy Learning Control with Function Approximation

» Exploiting Multiple Secondary Reinforcers in Policy Gradient Reinforcement Learning

» Distributed WLearning MultiPolicy Optimization in SelfOrganizing Systems

» Using genetic algorithm for dynamic and multiple criteria website optimizations

» Distributed learning in cognitive radio networks Multiarmed bandit with distributed multip...

» MultipleGoal Reinforcement Learning with Modular Sarsa0

» Optimal Buffer Management Policies for Delay Tolerant Networks

» The Cross Entropy Method for Fast Policy Search

Post Info
More Details (n/a)

Added	17 Nov 2009
Updated	17 Nov 2009
Type	Conference
Year	2008
Where	ICML
Authors	Leon Barrett, Srini Narayanan

Comments (0)