Sciweavers

169

Voted

ICML
2005
IEEE

119views Machine Learning» more ICML 2005»

Dynamic preferences in multi-criteria reinforcement learning

16 years 7 months ago

The current framework of reinforcement learning is based on maximizing the expected returns based on scalar rewards. But in many real world situations, tradeoffs must be made amon...

Sriraam Natarajan, Prasad Tadepalli

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers