state-action pairs | Sciweavers

205

NIPS
2000

127views Information Technology» more NIPS 2000»

Using Free Energies to Represent Q-values in a Multiagent Reinforcement Learning Task

15 years 9 months ago

The problem of reinforcement learning in large factored Markov decision processes is explored. The Q-value of a state-action pair is approximated by the free energy of a product o...

Brian Sallans, Geoffrey E. Hinton

claim paper

Read More »

213

click to vote

ILP
2003
Springer

126views Automated Reasoning» more ILP 2003»

Graph Kernels and Gaussian Processes for Relational Reinforcement Learning

16 years 25 days ago

Download dtai.cs.kuleuven.be

RRL is a relational reinforcement learning system based on Q-learning in relational state-action spaces. It aims to enable agents to learn how to act in an environment that has no ...

Thomas Gärtner, Kurt Driessens, Jan Ramon

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers