Search Sciweavers | Sciweavers

AIIDE
2008

186 views, Artificial Intelligence

Combining Model-Based Meta-Reasoning and Reinforcement Learning for Adapting Game-Playing Agents

15 years 9 months ago

Human experience with interactive games will be enhanced if the software agents that play the game learn from their failures. Techniques such as reinforcement learning provide one...

Patrick Ulam, Joshua Jones, Ashok K. Goel

ICMLA
2004

109 views, Machine Learning

Variable resolution discretization in the joint space

15 years 8 months ago

Download highentropy.com

We present JoSTLe, an algorithm that performs value iteration on control problems with continuous actions, allowing this useful reinforcement learning technique to be applied to p...

Christopher K. Monson, David Wingate, Kevin D. Sep...

AAMAS
2002
Springer

130 views, Intelligent Agents

Relational Reinforcement Learning for Agents in Worlds with Objects

15 years 7 months ago

Download www-ai.ijs.si

In reinforcement learning, an agent tries to learn a policy, i.e., how to select an action in a given state of the environment, so that it maximizes the total amount of reward it ...

Saso Dzeroski

IEAAIE
2001
Springer

98 views, Artificial Intelligence

On the Relationship between Learning Capability and the Boltzmann-Formula

15 years 12 months ago

Download members.iif.hu

In this paper a combined use of reinforcement learning and simulated annealing is treated. Most of the simulated annealing methods suggest using heuristic temperature bounds as the...

Péter Stefán, Laszlo Monostori

IJCAI
2001

163 views, Artificial Intelligence

Exploiting Multiple Secondary Reinforcers in Policy Gradient Reinforcement Learning

15 years 8 months ago

Download www.cs.colorado.edu

Most formulations of Reinforcement Learning depend on a single reinforcement reward value to guide the search for the optimal policy solution. If observation of this reward is rar...

Gregory Z. Grudic, Lyle H. Ungar
