Search Sciweavers | Sciweavers

417 search results - page 37 / 84

» The Dynamics of Reinforcement Learning in Cooperative Multia...

click to vote

NIPS
1996

192views Information Technology» more NIPS 1996»

Multidimensional Triangulation and Interpolation for Reinforcement Learning

13 years 9 months ago

Download www.cs.cmu.edu

Dynamic Programming, Q-learning and other discrete Markov Decision Process solvers can be applied to continuous d-dimensional state-spaces by quantizing the state space into an arr...

Scott Davies

claim paper

Read More »

click to vote

ISNN
2007
Springer

116views Neural Networks» more ISNN 2007»

Online Dynamic Value System for Machine Learning

14 years 1 months ago

Download www.ent.ohiou.edu

A novel online dynamic value system for machine learning is proposed in this paper. The proposed system has a dual network structure: data processing network (DPN) and information ...

Haibo He, Janusz A. Starzyk

claim paper

Read More »

click to vote

ECAI
1992
Springer

119views Artificial Intelligence» more ECAI 1992»

Towards a Cooperation Knowledge Level For Collaborative Problem Solving

13 years 11 months ago

Download eprints.ecs.soton.ac.uk

The cooperation knowledge level is a new computer level specifically for multi-agent problem solvers which describes rich and explicit models of common social phenomena. A cooperat...

Nicholas R. Jennings

claim paper

Read More »

click to vote

AAAI
2008

169views Intelligent Agents» more AAAI 2008»

Perpetual Learning for Non-Cooperative Multiple Agents

13 years 10 months ago

Download www.aaai.org

This paper examines, by argument, the dynamics of sequences of behavioural choices made, when non-cooperative restricted-memory agents learn in partially observable stochastic gam...

Luke Dickens

claim paper

Read More »

click to vote

ECML
2007
Springer

167views Machine Learning» more ECML 2007»

Efficient Continuous-Time Reinforcement Learning with Adaptive State Graphs

13 years 11 months ago

Download www.igi.tugraz.at

Abstract. We present a new reinforcement learning approach for deterministic continuous control problems in environments with unknown, arbitrary reward functions. The difficulty of...

Gerhard Neumann, Michael Pfeiffer, Wolfgang Maass

claim paper

Read More »

« Prev « First page 37 / 84 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers