Search Sciweavers | Sciweavers

377 search results - page 27 / 76

» Optimizing Production Manufacturing Using Reinforcement Lear...

166

click to vote

ATAL
2007
Springer

130views Intelligent Agents» more ATAL 2007»

Theoretical advantages of lenient Q-learners: an evolutionary game theoretic perspective

16 years 29 days ago

Download www.aamas-conference.org

This paper presents the dynamics of multiple reinforcement learning agents from an Evolutionary Game Theoretic (EGT) perspective. We provide a Replicator Dynamics model for tradit...

Liviu Panait, Karl Tuyls

claim paper

Read More »

187

click to vote

AAAI
2000

139views Intelligent Agents» more AAAI 2000»

Localizing Search in Reinforcement Learning

15 years 8 months ago

Download www.cs.colorado.edu

Reinforcement learning (RL) can be impractical for many high dimensional problems because of the computational cost of doing stochastic search in large state spaces. We propose a ...

Gregory Z. Grudic, Lyle H. Ungar

claim paper

Read More »

209

click to vote

IAT
2005
IEEE

180views Intelligent Agents» more IAT 2005»

Self-Organizing Cognitive Agents and Reinforcement Learning in Multi-Agent Environment

16 years 12 days ago

Download www3.ntu.edu.sg

This paper presents a self-organizing cognitive architecture, known as TD-FALCON, that learns to function through its interaction with the environment. TD-FALCON learns the value ...

Ah-Hwee Tan, Dan Xiao

claim paper

Read More »

163

click to vote

ICML
2006
IEEE

101views Machine Learning» more ICML 2006»

Qualitative reinforcement learning

16 years 7 months ago

Download www.cs.uiuc.edu

When the transition probabilities and rewards of a Markov Decision Process are specified exactly, the problem can be solved without any interaction with the environment. When no s...

Arkady Epshteyn, Gerald DeJong

claim paper

Read More »

169

click to vote

ESANN
2003

152views Neural Networks» more ESANN 2003»

Improving iterative repair strategies for scheduling with the SVM

15 years 8 months ago

Download www2.in.tu-clausthal.de

The resource constraint project scheduling problem (RCPSP) is an NP-hard benchmark problem in scheduling which takes into account the limitation of resources’ availabilities in ...

Kai Gersmann, Barbara Hammer

claim paper

Read More »

« Prev « First page 27 / 76 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers