Search Sciweavers | Sciweavers

111 search results - page 20 / 23

» Reinforcement Learning for Operational Space Control

124

NN
2010
Springer

125 views, Neural Networks

Parameter-exploring policy gradients

15 years 1 month ago

Download www.kyb.mpg.de

We present a model-free reinforcement learning method for partially observable Markov decision problems. Our method estimates a likelihood gradient by sampling directly in paramet...

Frank Sehnke, Christian Osendorfer, Thomas Rü...

125

AGENTS
2000
Springer

119 views, Security Privacy

Adaptivity in agent-based routing for data networks

15 years 7 months ago

Download web.engr.oregonstate.edu

Adaptivity, both of the individual agents and of the interaction structure among the agents, seems indispensable for scaling up multi-agent systems (MAS's) in noisy environme...

David Wolpert, Sergey Kirshner, Christopher J. Mer...

146

UAI
2008

242 views, Artificial Intelligence

Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping

15 years 4 months ago

Download uai2008.cs.helsinki.fi

We consider the problem of efficiently learning optimal control policies and value functions over large state spaces in an online setting in which estimates must be available afte...

Richard S. Sutton, Csaba Szepesvári, Alborz...

128

AAAI
2008

204 views, Intelligent Agents

Adaptive Management of Air Traffic Flow: A Multiagent Coordination Approach

15 years 4 months ago

Download www.aaai.org

This paper summarizes recent advances in the application of multiagent coordination algorithms to air traffic flow management. Indeed, air traffic flow management is one of the fu...

Kagan Tumer, Adrian K. Agogino

113

ISORC
2000
IEEE

144 views, Distributed And Parallel Com...

Establishing a Data-Mining Environment for Wartime Event Prediction with an Object-Oriented Command and Control Database

15 years 7 months ago

Download www.spawar.navy.mil

This paper documents progress to date on a research project, the goal of which is wartime event prediction. The paper describes the operational concept, the datamining environment...

Marion G. Ceruti, S. Joe McCarthy
