Search Sciweavers | Sciweavers

1233 search results - page 226 / 247

» Feudal Reinforcement Learning

135

click to vote

CORR
2010
Springer

124views Education» more CORR 2010»

Mimicking the Behaviour of Idiotypic AIS Robot Controllers Using Probabilistic Systems

15 years 5 months ago

Download ima.ac.uk

Previous work has shown that robot navigation systems that employ an architecture based upon the idiotypic network theory of the immune system have an advantage over control techn...

Amanda M. Whitbrook, Uwe Aickelin, Jonathan M. Gar...

claim paper

Read More »

158

Voted

CORR
2010
Springer

126views Education» more CORR 2010»

The Use of Probabilistic Systems to Mimic the Behaviour of Idiotypic AIS Robot Controllers

15 years 5 months ago

Download ima.ac.uk

Previous work has shown that robot navigation systems that employ an architecture based upon the idiotypic network theory of the immune system have an advantage over control techn...

Amanda M. Whitbrook, Uwe Aickelin, Jonathan M. Gar...

claim paper

Read More »

159

click to vote

ICCS
1993
Springer

99views Applied Computing» more ICCS 1993»

Towards Domain-Independent Machine Intelligence

15 years 9 months ago

Download www.soe.ucsc.edu

Adaptive predictive search (APS), is a learning system framework, which given little initial domain knowledge, increases its decision-making abilities in complex problems domains....

Robert Levinson

claim paper

Read More »

174

click to vote

NIPS
2008

110views Information Technology» more NIPS 2008»

Signal-to-Noise Ratio Analysis of Policy Gradient Algorithms

15 years 7 months ago

Download groups.csail.mit.edu

Policy gradient (PG) reinforcement learning algorithms have strong (local) convergence guarantees, but their learning performance is typically limited by a large variance in the e...

John W. Roberts, Russ Tedrake

claim paper

Read More »

138

click to vote

ATAL
2008
Springer

146views Intelligent Agents» more ATAL 2008»

Adaptive Kanerva-based function approximation for multi-agent systems

15 years 7 months ago

Download www.aamas-conference.org

In this paper, we show how adaptive prototype optimization can be used to improve the performance of function approximation based on Kanerva Coding when solving largescale instanc...

Cheng Wu, Waleed Meleis

claim paper

Read More »

« Prev « First page 226 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers