Search Sciweavers | Sciweavers

945 search results - page 119 / 189

» Dialog Convergence and Learning

ITNG
2007
IEEE

118 views | Information Technology

Input Fuzzy Modeling for the Recognition of Handwritten Hindi Numerals

16 years 1 month ago

Download eprints.qut.edu.au

This paper presents the recognition of Handwritten Hindi Numerals based on the modified exponential membership function fitted to the fuzzy sets derived from normalized distance f...

Madasu Hanmandlu, J. Grover, Vamsi Krishna Madasu,...

AI
2007
Springer

183 views | Artificial Intelligence

Competition and Coordination in Stochastic Games

16 years 1 month ago

Download www.damas.ift.ulaval.ca

Agent competition and coordination are two classical and most important tasks in multiagent systems. In recent years, there was a number of learning algorithms proposed to resolve ...

Andriy Burkov, Abdeslam Boularias, Brahim Chaib-dr...

ATAL
2003
Springer

176 views | Intelligent Agents

A selection-mutation model for q-learning in multi-agent systems

16 years 21 days ago

Download www.personeel.unimaas.nl

Although well understood in the single-agent framework, the use of traditional reinforcement learning (RL) algorithms in multi-agent systems (MAS) is not always justified. The fe...

Karl Tuyls, Katja Verbeeck, Tom Lenaerts

NIPS
2008

110 views | Information Technology

Signal-to-Noise Ratio Analysis of Policy Gradient Algorithms

15 years 9 months ago

Download groups.csail.mit.edu

Policy gradient (PG) reinforcement learning algorithms have strong (local) convergence guarantees, but their learning performance is typically limited by a large variance in the e...

John W. Roberts, Russ Tedrake

ICMLA
2003

123 views | Machine Learning

The Consolidation of Neural Network Task Knowledge

15 years 8 months ago

Download plato.acadiau.ca

Fundamental to the problem of lifelong machine learning is how to consolidate the knowledge of a learned task within a long-term memory structure (domain knowledge) without the...

Daniel L. Silver, Peter McCracken
