Search Sciweavers | Sciweavers

69 search results - page 13 / 14

» PAC-Bayesian Policy Evaluation for Reinforcement Learning

134

click to vote

AAAI
2000

147views Intelligent Agents» more AAAI 2000»

ADVISOR: A Machine Learning Architecture for Intelligent Tutor Construction

15 years 7 months ago

Download www.aaai.org

We have constructed ADVISOR, a two-agent machine learning architecture for intelligent tutoring systems (ITS). The purpose of this architecture is to centralize the reasoning of a...

Joseph Beck, Beverly Park Woolf, Carole R. Beal

claim paper

Read More »

144

click to vote

JNW
2006

63views more JNW 2006»

MAC Contention in a Wireless LAN with Noncooperative Anonymous Stations

15 years 5 months ago

Download www.academypublisher.com

In ad hoc wireless LANs populated by mutually impenetrable groups of anonymous stations, honest stations are prone to "bandwidth stealing" by selfish stations. The proble...

Jerzy Konorski

claim paper

Read More »

189

click to vote

IAT
2010
IEEE

133views Intelligent Agents» more IAT 2010»

Multiagent Meta-level Control for a Network of Weather Radars

15 years 3 months ago

Download coitweb.uncc.edu

It is crucial for embedded systems to adapt to the dynamics of open environments. This adaptation process becomes especially challenging in the context of multiagent systems. In t...

Shanjun Cheng, Anita Raja, Victor R. Lesser

claim paper

Read More »

184

click to vote

JMLR
2006

124views more JMLR 2006»

Policy Gradient in Continuous Time

15 years 5 months ago

Download hal.inria.fr

Policy search is a method for approximately solving an optimal control problem by performing a parametric optimization search in a given class of parameterized policies. In order ...

Rémi Munos

claim paper

Read More »

166

click to vote

CORR
2010
Springer

152views Education» more CORR 2010»

Neuroevolutionary optimization

15 years 5 months ago

Download jmlr.csail.mit.edu

Temporal difference methods are theoretically grounded and empirically effective methods for addressing reinforcement learning problems. In most real-world reinforcement learning ...

Eva Volná

claim paper

Read More »

« Prev « First page 13 / 14 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers