Search Sciweavers | Sciweavers

75 search results - page 14 / 15

» A Predictive Model for Imitation Learning in Partially Obser...

115

Voted

JAIR
2011

187views more JAIR 2011»

A Monte-Carlo AIXI Approximation

14 years 10 months ago

Download www.hutter1.net

This paper describes a computationally feasible approximation to the AIXI agent, a universal reinforcement learning agent for arbitrary environments. AIXI is scaled down in two ke...

Joel Veness, Kee Siong Ng, Marcus Hutter, William ...

claim paper

Read More »

137

Voted

NECO
2007

150views more NECO 2007»

Reinforcement Learning, Spike-Time-Dependent Plasticity, and the BCM Rule

15 years 3 months ago

Download eprints.pascal-network.org

Learning agents, whether natural or artiﬁcial, must update their internal parameters in order to improve their behavior over time. In reinforcement learning, this plasticity is ...

Dorit Baras, Ron Meir

claim paper

Read More »

142

click to vote

KDD
2009
ACM

191views Data Mining» more KDD 2009»

Improving data mining utility with projective sampling

15 years 8 months ago

Download www-ai.cs.uni-dortmund.de

Overall performance of the data mining process depends not just on the value of the induced knowledge but also on various costs of the process itself such as the cost of acquiring...

Mark Last

claim paper

Read More »

148

Voted

ENVSOFT
2008

175views more ENVSOFT 2008»

Automated regression-based statistical downscaling tool

15 years 3 months ago

Download www.cen.ulaval.ca

Many impact studies require climate change information at a finer resolution than that provided by Global Climate Models (GCMs). In the last 10 years, downscaling techniques, both...

Masoud Hessami, Philippe Gachon, Taha B. M. J. Oua...

claim paper

Read More »

144

click to vote

AAAI
2008

169views Intelligent Agents» more AAAI 2008»

Perpetual Learning for Non-Cooperative Multiple Agents

15 years 6 months ago

Download www.aaai.org

This paper examines, by argument, the dynamics of sequences of behavioural choices made, when non-cooperative restricted-memory agents learn in partially observable stochastic gam...

Luke Dickens

claim paper

Read More »

« Prev « First page 14 / 15 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers