Search Sciweavers | Sciweavers

360 search results - page 5 / 72

» Learning Evaluation Functions for Large Acyclic Domains

GECCO
2006
Springer

133 views, Optimization

On-line evolutionary computation for reinforcement learning in stochastic domains

13 years 11 months ago

Download userweb.cs.utexas.edu

In reinforcement learning, an agent interacting with its environment strives to learn a policy that specifies, for each state it may encounter, what action to take. Evolutionary c...

Shimon Whiteson, Peter Stone

BMCBI
2006

78 views

An evaluation of human protein-protein interaction data in the public domain

13 years 7 months ago

Download www.biomedcentral.com

Background: Protein-protein interaction (PPI) databases have become a major resource for investigating biological networks and pathways in cells. A number of publicly available re...

Suresh Mathivanan, Balamurugan Periaswamy, T. K. B...

AAAI
2006

108 views, Intelligent Agents

Using Homomorphisms to Transfer Options across Continuous Reinforcement Learning Domains

13 years 8 months ago

Download www.eecs.umich.edu

We examine the problem of Transfer in Reinforcement Learning and present a method to utilize knowledge acquired in one Markov Decision Process (MDP) to bootstrap learning in a mor...

Vishal Soni, Satinder P. Singh

CORR
2010
Springer

152 views, Education

Neuroevolutionary optimization

13 years 7 months ago

Download jmlr.csail.mit.edu

Temporal difference methods are theoretically grounded and empirically effective methods for addressing reinforcement learning problems. In most real-world reinforcement learning ...

Eva Volná

ACML
2009
Springer

300 views, Machine Learning

Learning Algorithms for Domain Adaptation

14 years 3 days ago

Download www.cs.cmu.edu

A fundamental assumption for any machine learning task is to have training and test data instances drawn from the same distribution while having a sufficiently large number of tra...

Manas A. Pathak, Eric Nyberg
