Search Sciweavers | Sciweavers

CG
2006
Springer

Feature Construction for Reinforcement Learning in Hearts

15 years 4 months ago

Temporal difference (TD) learning has been used to learn strong evaluation functions in a variety of two-player games. TD-Gammon illustrated how the combination of game tree search...

Nathan R. Sturtevant, Adam M. White

ECML
2007
Springer

Transfer Learning in Reinforcement Learning Problems Through Partial Policy Recycling

15 years 8 months ago

Download dtai.cs.kuleuven.be

In this paper we investigate the relation between transfer learning in reinforcement learning with function approximation and supervised learning with concept drift. We present a n...

Jan Ramon, Kurt Driessens, Tom Croonenborghs

AAAI
2011

Optimal Rewards versus Leaf-Evaluation Heuristics in Planning Agents

14 years 2 months ago

Download www.eecs.umich.edu

Planning agents often lack the computational resources needed to build full planning trees for their environments. Agent designers commonly overcome this finite-horizon approxima...

Jonathan Sorg, Satinder P. Singh, Richard L. Lewis

CEC
2011
IEEE

Cost-benefit analysis of using heuristics in ACGP

14 years 2 months ago

Download mercury.webster.edu

Constrained Genetic Programming (CGP) is a method of searching the Genetic Programming search space non-uniformly, giving preferences to certain subspaces according to some heur...

John W. Aleshunas, Cezary Z. Janikow

BMCBI
2002

RIO: Analyzing proteomes by automated phylogenomics using resampled inference of orthologs

15 years 2 months ago

Download www.biomedcentral.com

Background: When analyzing protein sequences using sequence similarity searches, orthologous sequences (that diverged by speciation) are more reliable predictors of a new protein...

Christian M. Zmasek, Sean R. Eddy
