Search Sciweavers | Sciweavers

12194 search results - page 118 / 2439

» Numberings Optimal for Learning

ICML
2009
IEEE

155 views, Machine Learning

Near-Bayesian exploration in polynomial time

16 years 5 months ago

Download ai.stanford.edu

We consider the exploration/exploitation problem in reinforcement learning (RL). The Bayesian approach to model-based RL offers an elegant solution to this problem, by considering...

J. Zico Kolter, Andrew Y. Ng

CIKM
2008
Springer

117 views, Information Technology

Suppressing outliers in pairwise preference ranking

15 years 6 months ago

Download www.cs.cmu.edu

Many of the recently proposed algorithms for learning feature-based ranking functions are based on the pairwise preference framework, in which instead of taking documents in isola...

Vitor R. Carvalho, Jonathan L. Elsas, William W. C...

GECCO
2008
Springer

179 views, Optimization

Developing neural structure of two agents that play checkers using cartesian genetic programming

15 years 5 months ago

Download www.cs.bham.ac.uk

A developmental model of neural network is presented and evaluated in the game of Checkers. The network is developed using cartesian genetic programs (CGP) as genotypes. Two agent...

Gul Muhammad Khan, Julian Francis Miller, David M....

GECCO
2008
Springer

178 views, Optimization

Agent Smith: a real-time game-playing agent for interactive dynamic games

15 years 5 months ago

Download www.cs.bham.ac.uk

The goal of this project is to develop an agent capable of learning and behaving autonomously and making decisions quickly in a dynamic environment. The agent’s environment is a...

Ryan K. Small

IROS
2007
IEEE

168 views, Robotics

Improving humanoid locomotive performance with learnt approximated dynamics via Gaussian processes for regression

15 years 10 months ago

Download www.cs.cmu.edu

We propose to improve the locomotive performance of humanoid robots by using approximated biped stepping and walking dynamics with reinforcement learning (RL). Although...

Jun Morimoto, Christopher G. Atkeson, Gen Endo, Go...
