Search Sciweavers | Sciweavers

1235 search results - page 239 / 247

» Reinforcement learning in a nutshell

127

click to vote

COGSR
2011

71views more COGSR 2011»

Psychological models of human and optimal performance in bandit problems

14 years 10 months ago

Download www.socsci.uci.edu

In bandit problems, a decision-maker must choose between a set of alternatives, each of which has a ﬁxed but unknown rate of reward, to maximize their total number of rewards ov...

Michael D. Lee, Shunan Zhang, Miles Munro, Mark St...

claim paper

Read More »

107

click to vote

JAIR
2011

187views more JAIR 2011»

A Monte-Carlo AIXI Approximation

14 years 10 months ago

Download www.hutter1.net

This paper describes a computationally feasible approximation to the AIXI agent, a universal reinforcement learning agent for arbitrary environments. AIXI is scaled down in two ke...

Joel Veness, Kee Siong Ng, Marcus Hutter, William ...

claim paper

Read More »

173

click to vote

JCST
2010

109views more JCST 2010»

The Inverse Classification Problem

14 years 10 months ago

Download 210.14.113.38

In this paper, we examine an emerging variation of the classification problem, which is known as the inverse classification problem. In this problem, we determine the features to b...

Charu C. Aggarwal, Chen Chen, Jiawei Han

claim paper

Read More »

132

click to vote

AGI
2011

286views Artificial Intelligence» more AGI 2011»

Comparing Humans and AI Agents

14 years 6 months ago

Download users.dsic.upv.es

Comparing humans and machines is one important source of information about both machine and human strengths and limitations. Most of these comparisons and competitions are performe...

Javier Insa-Cabrera, David L. Dowe, Sergio Espa&nt...

claim paper

Read More »

147

Voted

CIMCA
2008
IEEE

125views Intelligent Agents» more CIMCA 2008»

Tree Exploration for Bayesian RL Exploration

15 years 9 months ago

Download arxiv.org

Research in reinforcement learning has produced algorithms for optimal decision making under uncertainty that fall within two main types. The ﬁrst employs a Bayesian framework, ...

Christos Dimitrakakis

posted by olethros

Read More »

« Prev « First page 239 / 247 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers