» Efficient learning equilibrium

AAAI
2010

Integrating Sample-Based Planning and Model-Based Reinforcement Learning

Recent advancements in model-based reinforcement learning have shown that the dynamics of many structured domains (e.g. DBNs) can be learned with tractable sample complexity, desp...

Thomas J. Walsh, Sergiu Goschin, Michael L. Littma...

IJCAI
2001

R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning

R-max is a very simple model-based reinforcement learning algorithm which can attain near-optimal average reward in polynomial time. In R-max, the agent always maintains a complet...

Ronen I. Brafman, Moshe Tennenholtz

PAMI
1998

Learning Local Languages and Their Application to DNA Sequence Analysis

This paper concerns an efficient algorithm for learning in the limit a special type of regular languages called strictly locally testable languages from positive data, and its a...

Takashi Yokomori, Satoshi Kobayashi

TSMC
2010

Active Learning of Plans for Safety and Reachability Goals With Partial Observability

Traditional planning assumes reachability goals and/or full observability. In this paper, we propose a novel solution for safety and reachability planning with partial observabilit...

Wonhong Nam, Rajeev Alur

PAMI
2012

Task-Driven Dictionary Learning

Modeling data with linear combinations of a few elements from a learned dictionary has been the focus of much recent research in machine learning, neuroscience, and signal proce...

Julien Mairal, Francis Bach, Jean Ponce
