Search Sciweavers | Sciweavers

2075 search results - page 132 / 415

» Learning better transliterations

161

Voted

AAAI
2006

161views Intelligent Agents» more AAAI 2006»

Sample-Efficient Evolutionary Function Approximation for Reinforcement Learning

15 years 6 months ago

Download staff.science.uva.nl

Reinforcement learning problems are commonly tackled with temporal difference methods, which attempt to estimate the agent's optimal value function. In most real-world proble...

Shimon Whiteson, Peter Stone

claim paper

Read More »

126

click to vote

NIPS
2004

138views Information Technology» more NIPS 2004»

New Criteria and a New Algorithm for Learning in Multi-Agent Systems

15 years 6 months ago

Download books.nips.cc

We propose a new set of criteria for learning algorithms in multi-agent systems, one that is more stringent and (we argue) better justified than previous proposed criteria. Our cr...

Rob Powers, Yoav Shoham

claim paper

Read More »

178

Voted

UAI
2003

125views Artificial Intelligence» more UAI 2003»

Robust Independence Testing for Constraint-Based Learning of Causal Structure

15 years 6 months ago

Download www.pitt.edu

This paper considers a method that combines ideas from Bayesian learning, Bayesian network inference, and classical hypothesis testing to produce a more reliable and robust test o...

Denver Dash, Marek J. Druzdzel

claim paper

Read More »

126

click to vote

IJCAI
1993

107views Artificial Intelligence» more IJCAI 1993»

Learning Finite Automata Using Local Distinguishing Experiments

15 years 6 months ago

Download www.isi.edu

One of the open problems listed in Rivest and Schapire, 1989] is whether and how that the copies of L in their algorithm can be combined into one for better performance. This pape...

Wei-Mein Shen

claim paper

Read More »

148

Voted

NIPS
1993

86views Information Technology» more NIPS 1993»

Robust Reinforcement Learning in Motion Planning

15 years 6 months ago

Download www.cs.cmu.edu

While exploring to nd better solutions, an agent performing online reinforcement learning (RL) can perform worse than is acceptable. In some cases, exploration might have unsafe, ...

Satinder P. Singh, Andrew G. Barto, Roderic A. Gru...

claim paper

Read More »

« Prev « First page 132 / 415 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers