Search Sciweavers | Sciweavers

4973 search results - page 852 / 995

» Probabilistic Algorithms in Robotics

ICML
2003
IEEE

Machine Learning

Planning in the Presence of Cost Functions Controlled by an Adversary

16 years 5 months ago

Download www.cs.cmu.edu

We investigate methods for planning in a Markov Decision Process where the cost function is chosen by an adversary after we fix our policy. As a running example, we consider a rob...

H. Brendan McMahan, Geoffrey J. Gordon, Avrim Blum

KDD
2006
ACM

Data Mining

New EM derived from Kullback-Leibler divergence

16 years 5 months ago

Download www.cis.temple.edu

We introduce a new EM framework in which it is possible not only to optimize the model parameters but also the number of model components. A key feature of our approach is that we...

Longin Jan Latecki, Marc Sobel, Rolf Lakämper

AIIA
2007
Springer

Artificial Intelligence

Reinforcement Learning in Complex Environments Through Multiple Adaptive Partitions

15 years 11 months ago

Download sequel.futurs.inria.fr

The application of Reinforcement Learning (RL) algorithms to learn tasks for robots is often limited by the large dimension of the state space, which may make prohibitive its appli...

Andrea Bonarini, Alessandro Lazaric, Marcello Rest...

ATAL
2007
Springer

Intelligent Agents

Reducing the complexity of multiagent reinforcement learning

15 years 11 months ago

Download www.damas.ift.ulaval.ca

It is known that the complexity of the reinforcement learning algorithms, such as Q-learning, may be exponential in the number of environment’s states. It was shown, however, th...

Andriy Burkov, Brahim Chaib-draa

ROBOCUP
2007
Springer

Robotics

Model-Based Reinforcement Learning in a Complex Domain

15 years 10 months ago

Download userweb.cs.utexas.edu

Reinforcement learning is a paradigm under which an agent seeks to improve its policy by making learning updates based on the experiences it gathers through interaction with the en...

Shivaram Kalyanakrishnan, Peter Stone, Yaxin Liu
