Search Sciweavers | Sciweavers

3694 search results - page 147 / 739

» Stochastic complexity in learning

NIPS
1998

164 views, Information Technology

Approximate Learning of Dynamic Models

15 years 4 months ago

Download robotics.stanford.edu

Inference is a key component in learning probabilistic models from partially observable data. When learning temporal models, each of the many inference phases requires a complete ...

Xavier Boyen, Daphne Koller

AAAI
1994

185 views, Intelligent Agents

Learning to Coordinate without Sharing Information

15 years 4 months ago

Download www.agent.ai

Researchers in the field of Distributed Artificial Intelligence (DAI) have been developing efficient mechanisms to coordinate the activities of multiple autonomous agents. The need for...

Sandip Sen, Mahendra Sekaran, John Hale

ICDM
2010
IEEE

122 views, Data Mining

Learning Preferences with Millions of Parameters by Enforcing Sparsity

15 years 27 days ago

Download www.cs.cmu.edu

We study the retrieval task that ranks a set of objects for a given query in the pairwise preference learning framework. Recently researchers found out that raw features (e.g. word...

Xi Chen, Bing Bai, Yanjun Qi, Qihang Lin, Jaime G....

LAMAS
2005
Springer

168 views, Intelligent Agents

Multi-agent Relational Reinforcement Learning

15 years 8 months ago

Download dtai.cs.kuleuven.be

In this paper we report on using a relational state space in multi-agent reinforcement learning. There is growing evidence in the Reinforcement Learning research community that a r...

Tom Croonenborghs, Karl Tuyls, Jan Ramon, Maurice ...

NIPS
2007

164 views, Information Technology

Incremental Natural Actor-Critic Algorithms

15 years 4 months ago

Download books.nips.cc

We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...

Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...
