Search Sciweavers | Sciweavers

27

JMLR
2012

165views Programming Languages» more JMLR 2012»

PAC-Bayes-Bernstein Inequality for Martingales and its Application to Multiarmed Bandits

11 years 11 months ago

We develop a new tool for data-dependent analysis of the exploration-exploitation trade-oﬀ in learning under limited feedback. Our tool is based on two main ingredients. The ﬁ...

Yevgeny Seldin, Nicolò Cesa-Bianchi, Peter ...

claim paper

Read More »

43

click to vote

AIIA
2007
Springer

147views Artificial Intelligence» more AIIA 2007»

Reinforcement Learning in Complex Environments Through Multiple Adaptive Partitions

14 years 3 months ago

Download sequel.futurs.inria.fr

The application of Reinforcement Learning (RL) algorithms to learn tasks for robots is often limited by the large dimension of the state space, which may make prohibitive its appli...

Andrea Bonarini, Alessandro Lazaric, Marcello Rest...

claim paper

Read More »

33

click to vote

NIPS
1998

164views Information Technology» more NIPS 1998»

Approximate Learning of Dynamic Models

13 years 10 months ago

Download robotics.stanford.edu

Inference is a key component in learning probabilistic models from partially observable data. When learning temporal models, each of the many inference phases requires a complete ...

Xavier Boyen, Daphne Koller

claim paper

Read More »

31

click to vote

AAAI
1994

185views Intelligent Agents» more AAAI 1994»

Learning to Coordinate without Sharing Information

13 years 10 months ago

Download www.agent.ai

Researchers in the eld of Distributed Arti cial Intelligence (DAI) have been developing e cient mechanisms to coordinate the activities of multiple autonomous agents. The need for...

Sandip Sen, Mahendra Sekaran, John Hale

claim paper

Read More »

31

click to vote

ICDM
2010
IEEE

122views Data Mining» more ICDM 2010»

Learning Preferences with Millions of Parameters by Enforcing Sparsity

13 years 6 months ago

Download www.cs.cmu.edu

We study the retrieval task that ranks a set of objects for a given query in the pairwise preference learning framework. Recently researchers found out that raw features (e.g. word...

Xi Chen, Bing Bai, Yanjun Qi, Qihang Lin, Jaime G....

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers