Search Sciweavers | Sciweavers


NIPS
2007

146 views | Information Technology

Optimistic Linear Programming gives Logarithmic Regret for Irreducible MDPs

15 years 3 months ago

We present an algorithm called Optimistic Linear Programming (OLP) for learning to optimize average reward in an irreducible but otherwise unknown Markov decision process (MDP). O...

Ambuj Tewari, Peter L. Bartlett


AAAI
2006

127 views | Intelligent Agents

Reinforcement Learning with Human Teachers: Evidence of Feedback and Guidance with Implications for Learning Performance

15 years 3 months ago

Download robotic.media.mit.edu

As robots become a mass consumer product, they will need to learn new skills by interacting with typical human users. Past approaches have adapted reinforcement learning (RL) to a...

Andrea Lockerd Thomaz, Cynthia Breazeal


COLT
2010
Springer

129 views | Machine Learning

Nonparametric Bandits with Covariates

14 years 11 months ago

Download www.princeton.edu

We consider a bandit problem which involves sequential sampling from two populations (arms). Each arm produces a noisy reward realization which depends on an observable random cov...

Philippe Rigollet, Assaf Zeevi


ICSOC
2007
Springer

109 views | Applied Computing

Negotiation of Service Level Agreements: An Architecture and a Search-Based Approach

15 years 8 months ago

Download www.rcost.unisannio.it

Software systems built by composing existing services are more and more capturing the interest of researchers and practitioners. The envisaged long term scenario is that services, ...

Elisabetta Di Nitto, Massimiliano Di Penta, Alessi...


PRICAI
1999
Springer

135 views | Artificial Intelligence

Making Rational Decisions in N-by-N Negotiation Games with a Trusted Third Party

15 years 6 months ago

Download www.csie.cyut.edu.tw

The optimal decision for an agent to make at a given game situation often depends on the decisions that other agents make at the same time. Rational agents will try to find a stabl...

Shih-Hung Wu, Von-Wun Soo
