Search Sciweavers | Sciweavers

1166 search results - page 92 / 234

» Negotiating Using Rewards

175

click to vote

IJCAI
2007

187views Artificial Intelligence» more IJCAI 2007»

Forward Search Value Iteration for POMDPs

15 years 8 months ago

Download ijcai.org

Recent scaling up of POMDP solvers towards realistic applications is largely due to point-based methods which quickly converge to an approximate solution for medium-sized problems...

Guy Shani, Ronen I. Brafman, Solomon Eyal Shimony

claim paper

Read More »

177

click to vote

ECML
2004
Springer

77views Machine Learning» more ECML 2004»

Filtered Reinforcement Learning

16 years 17 days ago

Download eprints.pascal-network.org

Reinforcement learning (RL) algorithms attempt to assign the credit for rewards to the actions that contributed to the reward. Thus far, credit assignment has been done in one of t...

Douglas Aberdeen

claim paper

Read More »

186

click to vote

NSDI
2010

239views Computer Networks» more NSDI 2010»

Contracts: Practical Contribution Incentives for P2P Live Streaming

15 years 8 months ago

Download www.cs.washington.edu

PPLive is a popular P2P video system used daily by millions of people worldwide. Achieving this level of scalability depends on users making contributions to the system, but curre...

Michael Piatek, Arvind Krishnamurthy, Arun Venkata...

claim paper

Read More »

186

click to vote

RAS
2010

131views more RAS 2010»

Probabilistic Policy Reuse for inter-task transfer learning

15 years 5 months ago

Download scalab.uc3m.es

Policy Reuse is a reinforcement learning technique that eﬃciently learns a new policy by using past similar learned policies. The Policy Reuse learner improves its exploration b...

Fernando Fernández, Javier García, M...

claim paper

Read More »

177

click to vote

ICML
2003
IEEE

165views Machine Learning» more ICML 2003»

The Cross Entropy Method for Fast Policy Search

16 years 8 months ago

Download www.hpl.hp.com

We present a learning framework for Markovian decision processes that is based on optimization in the policy space. Instead of using relatively slow gradient-based optimization al...

Shie Mannor, Reuven Y. Rubinstein, Yohai Gat

claim paper

Read More »

« Prev « First page 92 / 234 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers