Search Sciweavers | Sciweavers

108

RAS
2010

131views more RAS 2010»

Probabilistic Policy Reuse for inter-task transfer learning

15 years 24 days ago

Policy Reuse is a reinforcement learning technique that eﬃciently learns a new policy by using past similar learned policies. The Policy Reuse learner improves its exploration b...

Fernando Fernández, Javier García, M...

claim paper

Read More »

121

click to vote

CORR
2004
Springer

103views Education» more CORR 2004»

Online convex optimization in the bandit setting: gradient descent without a gradient

15 years 2 months ago

Download www.cs.cmu.edu

We study a general online convex optimization problem. We have a convex set S and an unknown sequence of cost functions c1, c2, . . . , and in each period, we choose a feasible po...

Abraham Flaxman, Adam Tauman Kalai, H. Brendan McM...

claim paper

Read More »

129

Voted

QEST
2005
IEEE

82views Modeling and Simulation» more QEST 2005»

Toward Picture-perfect Streaming on the Internet

15 years 8 months ago

Download www.cse.cuhk.edu.hk

Quality of service (QoS) in streaming of continuous media over the Internet is poor, which is partly due to variations in delays, bandwidth limitations, and packet losses. Althoug...

Alix L. H. Chow, Leana Golubchik, John C. S. Lui

claim paper

Read More »

121

Voted

IJCAI
2007

271views Artificial Intelligence» more IJCAI 2007»

Adaptive Genetic Algorithm with Mutation and Crossover Matrices

15 years 3 months ago

Download repository.ust.hk

A matrix formulation for an adaptive genetic algorithm is developed using mutation matrix and crossover matrix. Selection, mutation, and crossover are all parameter-free in the se...

Nga Lam Law, Kwok Yip Szeto

claim paper

Read More »

103

click to vote

MEDINFO
2007

87views Healthcare» more MEDINFO 2007»

Text Categorization Models for Identifying Unproven Cancer Treatments on the Web

15 years 3 months ago

Download www.hon.ch

The nature of the internet as a non-peer-reviewed (and more generally largely unregulated) publication medium has allowed wide-spread promotion of inaccurate and unproven medical ...

Yin Aphinyanaphongs, Constantin F. Aliferis

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers