Search Sciweavers | Sciweavers

77 search results - page 9 / 16

» Learning While Optimizing an Unknown Fitness Surface

click to vote

ICML
2007
IEEE

200views Machine Learning» more ICML 2007»

Multi-task reinforcement learning: a hierarchical Bayesian approach

14 years 8 months ago

Download www.machinelearning.org

We consider the problem of multi-task reinforcement learning, where the agent needs to solve a sequence of Markov Decision Processes (MDPs) chosen randomly from a fixed but unknow...

Aaron Wilson, Alan Fern, Soumya Ray, Prasad Tadepa...

claim paper

Read More »

click to vote

AAAI
2010

214views Intelligent Agents» more AAAI 2010»

Multi-Instance Dimensionality Reduction

13 years 9 months ago

Download cs.nju.edu.cn

Multi-instance learning deals with problems that treat bags of instances as training examples. In single-instance learning problems, dimensionality reduction is an essential step ...

Yu-Yin Sun, Michael K. Ng, Zhi-Hua Zhou

claim paper

Read More »

click to vote

INFOCOM
2010
IEEE

207views Communications» more INFOCOM 2010»

Opportunistic Spectrum Access with Multiple Users: Learning under Competition

13 years 6 months ago

Download www.mit.edu

Abstract—The problem of cooperative allocation among multiple secondary users to maximize cognitive system throughput is considered. The channel availability statistics are initi...

Animashree Anandkumar, Nithin Michael, Ao Tang

claim paper

Read More »

click to vote

ICML
2006
IEEE

90views Machine Learning» more ICML 2006»

Learning algorithms for online principal-agent problems (and selling goods online)

14 years 8 months ago

Download www.cs.duke.edu

In a principal-agent problem, a principal seeks to motivate an agent to take a certain action beneficial to the principal, while spending as little as possible on the reward. This...

Vincent Conitzer, Nikesh Garera

claim paper

Read More »

click to vote

CORR
2008
Springer

189views Education» more CORR 2008»

Algorithms for Dynamic Spectrum Access with Learning for Cognitive Radio

13 years 7 months ago

Download www.ifp.illinois.edu

We study the problem of dynamic spectrum sensing and access in cognitive radio systems as a partially observed Markov decision process (POMDP). A group of cognitive users cooperati...

Jayakrishnan Unnikrishnan, Venugopal V. Veeravalli

claim paper

Read More »

« Prev « First page 9 / 16 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers