Search Sciweavers | Sciweavers

236 search results - page 31 / 48

» Confidence-based policy learning from demonstration using Ga...

147

click to vote

JMLR
2010

154views more JMLR 2010»

Infinite Predictor Subspace Models for Multitask Learning

15 years 19 days ago

Download jmlr.csail.mit.edu

Given several related learning tasks, we propose a nonparametric Bayesian model that captures task relatedness by assuming that the task parameters (i.e., predictors) share a late...

Piyush Rai, Hal Daumé III

claim paper

Read More »

173

click to vote

KDD
2004
ACM

181views Data Mining» more KDD 2004»

Column-generation boosting methods for mixture of kernels

16 years 6 months ago

Download stat.rutgers.edu

We devise a boosting approach to classification and regression based on column generation using a mixture of kernels. Traditional kernel methods construct models based on a single...

Jinbo Bi, Tong Zhang, Kristin P. Bennett

claim paper

Read More »

140

click to vote

ICML
2008
IEEE

135views Machine Learning» more ICML 2008»

Reinforcement learning with limited reinforcement: using Bayes risk for active learning in POMDPs

16 years 6 months ago

Download mapleleaf.csail.mit.edu

Partially Observable Markov Decision Processes (POMDPs) have succeeded in planning domains that require balancing actions that increase an agent's knowledge and actions that ...

Finale Doshi, Joelle Pineau, Nicholas Roy

claim paper

Read More »

181

click to vote

UAI
2000

127views Artificial Intelligence» more UAI 2000»

Utilities as Random Variables: Density Estimation and Structure Discovery

15 years 7 months ago

Download ai.stanford.edu

Decision theory does not traditionally include uncertainty over utility functions. We argue that the a person's utility value for a given outcome can be treated as we treat o...

Urszula Chajewska, Daphne Koller

claim paper

Read More »

171

click to vote

ICRA
2010
IEEE

133views Robotics» more ICRA 2010»

Generalized model learning for Reinforcement Learning on a humanoid robot

15 years 4 months ago

Download www.cs.utexas.edu

— Reinforcement learning (RL) algorithms have long been promising methods for enabling an autonomous robot to improve its behavior on sequential decision-making tasks. The obviou...

Todd Hester, Michael Quinlan, Peter Stone

claim paper

Read More »

« Prev « First page 31 / 48 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers