Search Sciweavers | Sciweavers

246 search results - page 6 / 50

» On Optimal Learning Algorithms for Multiplicity Automata

click to vote

IJCAI
2003

130views Artificial Intelligence» more IJCAI 2003»

Multiple-Goal Reinforcement Learning with Modular Sarsa(0)

13 years 8 months ago

Download www.cc.gatech.edu

We present a new algorithm, GM-Sarsa(0), for ﬁnding approximate solutions to multiple-goal reinforcement learning problems that are modeled as composite Markov decision processe...

Nathan Sprague, Dana H. Ballard

claim paper

Read More »

click to vote

AAAI
1996

118views Intelligent Agents» more AAAI 1996»

Learning Models of Intelligent Agents

13 years 8 months ago

Download www.cs.technion.ac.il

Agents that operate in a multi-agent system need an efficient strategy to handle their encounters with other agents involved. Searching for an optimal interactive strategy is a ha...

David Carmel, Shaul Markovitch

claim paper

Read More »

click to vote

GECCO
2006
Springer

159views Optimization» more GECCO 2006»

Identification of weak motifs in multiple biological sequences using genetic algorithm

13 years 11 months ago

Download www.cs.york.ac.uk

Recognition of motifs in multiple unaligned sequences provides an insight into protein structure and function. The task of discovering these motifs is very challenging because mos...

Topon Kumar Paul, Hitoshi Iba

claim paper

Read More »

click to vote

ICML
2009
IEEE

160views Machine Learning» more ICML 2009»

A convex formulation for learning shared structures from multiple tasks

14 years 8 months ago

Download www.public.asu.edu

Multi-task learning (MTL) aims to improve generalization performance by learning multiple related tasks simultaneously. In this paper, we consider the problem of learning shared s...

Jianhui Chen, Lei Tang, Jun Liu, Jieping Ye

claim paper

Read More »

click to vote

COLT
2010
Springer

217views Machine Learning» more COLT 2010»

Optimal Algorithms for Online Convex Optimization with Multi-Point Bandit Feedback

13 years 5 months ago

Download www.eecs.berkeley.edu

Bandit convex optimization is a special case of online convex optimization with partial information. In this setting, a player attempts to minimize a sequence of adversarially gen...

Alekh Agarwal, Ofer Dekel, Lin Xiao

claim paper

Read More »

« Prev « First page 6 / 50 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers