Search Sciweavers | Sciweavers

282 search results - page 33 / 57

» Online Learning of Approximate Dependency Parsing Algorithms

ATAL
2009
Springer

146 views, Intelligent Agents

Transfer via soft homomorphisms

15 years 10 months ago

Download www.eecs.umich.edu

The field of transfer learning aims to speed up learning across multiple related tasks by transferring knowledge between source and target tasks. Past work has shown that when th...

Jonathan Sorg, Satinder Singh


NIPS
2000

127 views, Information Technology

Using Free Energies to Represent Q-values in a Multiagent Reinforcement Learning Task

15 years 5 months ago

Download members.chello.at

The problem of reinforcement learning in large factored Markov decision processes is explored. The Q-value of a state-action pair is approximated by the free energy of a product o...

Brian Sallans, Geoffrey E. Hinton


ICASSP
2011
IEEE

204 views, Signal Processing

Bayesian reinforcement learning for POMDP-based dialogue systems

14 years 7 months ago

Download mirlab.org

Spoken dialogue systems are gaining popularity with improvements in speech recognition technologies. Dialogue systems can be modeled effectively using POMDPs, achieving improvemen...

ShaoWei Png, Joelle Pineau


DAC
2011
ACM

162 views, Computer Architecture

Supervised design space exploration by compositional approximation of Pareto sets

14 years 3 months ago

Download www.cs.columbia.edu

Technology scaling allows the integration of billions of transistors on the same die but CAD tools struggle in keeping up with the increasing design complexity. Design productivit...

Hung-Yi Liu, Ilias Diakonikolas, Michele Petracca,...


AAAI
2007

124 views, Intelligent Agents

COD: Online Temporal Clustering for Outbreak Detection

15 years 6 months ago

Download www.cs.pitt.edu

We present Cluster Onset Detection (COD), a novel algorithm to aid in detection of epidemic outbreaks. COD employs unsupervised learning techniques in an online setting to partiti...

Tomás Singliar, Denver Dash

