Sciweavers

499 search results - page 26 / 100
» Model Minimization in Markov Decision Processes
ICASSP 2009 (IEEE)
Combining mixture weight pruning and quantization for small-footprint speech recognition
Semi-continuous acoustic models, where the output distributions for all Hidden Markov Model states share a common codebook of Gaussian density functions, are a well-known and prov...
David Huggins-Daines, Alexander I. Rudnicky
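The shared-codebook structure described in the abstract above can be sketched in a few lines. This is an illustrative toy, not the authors' implementation: in a semi-continuous model every HMM state evaluates the same codebook of Gaussian densities and differs only in its mixture weights, so pruning small weights shrinks the per-state footprint. All function and parameter names here are assumptions for illustration.

```python
import numpy as np

def gaussian_logpdf(x, means, variances):
    # Log-density of observation x under each diagonal-covariance
    # Gaussian in the shared codebook (one row per codeword).
    return -0.5 * np.sum(
        np.log(2 * np.pi * variances) + (x - means) ** 2 / variances, axis=1
    )

def state_likelihood(x, weights, means, variances):
    # Semi-continuous output density: every state mixes the SAME
    # codebook densities, differing only in its weight vector.
    log_dens = gaussian_logpdf(x, means, variances)
    return float(weights @ np.exp(log_dens))

def prune_weights(weights, threshold=0.01):
    # Zero out mixture weights below the threshold and renormalize --
    # a simple form of the weight pruning the title refers to.
    pruned = np.where(weights < threshold, 0.0, weights)
    return pruned / pruned.sum()
```

Quantization (the other half of the title) would then store the surviving weights at reduced precision; it is omitted here to keep the sketch small.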

Sparse reward processes
We introduce a class of learning problems where the agent is presented with a series of tasks. Intuitively, if there is a relation among those tasks, then the information gained duri...
Christos Dimitrakakis
ATAL 2007 (Springer)
Interactive dynamic influence diagrams
This paper extends the framework of dynamic influence diagrams (DIDs) to the multi-agent setting. DIDs are computational representations of the Partially Observable Markov Decisio...
Kyle Polich, Piotr J. Gmytrasiewicz
CORR 2010 (Springer)
Optimism in Reinforcement Learning Based on Kullback-Leibler Divergence
We consider model-based reinforcement learning in finite Markov Decision Processes (MDPs), focusing on so-called optimistic strategies. Optimism is usually implemented by carryin...
Sarah Filippi, Olivier Cappé, Aurelien Gari...
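The kind of KL-based optimism this abstract refers to can be illustrated with the simplest case, a Bernoulli KL upper-confidence index: take the largest parameter value whose KL divergence from the empirical estimate stays within a confidence budget. This is a generic KL-UCB-style sketch under assumed names, not the paper's algorithm for MDPs.

```python
import math

def bernoulli_kl(p, q):
    # KL divergence between Bernoulli(p) and Bernoulli(q), with clipping
    # to avoid log(0) at the boundaries.
    eps = 1e-12
    p = min(max(p, eps), 1 - eps)
    q = min(max(q, eps), 1 - eps)
    return p * math.log(p / q) + (1 - p) * math.log((1 - p) / (1 - q))

def kl_ucb_index(p_hat, n, t):
    # Optimistic estimate: largest q in [p_hat, 1] such that
    # n * KL(p_hat, q) <= log(t), found by bisection.
    target = math.log(t) / n
    lo, hi = p_hat, 1.0
    for _ in range(50):
        mid = (lo + hi) / 2.0
        if bernoulli_kl(p_hat, mid) <= target:
            lo = mid
        else:
            hi = mid
    return lo
```

As the sample count n grows, the confidence budget per sample shrinks and the optimistic index contracts toward the empirical estimate.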
AAAI 2010
Representation Discovery in Sequential Decision Making
Automatically constructing novel representations of tasks from analysis of state spaces is a longstanding fundamental challenge in AI. I review recent progress on this problem for...
Sridhar Mahadevan