Sciweavers

1167 search results - page 120 / 234
» Relational Markov Games
IJCAI
2001
13 years 11 months ago
Symbolic Dynamic Programming for First-Order MDPs
We present a dynamic programming approach for the solution of first-order Markov decision processes. This technique uses an MDP whose dynamics are represented in a variant of the ...
Craig Boutilier, Raymond Reiter, Bob Price
CSDA
2007
13 years 10 months ago
Some extensions of score matching
Many probabilistic models are only defined up to a normalization constant. This makes maximum likelihood estimation of the model parameters very difficult. Typically, one then h...
Aapo Hyvärinen
ICASSP
2011
IEEE
13 years 1 month ago
Large vocabulary continuous speech recognition with context-dependent DBN-HMMs
The context-independent deep belief network (DBN) hidden Markov model (HMM) hybrid architecture has recently achieved promising results for phone recognition. In this work, we pro...
George E. Dahl, Dong Yu, Li Deng, Alex Acero
PR
2011
13 years 28 days ago
Generalized darting Monte Carlo
One of the main shortcomings of Markov chain Monte Carlo samplers is their inability to mix between modes of the target distribution. In this paper we show that advance knowledge ...
Cristian Sminchisescu, Max Welling
ATAL
2005
Springer
14 years 3 months ago
Theory of moves learners: towards non-myopic equilibria
In contrast to classical game-theoretic analysis of simultaneous and sequential play in bimatrix games, Steven Brams has proposed an alternative framework called the Theory of Mov...
Arjita Ghosh, Sandip Sen