Sciweavers

5580 search results - page 977 / 1116
» Randomized priority algorithms
Sort
View
ATAL
2006
Springer
14 years 29 days ago
Probabilistic policy reuse in a reinforcement learning agent
We contribute Policy Reuse as a technique to improve a reinforcement learning agent with guidance from past learned similar policies. Our method relies on using the past policies ...
Fernando Fernández, Manuela M. Veloso
ATAL
2006
Springer
14 years 29 days ago
Learning a common language through an emergent interaction topology
We study the effects of various emergent topologies of interaction on the rate of language convergence in a population of communicating agents. The agents generate, parse, and lea...
Samarth Swarup, Kiran Lakkaraju, Les Gasser
CHES
2006
Springer
134views Cryptology» more  CHES 2006»
14 years 29 days ago
Read-Proof Hardware from Protective Coatings
In cryptography it is assumed that adversaries only have black box access to the secret keys of honest parties. In real life, however, the black box approach is not sufficient beca...
Pim Tuyls, Geert Jan Schrijen, Boris Skoric, Jan v...
CIVR
2006
Springer
219views Image Analysis» more  CIVR 2006»
14 years 29 days ago
Bayesian Learning of Hierarchical Multinomial Mixture Models of Concepts for Automatic Image Annotation
We propose a novel Bayesian learning framework of hierarchical mixture model by incorporating prior hierarchical knowledge into concept representations of multi-level concept struc...
Rui Shi, Tat-Seng Chua, Chin-Hui Lee, Sheng Gao
FOCS
2004
IEEE
14 years 29 days ago
Stochastic Optimization is (Almost) as easy as Deterministic Optimization
Stochastic optimization problems attempt to model uncertainty in the data by assuming that (part of) the input is specified in terms of a probability distribution. We consider the...
David B. Shmoys, Chaitanya Swamy