Search Sciweavers | Sciweavers

142

ICML
2010
IEEE

167views Machine Learning» more ICML 2010»

Internal Rewards Mitigate Agent Boundedness

15 years 8 months ago

Abstract--Reinforcement learning (RL) research typically develops algorithms for helping an RL agent best achieve its goals-however they came to be defined--while ignoring the rela...

Jonathan Sorg, Satinder P. Singh, Richard Lewis

claim paper

Read More »

204

click to vote

ICML
2010
IEEE

200views Machine Learning» more ICML 2010»

Generalizing Apprenticeship Learning across Hypothesis Classes

15 years 8 months ago

Download paul.rutgers.edu

This paper develops a generalized apprenticeship learning protocol for reinforcementlearning agents with access to a teacher who provides policy traces (transition and reward obse...

Thomas J. Walsh, Kaushik Subramanian, Michael L. L...

claim paper

Read More »

221

Voted

ICML
2010
IEEE

279views Machine Learning» more ICML 2010»

Learning Temporal Causal Graphs for Relational Time-Series Analysis

15 years 8 months ago

Download www.icml2010.org

Learning temporal causal graph structures from multivariate time-series data reveals important dependency relationships between current observations and histories, and provides a ...

Yan Liu 0002, Alexandru Niculescu-Mizil, Aurelie C...

claim paper

Read More »

169

click to vote

ICML
2010
IEEE

240views Machine Learning» more ICML 2010»

Cognitive Models of Test-Item Effects in Human Category Learning

15 years 8 months ago

Download pages.cs.wisc.edu

Imagine two identical people receive exactly the same training on how to classify certain objects. Perhaps surprisingly, we show that one can then manipulate them into classifying...

Xiaojin Zhu, Bryan R. Gibson, Kwang-Sung Jun, Timo...

claim paper

Read More »

203

Voted

ICML
2010
IEEE

233views Machine Learning» more ICML 2010»

Hilbert Space Embeddings of Hidden Markov Models

15 years 8 months ago

Download www.icml2010.org

Hidden Markov Models (HMMs) are important tools for modeling sequence data. However, they are restricted to discrete latent states, and are largely restricted to Gaussian and disc...

Le Song, Sajid M. Siddiqi, Geoffrey J. Gordon, Ale...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers