Sciweavers

252 search results - page 32 / 51
» Learning Partially Observable Action Models: Efficient Algor...
Sort
View
135
Voted
INFOCOM
2012
IEEE
13 years 5 months ago
Approximately optimal adaptive learning in opportunistic spectrum access
—In this paper we develop an adaptive learning algorithm which is approximately optimal for an opportunistic spectrum access (OSA) problem with polynomial complexity. In this OSA...
Cem Tekin, Mingyan Liu
134
Voted
ICML
2010
IEEE
15 years 16 days ago
Constructing States for Reinforcement Learning
POMDPs are the models of choice for reinforcement learning (RL) tasks where the environment cannot be observed directly. In many applications we need to learn the POMDP structure ...
M. M. Hassan Mahmud
179
Voted
WWW
2010
ACM
15 years 9 months ago
Factorizing personalized Markov chains for next-basket recommendation
Recommender systems are an important component of many websites. Two of the most popular approaches are based on matrix factorization (MF) and Markov chains (MC). MF methods learn...
Steffen Rendle, Christoph Freudenthaler, Lars Schm...
109
Voted
FLAIRS
2006
15 years 4 months ago
Managing Student Emotions in Intelligent Tutoring Systems
1 In the classic educational context, observing and identifying learner's emotional response allow the teacher to adapt the lesson, with the aim of improving the quality of th...
Roger Nkambou
118
Voted
ATAL
2007
Springer
15 years 8 months ago
Real-time agent characterization and prediction
Reasoning about agents that we observe in the world is challenging. Our available information is often limited to observations of the agent’s external behavior in the past and p...
H. Van Dyke Parunak, Sven Brueckner, Robert S. Mat...