Sciweavers

406 search results - page 67 / 82
» Toward a Value Framework for Online Learning Systems
Sort
View
ATAL
2004
Springer
14 years 23 days ago
Unifying Temporal and Structural Credit Assignment Problems
Single-agent reinforcement learners in time-extended domains and multi-agent systems share a common dilemma known as the credit assignment problem. Multi-agent systems have the st...
Adrian K. Agogino, Kagan Tumer
AAAI
2008
13 years 9 months ago
Economic Hierarchical Q-Learning
Hierarchical state decompositions address the curse-ofdimensionality in Q-learning methods for reinforcement learning (RL) but can suffer from suboptimality. In addressing this, w...
Erik G. Schultink, Ruggiero Cavallo, David C. Park...
ICALP
2011
Springer
12 years 10 months ago
New Algorithms for Learning in Presence of Errors
We give new algorithms for a variety of randomly-generated instances of computational problems using a linearization technique that reduces to solving a system of linear equations...
Sanjeev Arora, Rong Ge
EMNLP
2010
13 years 5 months ago
Function-Based Question Classification for General QA
In contrast with the booming increase of internet data, state-of-art QA (question answering) systems, otherwise, concerned data from specific domains or resources such as search e...
Fan Bu, Xingwei Zhu, Yu Hao, Xiaoyan Zhu
KDD
2008
ACM
159views Data Mining» more  KDD 2008»
14 years 7 months ago
Semi-supervised learning with data calibration for long-term time series forecasting
Many time series prediction methods have focused on single step or short term prediction problems due to the inherent difficulty in controlling the propagation of errors from one ...
Haibin Cheng, Pang-Ning Tan