Sciweavers

1799 search results - page 233 / 360
» Filtered Reinforcement Learning
Sort
View
144
Voted
ACSE
2000
ACM
15 years 7 months ago
The information environments program - a new design based IT degree
The University of Queensland has recently established a new design-focused, studio-based IT degree at a new “flexible-learning” campus. The Bachelor of Information Environment...
Michael Docherty, Peter Sutton, Margot Brereton, S...
126
Voted
ICCS
1993
Springer
15 years 6 months ago
Towards Domain-Independent Machine Intelligence
Adaptive predictive search (APS), is a learning system framework, which given little initial domain knowledge, increases its decision-making abilities in complex problems domains....
Robert Levinson
138
Voted
NIPS
2008
15 years 4 months ago
Signal-to-Noise Ratio Analysis of Policy Gradient Algorithms
Policy gradient (PG) reinforcement learning algorithms have strong (local) convergence guarantees, but their learning performance is typically limited by a large variance in the e...
John W. Roberts, Russ Tedrake
TREC
2001
15 years 4 months ago
The Bias Problem and Language Models in Adaptive Filtering
We used the YFILTER filtering system for experiments on updating profiles and setting thresholds. We developed a new method of using language models for updating profiles that is ...
Yi Zhang 0001, James P. Callan
CORR
2004
Springer
122views Education» more  CORR 2004»
15 years 2 months ago
"In vivo" spam filtering: A challenge problem for data mining
Spam, also known as Unsolicited Commercial Email (UCE), is the bane of email communication. Many data mining researchers have addressed the problem of detecting spam, generally by...
Tom Fawcett