Sciweavers

210 search results - page 22 / 42
» Automatic web query classification using labeled and unlabel...
UAI
2008
15 years 5 months ago
Multi-View Learning over Structured and Non-Identical Outputs
In many machine learning problems, labeled training data is limited but unlabeled data is ample. Some of these problems have instances that can be factored into multiple views, ea...
Kuzman Ganchev, João Graça, John Bli...
IAT
2007
IEEE
15 years 10 months ago
An Intelligent Web Agent to Mine Bilingual Parallel Pages via Automatic Discovery of URL Pairing Patterns
This paper describes an intelligent agent to facilitate bitext mining from the Web via automatic discovery of URL pairing patterns (or keys) for retrieving parallel web pages. The...
Chunyu Kit, Jessica Yee Ha Ng
WWW
2006
ACM
16 years 4 months ago
Large-scale text categorization by batch mode active learning
Large-scale text categorization is an important research topic for Web data mining. One of the challenges in large-scale text categorization is how to reduce the amount of human e...
Steven C. H. Hoi, Rong Jin, Michael R. Lyu
ICML
2004
ACM
16 years 4 months ago
Semi-supervised learning using randomized mincuts
In many application domains there is a large amount of unlabeled data but only a very limited amount of labeled training data. One general approach that has been explored for util...
Avrim Blum, John D. Lafferty, Mugizi Robert Rweban...
AI
2007
Springer
15 years 9 months ago
Learning the Semantic Meaning of a Concept from the Web
Many researchers have used text classification methods to solve the ontology mapping problem. Their mapping results heavily depend on the availability of quality exemplars used as...
Yang Yu, Yun Peng