Sciweavers

2827 search results - page 431 / 566
» Marking Text Documents
Sort
View
INEX
2007
Springer
14 years 4 months ago
Phrase Detection in the Wikipedia
The Wikipedia XML collection turned out to be rich of marked-up phrases as we carried out our INEX 2007 experiments. Assuming that a phrase occurs at the inline level of the markup...
Miro Lehtonen, Antoine Doucet
SIGIR
2006
ACM
14 years 4 months ago
Distributed query sampling: a quality-conscious approach
We present an adaptive distributed query-sampling framework that is quality-conscious for extracting high-quality text database samples. The framework divides the query-based samp...
James Caverlee, Ling Liu, Joonsoo Bae
SOFTVIS
2005
ACM
14 years 4 months ago
Towards understanding programs through wear-based filtering
Large software projects often require a programmer to make changes to unfamiliar source code. This paper presents the results of a formative observational study of seven professio...
Robert DeLine, Amir Khella, Mary Czerwinski, Georg...
CIKM
2005
Springer
14 years 4 months ago
A hybrid approach to NER by MEMM and manual rules
This paper describes a framework for defining domain specific Feature Functions in a user friendly form to be used in a Maximum Entropy Markov Model (MEMM) for the Named Entity Re...
Moshe Fresko, Binyamin Rosenfeld, Ronen Feldman
CIKM
2005
Springer
14 years 4 months ago
Similarity measures for tracking information flow
Text similarity spans a spectrum, with broad topical similarity near one extreme and document identity at the other. Intermediate levels of similarity – resulting from summariza...
Donald Metzler, Yaniv Bernstein, W. Bruce Croft, A...