In this paper we describe a methodology for harvesting information from large distributed repositories (e.g. large Web sites) with minimum user intervention. The methodol...
Fabio Ciravegna, Sam Chapman, Alexiei Dingli, Yori...
In this paper, we present an automated text classification system for biomedical papers. This classification is based on whether there is experimental eviden...
Min Shi, David S. Edwin, Rakesh Menon, Lixiang She...
This paper presents our experiments in question answering for speech corpora. These experiments focus on improving the answer extraction step of the QA process. We present two app...
Typographic and visual information is an integral part of textual documents. Most information extraction systems ignore much of this visual information, processing the text as a l...
We describe a trainable and scalable summarization system which utilizes features derived from information retrieval, information extraction, and NLP techniques and on-line resour...
Chinatsu Aone, Mary Ellen Okurowski, James Gorlins...