Sciweavers

1176 search results - page 82 / 236
» In the News
Sort
View
AINA
2009
IEEE
14 years 6 months ago
CUTER: An Efficient Useful Text Extraction Mechanism
In this paper we present CUTER, a system that processes HTML pages in order to extract the useful text from them. The mechanism is focalized on HTML pages that include news articl...
George Adam, Christos Bouras, Vassilis Poulopoulos
CSL
2004
Springer
13 years 11 months ago
Contemporaneous text as side-information in statistical language modeling
We propose new methods to exploit contemporaneous text, such as on-line news articles, to improve language models for automatic speech recognition and other natural language proce...
Sanjeev Khudanpur, Woosung Kim
VLDB
2002
ACM
132views Database» more  VLDB 2002»
13 years 10 months ago
An Automated System for Web Portal Personalization
This paper proposes a system for personalization of web portals. A speci c implementation is discussed in reference to a web portal containing a news feed service. Techniques are ...
Charu C. Aggarwal, Philip S. Yu
IADIS
2009
13 years 9 months ago
Trash article detection using categorization techniques
We explore techniques for detecting news articles containing invalid information, using the help of text categorization technology. The information that exists on the World Wide W...
Christos Bouras, Vassilis Tsogkas, Vassilis Poulop...
ICIW
2009
IEEE
13 years 9 months ago
Utilizing RSS Feeds for Crawling the Web
We present "advaRSS" crawling mechanism which is created in order to support peRSSonal, a mechanism used to create personalized RSS feeds. In contrast to the common crawl...
George Adam, Christos Bouras, Vassilis Poulopoulos