Because of the increasing number of electronic data, designing efficient tools to retrieve and exploit documents is a major challenge. Current search engines suffer from two main d...
Sylvie Ranwez, Vincent Ranwez, Mohameth-Fran&ccedi...
In Africa, there are a number of languages with their own indigenous scripts. This paper presents an OCR for Amharic scripts. Amharic is the official and working language of Ethio...
Research in natural language generation promises significant advances in the ways in which we can make available the contents of underlying information sources. Most work in the f...
Robert Dale, Stephen J. Green, Maria Milosavljevic...
We show that the e-commerce domain can provide all the right ingredients for successful data mining. We describe an integrated architecture for supporting this integration. The ar...
Suhail Ansari, Ron Kohavi, Llew Mason, Zijian Zhen...
We propose an approach to restore severely degraded
document images using a probabilistic context model. Un-
like traditional approaches that use previously learned
prior models...
Jyotirmoy Banerjee, Anoop M. Namboodiri, C. V. Jaw...