This paper proposes a novel framework for the automatic text categorization problem based on the kernel density classifier. The overall goal is to tackle two main issues in automatic ...
Dwi Sianto Mansjur, Ted S. Wada, Biing-Hwang Juang
Automatic discovery of semantic relations between resources is a key issue in Web-based intelligent applications such as document understanding and Web services. This paper explore...
XML is becoming a prevalent format for data exchange. Many XML documents have complex schemas that are not always known, and can vary widely between information sources and applica...
Eugene Agichtein, C. T. Howard Ho, Vanja Josifovsk...
E-Discovery is the process of discovering electronically stored information such as email that is relevant to a legal case. A typical e-discovery process incurs huge costs due to t...
Prasad M. Deshpande, Thomas Hampp, Manjula Hosurma...
This paper introduces the Book Structure Extraction competition run at ICDAR 2009. The goal of the competition is to evaluate and compare automatic techniques for deriving structu...
Antoine Doucet, Gabriella Kazai, Bodin Dresevic, A...