This paper addresses the problem of classification in situations where the data distribution is not homogeneous: Data instances might come from different locations or times, and t...
Semantic integration in the hidden Web is an emerging area of research where traditional assumptions do not always hold. Frequent changes, conflicts and the sheer size of the hid...
Abstract. We outline a framework for managing information quality (IQ) in eScience, using ontologies, semantic annotation of resources, and data bindings. Scientists define the qua...
Alun D. Preece, Binling Jin, Edoardo Pignotti, Pao...
We address the problem of identifying the domain of online databases. More precisely, given a set F of Web forms automatically gathered by a focused crawler and an online database...
This paper investigates a machine learning approach for temporally ordering and anchoring events in natural language texts. To address data sparseness, we used temporal reasoning ...
Inderjeet Mani, Marc Verhagen, Ben Wellner, Chong ...