Text documents often contain valuable structured data that is hidden in regular English sentences. This data is best exploited if available as a relational table that we could use...
The goal of this work is to improve the accuracy (precision and recall) and communication effectiveness of a database system response to a user information request, by utilizing a...
The effects of word recognition errors (WRE) in Spoken Document Retrieval have been well studied and well reported in recent Information Retrieval (IR) literature. Much less exper...
We introduce a simple and efficient method for clustering and identifying temporal trends in hyper-linked document databases. Our method can scale to large datasets because it ex...
Alexandrin Popescul, Gary William Flake, Steve Law...
1 Electronic book is an application with a multimedia database of instructional resources, which include hyperlinked text, instructor’s audio/video clips, slides, animation, stil...
We address the problem of tight XML schemas and propose regular tree automata to model XML data. We show that the tree automata model is more powerful that the XML DTDs and is clo...
A mediator system is a kind of a meta-search engine that provides a seamlessly integrated search service for diverse search engines (collections). Since collections of a mediator ...
Information retrieval (IR) research has been very active over the last decades to develop approaches that allow machine indexing to significantly improve indexing practice in lib...