This paper presents the design and implementation of a document image retrieval system to support reading mokkans. A mokkan is a wooden tablet with text written with a brush in India ink. D...
Akihito Kitadai, Jun Takakura, Masatoshi Ishikawa,...
The re-use of spoken word audio collections maintained by audiovisual archives is severely hindered by their generally limited access. The CHoral project, which is part of the CAT...
Willemijn Heeren, Franciska de Jong, Laurens van d...
Retrieving documents by subject matter is the general goal of information retrieval and other content access systems. There are other aspects of textual content, however, which fo...
This paper describes the development of a structured document collection containing user-generated text and numerical metadata for exploring the use of metadata in inform...
Walid Magdy, Jinming Min, Johannes Leveling, Garet...
This paper presents work done at Cambridge University for the TREC-9 Spoken Document Retrieval (SDR) track. The CUHTK transcriptions from TREC-8 with Word Error Rate (WER) of 20.5...
Sue E. Johnson, P. Jourlin, Karen Sparck Jones, Ph...