We describe ongoing research on segmenting and labeling HTML medical journal articles. In contrast to existing approaches in which HTML tags usually serve as strong indicators, we...
The use of the computing with words paradigm for the automatic text documents categorization problem is discussed. This specific problem of information retrieval (IR) becomes more...
York University evaluated a prepcessing approach for this year’s enterprise document search task. With different parsing tools, we create two data sets. Based on each data set,...
This paper shows how to use a text retrieval and an image retrieval engine in a cooperative way. The proposed Inter-Media PseudoRelevance Feedback approach shows how the image moda...
Nicolas Maillot, Jean-Pierre Chevallet, Joo-Hwee L...
This paper addresses the challenging problem of similarity search over widely distributed ultra-high dimensional data. Such an application is retrieval of the top-k most similar d...