Text mining techniques have been proposed for extracting protein names and their interactions. First, we have made improvements on existing methods for handling single word protein...
Kiho Hong, Junhyung Park, Jihoon Yang, Sungyong Pa...
This paper presents a summarization model based on the Universal Networking Language (UNL), which is a conceptual language for representing texts sentence by sentence, using seman...
Camilla Brandel Martins, Lucia Helena Machado Rino
Foreign name expressions written in Chinese characters are difficult to recognize since the sequence of characters represents the Chinese pronunciation of the name. This paper sug...
Abstract— In this paper we suggest a new approach to represent text document collections, integrating background knowledge to improve clustering effectiveness. Background knowled...
Sumo is a formalism for universal segmentation of text. Its purpose is to provide a framework for the creation of segmentation applications. It is called universal as the formalis...