Sciweavers

ACSC
2009
IEEE

A ConceptLink Graph for Text Structure Mining

14 years 7 months ago
A ConceptLink Graph for Text Structure Mining
Most text mining methods are based on representing documents using a vector space model, commonly known as a bag of word model, where each document is modeled as a linear vector representing the occurrence of independent words in the text corpus. It is well known that using this vector-based representation, important information, such as semantic relationship among concepts, is lost. This paper proposes a novel text representation model called ConceptLink graph. The ConceptLink graph does not only represent the content of the document, but also captures some of its underlying semantic structure in terms of the relationships among concepts. The ConceptLink graph is constructed in two main stages. First, we find a set of concepts by clustering conceptually related terms using the self-organizing map method. Secondly, by mapping each document’s content to concept, we generate a graph of concepts based on the occurrences of concepts using a singular value decomposition technique. The C...
Rowena Chau, Ah Chung Tsoi, Markus Hagenbuchner, V
Added 18 May 2010
Updated 18 May 2010
Type Conference
Year 2009
Where ACSC
Authors Rowena Chau, Ah Chung Tsoi, Markus Hagenbuchner, Vincent Lee
Comments (0)