As web forum has become an enormous collection of highly valuable opinions and commentaries, more and more researchers express strong interests on it. However, most of them pay at...
Abstract. Topic models are a discrete analogue to principle component analysis and independent component analysis that model topic at the word level within a document. They have ma...
We present a topic boundary detection method that searches for connections between sequences of utterances in multi party dialogues. The connections are established based on word ...
In bibliographies like DBLP and Citeseer, there are three kinds of entity-name problems that need to be solved. First, multiple entities share one name, which is called the name sh...
In order to solve problems of reliability of systems based on lexical repetition and problems of adaptability of languagedependent systems, we present a context-based topic segmen...