Sciweavers

35 search results - page 5 / 7
» Document centered approach to text normalization
Sort
View
CICLING
2010
Springer
13 years 12 months ago
Word Length n-Grams for Text Re-use Detection
Abstract. The automatic detection of shared content in written documents –which includes text reuse and its unacknowledged commitment, plagiarism– has become an important probl...
Alberto Barrón-Cedeño, Chiara Basile...
CLEF
2007
Springer
14 years 2 months ago
Using Recognizing Textual Entailment as a Core Engine for Answer Validation
This paper is about our approach to answer validation, which centered by a Recognizing Textual Entailment (RTE) core engine. We first combined the question and the answer into Hypo...
Rui Wang, Günter Neumann
DOCENG
2004
ACM
14 years 1 months ago
Creation of topic map by identifying topic chain in chinese
XML Topic maps enable multiple, concurrent views of sets of information objects and can be used to different applications. For example, thesaurus-like interfaces to corpora, navig...
Ching-Long Yeh, Yi-Chun Chen
ANLP
2000
157views more  ANLP 2000»
13 years 9 months ago
Experiments on Sentence Boundary Detection
This paper explores the problem of identifying sentence boundaries in the transcriptions produced by automatic speech recognition systems. An experiment which determines the level...
Mark Stevenson, Robert J. Gaizauskas
WWW
2004
ACM
14 years 8 months ago
Automatically collecting, monitoring, and mining japanese weblogs
We present a system that tries to automatically collect and monitor Japanese blog collections that include not only ones made with blog softwares but also ones written as normal w...
Tomoyuki Nanno, Toshiaki Fujiki, Yasuhiro Suzuki, ...