Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

189

CICLING
2005
Springer

107views Natural Language Processing» more CICLING 2005»

Automatic Annotation of Corpora for Text Summarisation: A Comparative Study

16 years 5 days ago

Automatic Annotation of Corpora for Text Summarisation: A Comparative Study

Download clg.wlv.ac.uk

This paper presents two methods which automatically produce annotated corpora for text summarisation on the basis of human abstracts. Both methods identify a set of sentences from the document which conveys the information in the human produced best. The ﬁrst method relies on a greedy algorithm, whilst the second one uses a genetic algorithm. The methods allow to specify the number of sentences to be annotated, which constitutes an advantage over the existing methods. Comparison between the two approaches investigated here revealed that the genetic algorithm is appropriate in cases where the number of sentences to be annotated is less than the number of sentences in an ideal gold standard with no length restrictions, whereas the greedy algorithm should be used in other cases.

Constantin Orasan

Real-time Traffic

CICLING 2005 | Genetic Algorithm | Greedy Algorithm | Natural Language Processing | ﬁrst Method Relies |

claim paper

Related Content

» An Automatic Close Copy Speech Synthesis Tool for LargeScale Speech Corpus Evaluation

» Comparative analysis of five proteinprotein interaction corpora

» Highlevel Features for Learning Subjective Language across Domains

» The DAD Parallel Corpora and their Uses

» Automatic phonetic transcription of large speech corpora

» Unsupervised Translation Induction for Chinese Abbreviations using Monolingual Corpora

» A Semantically Annotated Swedish Medical Corpus

» Text Categorization for Improved Priors of Word Meaning

» A probabilistic topicconnection model for automatic image annotation

Post Info
More Details (n/a)

Added	26 Jun 2010
Updated	26 Jun 2010
Type	Conference
Year	2005
Where	CICLING
Authors	Constantin Orasan

Comments (0)