Sciweavers

44 search results - page 5 / 9
» Creating an Annotated Tamil Corpus as a Discourse Resource
Sort
View
LREC
2008
117views Education» more  LREC 2008»
13 years 8 months ago
Chooser: a Multi-Task Annotation Tool
The paper presents a tool assisting manual annotation of linguistic data developed at the Department of Computational linguistics, IBL-BAS. Chooser is a general-purpose modular ap...
Svetla Koeva, Borislav Rizov, Svetlozara Leseva
LREC
2010
182views Education» more  LREC 2010»
13 years 8 months ago
Wikicorpus: A Word-Sense Disambiguated Multilingual Wikipedia Corpus
This article presents a new freely available trilingual corpus (Catalan, Spanish, English) that contains large portions of the Wikipedia and has been automatically enriched with l...
Samuel Reese, Gemma Boleda, Montse Cuadros, Llu&ia...
MLMI
2005
Springer
14 years 17 days ago
The AMI Meeting Corpus: A Pre-announcement
Abstract. The AMI Meeting Corpus is a multi-modal data set consisting of 100 hours of meeting recordings. It is being created in the context of a project that is developing meeting...
Jean Carletta, Simone Ashby, Sebastien Bourban, Mi...
LREC
2010
143views Education» more  LREC 2010»
13 years 8 months ago
Towards a Large Parallel Corpus of Cleft Constructions
We present our efforts to create a large-scale, semi-automatically annotated parallel corpus of cleft constructions. The corpus is intended to reduce or make more effective the ma...
Gerlof Bouma, Lilja Øvrelid, Jonas Kuhn
CLEAR
2007
Springer
117views Biometrics» more  CLEAR 2007»
14 years 1 months ago
Shared Linguistic Resources for the Meeting Domain
This paper describes efforts by the University of Pennsylvania's Linguistic Data Consortium to create and distribute shared linguistic resources – including data, annotation...
Meghan Lammie Glenn, Stephanie Strassel