Sciweavers

76 search results - page 4 / 16
» Pseudo-Aligned Multilingual Corpora
Sort
View
LREC
2010
189views Education» more  LREC 2010»
13 years 6 months ago
NLGbAse: A Free Linguistic Resource for Natural Language Processing Systems
Availability of labeled language resources, such as annotated corpora and domain dependent labeled language resources is crucial for experiments in the field of Natural Language ...
Eric Charton, Juan Manuel Torres Moreno
CLEF
2011
Springer
12 years 7 months ago
A Language-Independent Approach to Identify the Named Entities in Under-Resourced Languages and Clustering Multilingual Document
Abstract. This paper presents a language-independent Multilingual Document Clustering (MDC) approach on comparable corpora. Named entites (NEs) such as persons, locations, organiza...
N. Kiran Kumar, G. S. K. Santosh, Vasudeva Varma
ACL
2012
11 years 10 months ago
ACCURAT Toolkit for Multi-Level Alignment and Information Extraction from Comparable Corpora
The lack of parallel corpora and linguistic resources for many languages and domains is one of the major obstacles for the further advancement of automated translation. A possible...
Marcis Pinnis, Radu Ion, Dan Stefanescu, Fangzhong...
IJCNLP
2005
Springer
14 years 1 months ago
Inversion Transduction Grammar Constraints for Mining Parallel Sentences from Quasi-Comparable Corpora
Abstract. We present a new implication of Wu’s (1997) Inversion Transduction Grammar (ITG) Hypothesis, on the problem of retrieving truly parallel sentence translations from larg...
Dekai Wu, Pascale Fung
ACL
2011
12 years 11 months ago
Joint Bilingual Sentiment Classification with Unlabeled Parallel Corpora
Most previous work on multilingual sentiment analysis has focused on methods to adapt sentiment resources from resource-rich languages to resource-poor languages. We present a nov...
Bin Lu, Chenhao Tan, Claire Cardie, Benjamin K. Ts...