Search Sciweavers | Sciweavers

34 search results - page 2 / 7

» Mining the Web to Create Minority Language Corpora

SIGIR
2004
ACM

131 views · Information Technology

Translating unknown queries with web corpora for cross-language information retrieval

14 years 27 days ago

Download www.iis.sinica.edu.tw

It is crucial for cross-language information retrieval (CLIR) systems to deal with the translation of unknown queries, because real queries might be short. The purpose of this...

Pu-Jen Cheng, Jei-Wen Teng, Ruey-Cheng Chen, Jenq-...

EACL
2006
ACL Anthology

143 views · Natural Language Processing

Web Text Corpus for Natural Language Processing

13 years 9 months ago

Download www.cs.usyd.edu.au

Web text has been successfully used as training data for many NLP applications. While most previous work accesses web text through search engine hit counts, we created a Web Corpu...

Vinci Liu, James R. Curran

AMTA
1998
Springer

103 views · Information Technology

Parallel Strands: A Preliminary Investigation into Mining the Web for Bilingual Text

13 years 11 months ago

Download www.lib.umd.edu

Abstract. Parallel corpora are a valuable resource for machine translation, but at present their availability and utility is limited by genre- and domain-specificity, licensing restri...

Philip Resnik

LREC
2008

108 views · Education

A Lightweight and Efficient Tool for Cleaning Web Pages

13 years 9 months ago

Download www.lrec-conf.org

Originally conceived as a "naive" baseline experiment using traditional n-gram language models as classifiers, the NCLEANER system has turned out to be a fast and lightw...

Stefan Evert

OTM
2005
Springer

150 views · Internet Technology

Creating Ontologies for Content Representation-The OntoSeed Suite

14 years 29 days ago

Download www.ling.uni-potsdam.de

Abstract. Due to the inherent difficulties associated with manual ontology building, knowledge acquisition and reuse are often seen as methods that can make this tedious process ea...

Elena Paslaru Bontas, David Schlangen, Thomas Schr...
