Sciweavers

33 search results - page 2 / 7
» Corpus Linguistics for Establishing The Natural Language Con...
Sort
View
COLING
2000
13 years 8 months ago
Layout and Language: Integrating Spatial and Linguistic Knowledge for Layout Understanding Tasks
Complex documents stored in a flat or partially marked up file format require layout sensitive preprocessing before any natural language processing can be carried out on their tex...
Matthew Hurst, Tetsuya Nasukawa
LREC
2010
160views Education» more  LREC 2010»
13 years 9 months ago
Corpus and Evaluation Measures for Automatic Plagiarism Detection
The simple access to texts on digital libraries and the WWW has led to an increased number of plagiarism cases in recent years, which renders manual plagiarism detection infeasibl...
Alberto Barrón-Cedeño, Martin Pottha...
WWW
2009
ACM
14 years 8 months ago
Automatically assessing resource quality for educational digital libraries
With the rise of community-generated web content, the need for automatic assessment of resource quality has grown, particularly in the realm of educational digital libraries. We d...
Philipp G. Wetzler, Steven Bethard, Kirsten R. But...
CLEF
2005
Springer
14 years 1 months ago
EuroGOV: Engineering a Multilingual Web Corpus
EuroGOV is a multilingual web corpus that was created to serve as the document collection for WebCLEF, the CLEF 2005 web retrieval task. EuroGOV is a collection of web pages crawl...
Börkur Sigurbjörnsson, Jaap Kamps, Maart...
AND
2009
13 years 5 months ago
Accessing the content of Greek historical documents
In this paper, we propose an alternative method for accessing the content of Greek historical documents printed during the 17th and 18th centuries by searching words directly in d...
Anastasios L. Kesidis, Eleni Galiotou, Basilios Ga...