Sciweavers

1579 search results - page 214 / 316
» Information Retrieval Using Robust Natural Language Processi...
Sort
View
PRIS
2010
13 years 6 months ago
The Impact of Pre-processing on the Classification of MEDLINE Documents
The amount of information available in the MEDLINE database makes it very hard for a researcher to retrieve a reasonable amount of relevant documents using a simple query language ...
Carlos Adriano Gonçalves, Célia Talm...
SOCIALCOM
2010
13 years 6 months ago
Using Text Analysis to Understand the Structure and Dynamics of the World Wide Web as a Multi-Relational Graph
A representation of the World Wide Web as a directed graph, with vertices representing web pages and edges representing hypertext links, underpins the algorithms used by web search...
Harish Sethu, Alexander Yates
ICCPOL
2009
Springer
14 years 1 months ago
Constructing Parallel Corpus from Movie Subtitles
Abstract. This paper describes a methodology for constructing aligned German-Chinese corpora from movie subtitles. The corpora will be used to train a special machine translation s...
Han Xiao, Xiaojie Wang
WWW
2004
ACM
14 years 9 months ago
Web page summarization using dynamic content
Summarizing web pages have recently gained much attention from researchers. Until now two main types of approaches have been proposed for this task: content- and context-based met...
Adam Jatowt
LREC
2008
165views Education» more  LREC 2008»
13 years 10 months ago
Design and Data Collection for Spoken Polish Dialogs Database
Spoken corpora provide a critical resource for research, development and evaluation of spoken dialog systems. This paper describes the telephone spoken dialog corpus for Polish cr...
Krzysztof Marasek, Ryszard Gubrynowicz