Sciweavers

501 search results - page 3 / 101
» Improving Language Models by Clustering Training Sentences
Sort
View
ACL
2010
13 years 5 months ago
A Hybrid Hierarchical Model for Multi-Document Summarization
Scoring sentences in documents given abstract summaries created by humans is important in extractive multi-document summarization. In this paper, we formulate extractive summariza...
Asli Çelikyilmaz, Dilek Hakkani-Tur
ICML
2008
IEEE
14 years 7 months ago
A unified architecture for natural language processing: deep neural networks with multitask learning
We describe a single convolutional neural network architecture that, given a sentence, outputs a host of language processing predictions: part-of-speech tags, chunks, named entity...
Ronan Collobert, Jason Weston
ACL
2008
13 years 8 months ago
Mining Wikipedia Revision Histories for Improving Sentence Compression
A well-recognized limitation of research on supervised sentence compression is the dearth of available training data. We propose a new and bountiful resource for such training dat...
Elif Yamangil, Rani Nelken
CSL
2006
Springer
13 years 7 months ago
A study in machine learning from imbalanced data for sentence boundary detection in speech
Enriching speech recognition output with sentence boundaries improves its human readability and enables further processing by downstream language processing modules. We have const...
Yang Liu, Nitesh V. Chawla, Mary P. Harper, Elizab...
EMNLP
2008
13 years 8 months ago
Improved Sentence Alignment on Parallel Web Pages Using a Stochastic Tree Alignment Model
Parallel web pages are important source of training data for statistical machine translation. In this paper, we present a new approach to sentence alignment on parallel web pages....
Lei Shi, Ming Zhou