Sciweavers

2929 search results - page 7 / 586
» Models of English Text
Sort
View
IPM
2008
196views more  IPM 2008»
13 years 7 months ago
Author identification: Using text sampling to handle the class imbalance problem
Authorship analysis of electronic texts assists digital forensics and anti-terror investigation. Author identification can be seen as a single-label multi-class text categorizatio...
Efstathios Stamatatos
IJDAR
2007
106views more  IJDAR 2007»
13 years 7 months ago
Investigation and modeling of the structure of texting language
Language usage over computer mediated discourses, like chats, emails and SMS texts, significantly differs from the standard form of the language. An urge towards shorter message l...
Monojit Choudhury, Rahul Saraf, Vijit Jain, Animes...
ACL
1996
13 years 8 months ago
The Rhythm of Lexical Stress in Prose
\Prose rhythm" is a widely observed but scarcely quanti ed phenomenon. We describe an information-theoretic model for measuring the regularity of lexical stress in English te...
Doug Beeferman
ANLP
1997
78views more  ANLP 1997»
13 years 8 months ago
EasyEnglish: A Tool for Improving Document Quality
We describe the authoring tool, EasyEnglish, which is part of IBM's internal SGML editing environment, Information Development Workbench. EasyEnglish helps writers produce cl...
Arendse Bernth
ACL
2003
13 years 8 months ago
Unsupervised Learning of Arabic Stemming Using a Parallel Corpus
This paper presents an unsupervised learning approach to building a non-English (Arabic) stemmer. The stemming model is based on statistical machine translation and it uses an Eng...
Monica Rogati, J. Scott McCarley, Yiming Yang