Learning to Predict Readability using Diverse Linguistic Features

15 years 1 months ago

Download www.cs.utexas.edu

In this paper we consider the problem of building a system to predict readability of natural-language documents. Our system is trained using diverse features based on syntax and language models which are generally indicative of readability. The experimental results on a dataset of documents from a mix of genres show that the predictions of the learned system are more accurate than the predictions of naive human judges when compared against the predictions of linguistically-trained expert human judges. The experiments also compare the performances of different learning algorithms and different types of feature sets when used for predicting readability.

Rohit J. Kate, Xiaoqiang Luo, Siddharth Patwardhan

Real-time Traffic

COLING 2010 | Computational Linguistics | Human Judges | Naive Human Judges | Readability |

claim paper

» Learning a Metric for Code Readability

» Using Machine Learning Techniques to Interpret WHquestions

» Learning Fast Classifiers for Image Spam

» Pointwise Prediction for Robust Adaptable Japanese Morphological Analysis

» Turning Lectures into Comic Books Using Linguistically Salient Gestures

» Predicting Emotion in Spoken Dialogue from Multiple Knowledge Sources

» Enhanced Sentiment Learning Using Twitter Hashtags and Smileys

» A global model for joint lemmatization and partofspeech prediction

Post Info
More Details (n/a)

Added	13 May 2011
Updated	13 May 2011
Type	Journal
Year	2010
Where	COLING
Authors	Rohit J. Kate, Xiaoqiang Luo, Siddharth Patwardhan, Martin Franz, Radu Florian, Raymond J. Mooney, Salim Roukos, Chris Welty

Comments (0)

Sciweavers

Learning to Predict Readability using Diverse Linguistic Features

COLING 2010 | Computational Linguistics | Human Judges | Naive Human Judges | Readability |

Explore & Download

Productivity Tools

Sciweavers