On the Statistical Consistency of DOP Estimators

15 years 3 months ago

Download www.cnts.ua.ac.be

A statistical estimator attempts to guess an unknown probability distribution by analyzing a sample from this distribution. One desirable property of an estimator is that its guess is increasingly likely to get arbitrarily close to the actual distribution as the sample size increases. This property is called consistency. Data Oriented Parsing (DOP) employs all fragments of the trees in a training treebank, including the full parse-trees themselves, as the rewrite rules of a probabilistic treesubstitution grammar. Since the most popular DOP-estimator (DOP1) was shown to be inconsistent, there is an outstanding theoretical question concerning the possibility of DOPestimators with reasonable statistical properties. This question constitutes the topic of the current paper. First, we show that, contrary to common wisdom, any unbiased estimator for DOP is futile because it will not generalize over the training treebank. Subsequently, we show that a consistent estimator that generalizes over...

Detlef Prescher, Remko Scha, Khalil Sima'an, Andre

Real-time Traffic

CLIN 2003 | Computational Linguistics | Estimator | Statistical Estimator Attempts | Training Treebank |

claim paper

» Statistical test for consistent estimation of causal effects in linear nonGaussian models

» NoiseContrastive Estimation of Unnormalized Statistical Models with Applications to Natura...

» Noisecontrastive estimation A new estimation principle for unnormalized statistical models

» Kernel Partial Least Squares is Universally Consistent

» Consistency of Pseudolikelihood Estimation of Fully Visible Boltzmann Machines

» Consistent Estimators of Median and Mean Graph

» Computational Complexity of Probabilistic Disambiguation by means of TreeGrammars

» Consistency Checks for Particle Filters

Post Info
More Details (n/a)

Added	31 Oct 2010
Updated	31 Oct 2010
Type	Conference
Year	2003
Where	CLIN
Authors	Detlef Prescher, Remko Scha, Khalil Sima'an, Andreas Zollmann

Comments (0)

Sciweavers

On the Statistical Consistency of DOP Estimators

CLIN 2003 | Computational Linguistics | Estimator | Statistical Estimator Attempts | Training Treebank |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers