Discriminative learning techniques for sequential data have proven to be more effective than generative models for named entity recognition, information extraction, and other task...
We present two machine learning approaches to information extraction from semi-structured documents that can be used if no annotated training data are available, but there does ex...
This paper presents a statistical, learned approach to finding names and other nonrecursive entities in text (as per the MUC-6 definition of the NE task), using a variant of the s...
Daniel M. Bikel, Scott Miller, Richard M. Schwartz...
We present conditional random fields, a framework for building probabilistic models to segment and label sequence data. Conditional random fields offer several advantages over hid...
John D. Lafferty, Andrew McCallum, Fernando C. N. ...
In many pattern recognition tasks, given some input data and a family of models, the “best” model is defined as the one which maximizes the likelihood of the data given the m...
Tara N. Sainath, Dimitri Kanevsky, Bhuvana Ramabha...