Exploiting dictionaries in named entity extraction: combining semi-Markov extraction processes and data integration methods