Automatic adaptation of proper noun dictionaries through cooperation of machine learning and probabilistic methods