Natural language processing and e-Government: crime information extraction from heterogeneous data sources