A Language-Independent Approach to Identify the Named Entities in Under-Resourced Languages and Clustering Multilingual Documents