Efficient Metadata Generation to Enable Interactive Data Discovery over Large-Scale Scientific Data Collections