Toward Active Learning in Data Selection: Automatic Discovery of Language Features During Elicitation