Confidence estimation, OOV detection and language ID using phone-to-word transduction and phone-level alignments