Using interview data to identify evaluation criteria for interactive, analytical question-answering systems