Prediction-Based Gesture Detection in Lecture Videos by Combining Visual, Speech and Electronic Slides