In this paper, we discuss a user interface issue in automatic video editing based on the speaker’s intentions. In our experiments, subjects used a previously developed video capturing system to try four editing methods while giving three types of presentations. Subjective evaluation revealed that the editing method based on behaviors-for-attention scored well for presentations in which the subject was given specific instructions regarding the tasks to be performed. For presentations without a scenario, an editing method using a footswitch and posture obtained a higher score. We conclude that combining behavior-based and footswitch-based editing would provide a good environment for content acquisition.