—This paper presents a framework for automatic video region-of-interest determination based on user attention model. In this work, a set of attempts on using video attention features and knowledge of applied media aesthetics are made. Three types of visual attention features we used are intensity, color, and motion. Referring to aesthetic principles, these features are combined according to camera motion types on the basis of a newly proposed video analysis unit, framesegment. We conduct subjective experiments on several kinds of video data and demonstrate the effectiveness of the proposed framework.