This paper presents a novel user interaction concept for document image scanning with mobile phones. A high resolution mosaic image is constructed in two main stages. Firstly, online camera motion estimation is applied to the phone to assist the user to capture small image patches of the document page. Automatic image stitching process with the help of estimated device motion is carried out to reconstruct the full view of the document. Experiments on document images captured and processed with mosaicing software clearly show the feasibility of the approach.