Over the past century alone, millions of hours of audiovisual data have been collected with great potential for e.g., new creative productions, research and educational purposes. ...
This paper reports a comparison of user performance (time and accuracy) when controlling a popular arcade game of Tetris using speech recognition or non-speech (humming) input tec...
Adam J. Sporka, Sri Hastuti Kurniawan, Murni Mahmu...
Abstract. Although older people are an important user group for smart environments, there has been relatively little work on adapting natural language interfaces to their requireme...
Ravichander Vipperla, Maria Wolters, Kallirroi Geo...
We investigate methods of segmenting, visualizing, and indexing presentation videos by both audio and visual data. The audio track is segmented by speaker, and augmented with key ...
Driving behavior has been trending towards more time in the car and longer commutes. This has fueled the demand for an increasing number of in-vehicle infotainment features, at th...
Jackie C. Chang, Annie Lien, Brian Lathrop, Holger...