User guided audio selection from complex sound mixtures

14 years 7 months ago

Download web.media.mit.edu

In this paper we present a novel interface for selecting sounds in audio mixtures. Traditional interfaces in audio editors provide a graphical representation of sounds which is either a waveform, or some variation of a time/frequency transform. Although with these representations a user might be able to visually identify elements of sounds in a mixture, they do not facilitate object-speciﬁc editing (e.g. selecting only the voice of a singer in a song). This interface uses audio guidance from a user in order to select a target sound within a mixture. The user is asked to vocalize (or otherwise sonically represent) the desired target sound, and an automatic process identiﬁes and isolates the elements of the mixture that best relate to the user’s input. This way of pointing to speciﬁc parts of an audio stream allows a user to perform audio selections which would have been infeasible otherwise. ACM Classiﬁcation: H.5.5 [Multimedia Information Systems]: Sound and Music Computing,...

Paris Smaragdis

Real-time Traffic

Audio Mixtures | Software Engineering | Target Sound | UIST 2009 | User |

claim paper

Post Info
More Details (n/a)

Added	28 May 2010
Updated	28 May 2010
Type	Conference
Year	2009
Where	UIST
Authors	Paris Smaragdis

Comments (0)

Sciweavers

User guided audio selection from complex sound mixtures

Audio Mixtures | Software Engineering | Target Sound | UIST 2009 | User |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers