A dedicated greedy pursuit algorithm for sparse spectral representation of music sound


A dedicated algorithm for sparse spectral representation of music sound is presented. The goal is to enable the representation of a piece of music signal as a linear superposition of as few spectral components as possible, without affecting the quality of the reproduction. A representation of this nature is said to be sparse. In the present context sparsity is accomplished by greedy selection of the spectral components, from an overcomplete set called a dictionary. The proposed algorithm is tailored to be applied with trigonometric dictionaries. Its distinctive feature being that it avoids the need for the actual construction of the whole dictionary, by implementing the required operations via the fast Fourier transform. The achieved sparsity is theoretically equivalent to that rendered by the orthogonal matching pursuit (OMP) method. The contribution of the proposed dedicated implementation is to extend the applicability of the standard OMP algorithm, by reducing its storage and computational demands. The suitability of the approach for producing sparse spectral representation is illustrated by comparison with the traditional method, in the line of the short time Fourier transform, involving only the corresponding orthonormal trigonometric basis.

Publication DOI: https://doi.org/10.1121/1.4964342
Divisions: College of Engineering & Physical Sciences > School of Informatics and Digital Engineering > Mathematics
College of Engineering & Physical Sciences > Systems analytics research institute (SARI)
Additional Information: © 2016 Acoustical Society of America
Uncontrolled Keywords: Arts and Humanities (miscellaneous),Acoustics and Ultrasonics
Publication ISSN: 1520-8524
Full Text Link:
Related URLs: http://www.scop ... tnerID=8YFLogxK (Scopus URL)
PURE Output Type: Article
Published Date: 2016-10-28
Accepted Date: 2016-09-22
Submitted Date: 2015-09-28
Authors: Rebollo-Neira, Laura (ORCID Profile 0000-0002-7420-8977)
Aggarwal, Gagan



Version: Published Version

| Preview

Export / Share Citation


Additional statistics for this record