The Synchronized Short-Time-Fourier-Transform: Properties and Definitions for Multichannel Source Separation

Ruairí de Fréin, Scott Rickard

    Research output: Contribution to journalArticlepeer-review

    Abstract

    This paper proposes the use of a synchronized linear transform, the synchronized short-time-Fourier-transform (sSTFT), for time-frequency analysis of anechoic mixtures. We address the short comings of the commonly used time-frequency linear transform in multichannel settings, namely the classical short-time-Fourier-transform (cSTFT). We propose a series of desirable properties for the linear transform used in a multichannel source separation scenario: stationary invertibility, relative delay, relative attenuation, and finally delay invariant relative windowed-disjoint orthogonality (DIRWDO). Multisensor source separation techniques which operate in the time-frequency domain, have an inherent error unless consideration is given to the multichannel properties proposed in this paper. The sSTFT preserves these relationships for multichannel data. The crucial innovation of the sSTFT is to locally synchronize the analysis to the observations as opposed to a global clock. Improvement in separation performance can be achieved because assumed properties of the time-frequency transform are satisfied when it is appropriately synchronized. Numerical experiments show the sSTFT improves instantaneous subsample relative parameter estimation in low noise conditions and achieves good synthesis.
    Original languageEnglish
    Pages (from-to)91-103
    JournalIEEE Transactions on Signal Processing
    Volume59
    Issue number1
    Publication statusPublished - 2011

    Fingerprint

    Dive into the research topics of 'The Synchronized Short-Time-Fourier-Transform: Properties and Definitions for Multichannel Source Separation'. Together they form a unique fingerprint.

    Cite this