Multichannel Source Separation Using Time-Deconvolutive CNMF

  • Thadeu Luiz Barbosa Dias PEE/COPPE, UFRJ
  • Wallace Alves Martins PEE/COPPE, UFRJ
  • Luiz Wagner Pereira Biscainho PEE/COPPE, UFRJ

Abstract

This paper addresses the separation of audio sourcesfrom convolutive mixtures captured by a microphone array. Weapproach the problem using complex-valued non-negative matrixfactorization (CNMF), and extend previous works by tailoringadvanced (single-channel) NMF models, such as the deconvolutiveNMF, to the multichannel factorization setup. Further, a sparsity-promoting scheme is proposed so that the underlying estimatedparameters better fit the time-frequency properties inherentin some audio sources. The proposed parameter estimationframework is compatible with previous related works, and can bethought of as a step toward a more general method. We evaluatethe resulting separation accuracy using a simulated acousticscenario, and the tests confirm that the proposed algorithmprovides superior separation quality when compared to a state-of-the-art benchmark. Finally, an analysis on the effects of theintroduced regularization term shows that the solution is in factsteered toward a sparser representation.

Published
14-05-2020
How to Cite
Dias, T. L., Martins, W., & Biscainho, L. W. (2020). Multichannel Source Separation Using Time-Deconvolutive CNMF. Journal of Communication and Information Systems, 35(1), 103-112. https://doi.org/10.14209/jcis.2020.11
Section
Regular Papers