Blind Monaural Source Separation on Heart and Lung Sounds Based on Periodic-Coded Deep Autoencoder

Blind Monaural Source Separation on Heart and Lung Sounds Based on Periodic-Coded Deep Autoencoder

Paper: TBE

Authors: Kun-Hsi Tsai, Wei-Chien Wang, Chui-Hsuan Cheng, Chan-Yen Tsai, Jou-Kou Wang, Tzu-Hao Lin, Shih-Hau Fang, Li-Chin Chen, Yu Tsao

Abstract: Auscultation is the most efficient way to diagnose cardiovascular and respiratory diseases. To reach accurate diagnoses, a device must be able to recognize heart and lung sounds from various clini-cal situations. However, the recorded chest sounds are mixed by heart and lung sounds. Thus, effectively separating these two sounds is critical in pre-processing stage. Recent advances in machine learning have progressed on monaural source separations, but most of the well-known techniques require paired mixed sounds and indi-vidual pure sounds for model training. As to the preparation of pure heart and lung sounds is extremely difficult, special designs must be considered to derive effective heart and lung sound separation tech-niques. In this study, we proposed a novel periodicity-coded deep auto-encoder (PC-DAE) approach to separate mixed heart-lung sounds in an unsupervised manner via the assumption of different periodicities between heart rate and respiration rate. The PC-DAE benefits from deep-learning-based models by extracting representa-tive features and considers the periodicity of heart and lung sounds to carry out the separation. We evaluated PC-DAE on two datasets. The first one includes sounds from the Student Auscultation Mani-kin (SAM), and the second is prepared by recording chest sounds in real-world conditions. Experimental results indicate that PC-DAE outperforms several well-known separation works in terms of standardized evaluation metrics. Moreover, waveforms and spec-trograms demonstrate the effectiveness of PC-DAE compared to existing approaches. It is also confirmed that by using the proposed PC-DAE as a pre-processing stage, the heart sound recognition ac-curacies can be notably boosted. The experimental results con-firmed the effectiveness of PC-DAE and its potential to be used in clinical applications.

The convolutional deep autoencoder (DAE(C)) architecture. Flowchart:

Modulation frequency analysis method comparision(DFT, Wavelets)

  • We added the following description in the revised manuscript: Please also note that any particular time-frequency representation method can be used to perform MFA. The present study adopts the DFT as a representative method. Other time-frequency representation methods, such as CWT [29-31] and Hilbert–Huang transform [63-65], can be used. When using these methods, suitable basis functions or prior knowledge need to be carefully considered. In this study, we intend to focus our attention on DFT and will further explore other time-frequency representation methods in the future.
  • Evaluation results of separated heart sounds and lung sounds generated by the proposed PC-DAE(C) with CWT and DFT (termed PC-DAE(C)-CWT and PC-DAE(C)-DFT, respectively) in terms of SDR, SIR, and SAR. Avg denotes the average scores over five SNR levels.
  • Supplement Table 1 Supplement Table 2

    Random audio samples from Student Auscultation Manikin (SAM) testing set(The speaker possible can't play the heart sound(low frequency), earphonesis the better choice.)

    Meaning of the columns in the table below:

    1. The mix heart-lung sound input to the PC-DAE(C). It's generated by summing the heart sound with lung sound in direct SNR.
    2. The clean audio, which is the ground truth.
    3. The separated results(heart, lung sound) from the PC-DAE(C) with Wavelets MFA and PC-DAE(C) with DFT MFA.

    Case1:

    Mix heart-lung sound clean heart sound clean lung sound

    Separated heart sound(PC(DFT)) Separated lung sound(PC(DFT)) Separated heart sound(PC(Wavelets)) Separated lung sound(PC(Wavelets))

    Case2:

    Mix heart-lung sound clean heart sound clean lung sound

    Separated heart sound(PC(DFT)) Separated lung sound(PC(DFT)) Separated heart sound(PC(Wavelets)) Separated lung sound(PC(Wavelets))

    Case3:

    Mix heart-lung sound clean heart sound clean lung sound

    Separated heart sound(PC(DFT)) Separated lung sound(PC(DFT)) Separated heart sound(PC(Wavelets)) Separated lung sound(PC(Wavelets))

    Case4:

    Mix heart-lung sound clean heart sound clean lung sound

    Separated heart sound(PC(DFT)) Separated lung sound(PC(DFT)) Separated heart sound(PC(Wavelets)) Separated lung sound(PC(Wavelets))

    Code

  • TBE
  • Dataset information

  • Cardionics. "SAM® 3G — Student Auscultation Manikin
  • Copyright

  • Imediplus
  • Bio-ASP LAB