Deep Learning Framework for Efficient Real-Time Speech Enhancement and Dereverberation

¹ Technion - Israel Institute of Technology, Haifa, Israel

² MISTRIX Ltd., Tel Aviv, Israel

³ Insoundz Ltd., Tel Aviv, Israel

Abstract

Deep learning has revolutionized speech enhancement, enabling impressive high-quality noise reduction and dereverberation. However, state-of-the-art methods often demand substantial computational resources, hindering their deployment on edge devices and in real-time applications. Computationally efficient approaches like deep filtering and Deep Filter Net offer an attractive alternative by predicting linear filters instead of directly estimating the clean speech. While Deep Filter Net excels in noise reduction, its dereverberation performance remains limited. In this paper, we present a generalized framework for computationally efficient speech enhancement and, based on this framework, identify an inherent constraint within Deep Filter Net that hinders its dereverberation capabilities. We propose an extension to the Deep Filter Net framework designed to overcome this limitation, demonstrating significant improvements in dereverberation performance while maintaining competitive noise reduction quality. Our experimental results highlight the potential of this enhanced framework for real-time speech enhancement on resource-constrained devices.

Audio Samples

Noisy Reverberant	Reverberant

Cite this Work

@article{rosenbaum2024dfndereverb,
  title   = {Deep Learning Framework for Efficient Real-Time Speech Enhancement and Dereverberation},
  author  = {Tomer Rosenbaum, Emil Winebrand, Omer Cohen, and Israel Cohen},
  journal = {Sensors},
  year    = {2024}
}