DeepFilterNet: Perceptually Motivated Real-Time Speech Enhancement
Paper: arXiv:2305.08227
Real-time speech enhancement model for Apple Silicon. Removes background noise from speech audio.
| Audio duration | Processing time | RTF (real-time factor) |
|---|---|---|
| 5s | 0.65s | 0.13 |
| 10s | 1.2s | 0.12 |
| 20s | 4.8s | 0.24 |
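
The RTF column is the processing time divided by the audio duration, so any value below 1 means the model runs faster than real time. The sketch below reproduces that arithmetic from the table rows. The helper name `realTimeFactor` is illustrative only and is not part of the package.

```swift
import Foundation

/// Real-time factor: seconds of compute per second of audio.
/// Values below 1.0 mean processing runs faster than real time.
/// (Illustrative helper, not part of the SpeechEnhancement package.)
func realTimeFactor(processingSeconds: Double, audioSeconds: Double) -> Double {
    precondition(audioSeconds > 0, "audio duration must be positive")
    return processingSeconds / audioSeconds
}

// Rows from the benchmark table: (audio duration, processing time), in seconds.
let rows: [(duration: Double, time: Double)] = [(5, 0.65), (10, 1.2), (20, 4.8)]

for row in rows {
    let rtf = realTimeFactor(processingSeconds: row.time, audioSeconds: row.duration)
    print(String(format: "%2.0fs of audio: RTF %.2f", row.duration, rtf))
}
```

In these measurements the 20 s clip has roughly twice the RTF of the shorter clips, so processing time does not scale linearly with input length.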
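```swift
import SpeechEnhancement

// Load the pretrained model (asynchronous, may throw).
let enhancer = try await SpeechEnhancer.fromPretrained()

// Denoise the input; `noisyAudio` is the noisy speech, sampled at 48 kHz.
let clean = try enhancer.enhance(audio: noisyAudio, sampleRate: 48000)
```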
```shell
swift run audio denoise noisy.wav --output clean.wav
```
- DeepFilterNet3.mlpackage: Core ML FP16 model (Neural Engine)
- auxiliary.npz: ERB filterbank, Vorbis window, normalization states
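
For readers unfamiliar with the Vorbis window stored in `auxiliary.npz`, the sketch below computes one from its standard formula, w[n] = sin(π/2 · sin²(π(n + 0.5)/N)), and checks its power-complementary property. The frame size of 960 samples (20 ms at 48 kHz) is an assumption made for illustration. The values shipped in the file may differ, and this is not the package's own code.

```swift
import Foundation

/// Vorbis power-complementary window of length `size`:
/// w[n] = sin(pi/2 * sin^2(pi * (n + 0.5) / size))
/// (Illustrative helper, not part of the SpeechEnhancement package.)
func vorbisWindow(size: Int) -> [Double] {
    precondition(size > 0 && size % 2 == 0, "window size must be positive and even")
    return (0..<size).map { n -> Double in
        let s = sin(Double.pi * (Double(n) + 0.5) / Double(size))
        return sin(Double.pi / 2 * s * s)
    }
}

// Assumption: 960-sample frames (20 ms at 48 kHz) with 50% overlap.
// The actual window is stored in auxiliary.npz and may use another size.
let frameSize = 960
let window = vorbisWindow(size: frameSize)

// Power complementarity: w[n]^2 + w[n + N/2]^2 == 1 for every n < N/2.
// This is what lets 50%-overlapped analysis/synthesis frames sum back to unity.
let half = frameSize / 2
let maxError = (0..<half)
    .map { abs(window[$0] * window[$0] + window[$0 + half] * window[$0 + half] - 1) }
    .max() ?? 0
print("max deviation from 1:", maxError)
```

Because the squared window halves sum to one, applying the same window before the forward transform and after the inverse transform reconstructs the signal under 50% overlap-add.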