Like there’s video AI to enhance video, but podcasts have so many bad connections with guests it’s horrible to listen to. They need to run out through an audio AI.
(Don’t even get me started on asking the President questions when the fucking helicopter is in the background. Who tf can hear anything?)
noisetorch does an ok job for video conferences and works on your speaker (easier on you) and on your microphone (easier on them). We often use it to limit keyboard noise during meetings.
I’m using https://auphonic.com/ for my podcasts.
If you are fine with non-free shit that will probably train with your audio, try out adobe’s AI.
Davinci resolve studio has had a pretty great AI de-noisier for awhile now! They also use AI to be able to turn off certain instruments/parts of songs ( and its local!)
Actually they’ve been here for a very long time.
DeBird is the oldest commercial ML/Audio product I know. It’s around since 2017.
Accentize plugins are the commercial ones.
Also there’s a Waves one.
Nvidia’s RTX Voice can do it in real time since 2020.
And lastly on the open-source end, there’s Intel’s OpenVino for Audacity.
RTX Voice was renamed to Nvidia Broadcast: https://www.nvidia.com/en-us/geforce/broadcasting/broadcast-app/
Not aware of any but have a look at https://replicate.com/explore
Got a good recommendation for the video one you mentioned? I have some videos digitized from VHS tapes that I’d like to clean up. (I’m only interested in models I can run locally, BTW, for obvious privacy reasons.)
Ultimate Vocal Remover is a FOSS app using FOSS models for creating stems and extracting vocals, but it also has a few de-noising models.