There are some pretty magic tools now that help with voice tracks like this. DaVinci Resolve has one in the paid version. There are also some new AI tools that I've used that are incredible. The one I used is called Ultimate Vocal Remover. Coming from a sound engineering background, I'd say it defies the laws of physics: it nearly brings back the master track without having it, which is pretty cool. It's slower on CPU of course, but if you have a supported GPU it's pretty quick. So those are two areas you can look into.
Here's a demo of the DaVinci Resolve one on YouTube. Not sure how well it will do with echoes.
https://www.youtube.com/watch?v=9k7-OtG2lAE&t=39s