🦜 voice-cloning-app

Last year I was using Voice-Cloning-App to create Tacotron 2 models, and archived a working copy using Docker, as it was dying of bitrot. This is the case with many Python machine learning projects, which seem to have a limited shelf life due to the pace of development, hardware and driver dependencies, and the size of the models are too big to stick in a git repo (without incurring charges at least).

Well now I’ve got a bigger disk and Linux running on my laptop, I decided to do some more cloning. Turns out that the creator, Ben has archived the project. With newer models like RVC and Bark being all the rage, most of the community have moved on. But I personally like Tacotron2 as it actually works with my accent, and the cloning app is really easy to use.

So I messaged him and asked if he’d unarchive it and transition to a GitHub org, he can keep ownership and I’ll do some maintenance and invite contributors to join as and when they resurface.

And he agreed 🎉, so now I’ve inherited a Discord server and a project with 1.4k stars, which seems like an awful lot of responsibility.

A few things that need(ed) fixing:

So once I get CI working again and make the thing run with data from the Internet Archive, I’ll do a new release, and think about how it can be extended with some of the ideas that I’ve been working on for decentralizing this sort of thing.

For the moment, the Docker nvidia version is gonna be the most up-to-date as it’s the one I’m using myself.