Installing on windows and run voice infill:
Step 0: Git clone and download the models
git clone https://github.com/jasonppy/VoiceCraft
cd VoiceCraft
Step 1: Update conda and install, set solver to libmamba
https://www.anaconda.com/blog/conda-is-fast-now
Step 2: Create env
Step 3: Install ffmpeg and espeak-ng, add paths to path, make sure you can call it in the cmd
https://bootphon.github.io/phonemizer/install.html
Step 4: Install in order:
Step 5: Download the MFA model
when running, it will scream "no triton installed". This is apparently fine https://github.com/invoke-ai/InvokeAI/issues/2611
also just pip install missing libraries when needed. All needfuls must be done