-
Notifications
You must be signed in to change notification settings - Fork 1.4k
Description
Hello there!
I am trying to work on a solution for a blind friend of mine to make all non-spoken texts in Persona 4 and 5 available to him since most OCR software does not seem to handle the in-game text very well. The idea is to train off of the available in-game samples and produce voices that can be used to produce audio for those textboxes that do not have voice data.
So, this is effectively a two-step process: Train a voice off of the voice clips of a character, and then run inference on all the texts that need a voice. However, there's like...a good dozen characters.
How long would it take to train, and is there a "checkpoint" system? As in, if I let it train for N hours a day and then stop, and resume the next, would that be possible? Because I am verry much sure that a single 4090 won't train a voice within one mere day.
Thank you very much and kind regards,
Ingwie