Skip to content

Confusion in injecting speaker information while finetuning the model. #302

Closed Answered by eginhard
RamakrishnaChaitanya asked this question in Q&A
Discussion options

You must be logged in to vote

There are 2 ways in Coqui to encode speaker information:

  1. Train speaker embeddings as part of the model ("use_speaker_embedding": true, in the config)
  2. Use an external speaker encoder model to get speaker embeddings ("use_d_vector_file": true, in the config)

When fine-tuning a model, you'd need to check its config to see what method it uses.

Original repo crosslink: coqui-ai#4155

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by RamakrishnaChaitanya
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants