Audio Codes "at utterance level" #83

juanilarregui · 2024-04-30T18:01:49Z

❓ Questions

I'm interested in using the encoder to encode an audio fragment of a few seconds into just one codebook vector. However, the model returns a sequence of several audio_codes (which, of course, is the only way to successfully decode the audio afterwards).

How would you recommend using the encoder, and/or pre-processing the audio input or post-processing the audio_codes, to obtain just one audio code "at utterance level"?
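To make the goal concrete, one possible reading of "utterance level" is to average the frame-level vectors over time and then pick the nearest codebook entry. The snippet below is only a minimal, library-agnostic sketch of that idea in plain Python. It is not this repository's API: `mean_pool`, `nearest_code`, `utterance_code`, and the toy `frames` and `codebook` values are all hypothetical, and it assumes the frame-level (pre-quantization) vectors and the codebook are both accessible. A single code obtained this way would not be expected to decode back to the original audio.

```python
import math
from typing import List, Sequence

Vector = Sequence[float]


def mean_pool(frames: Sequence[Vector]) -> List[float]:
    """Average frame-level vectors over time into a single vector."""
    if not frames:
        raise ValueError("need at least one frame")
    n = len(frames)
    # zip(*frames) iterates over dimensions; each column holds one value per frame.
    return [sum(column) / n for column in zip(*frames)]


def nearest_code(vector: Vector, codebook: Sequence[Vector]) -> int:
    """Return the index of the codebook entry closest in Euclidean distance."""
    return min(range(len(codebook)), key=lambda k: math.dist(vector, codebook[k]))


def utterance_code(frames: Sequence[Vector], codebook: Sequence[Vector]) -> int:
    """Hypothetical helper: one code index for a whole utterance."""
    return nearest_code(mean_pool(frames), codebook)


# Toy data (assumed, for illustration only): 3 frames of dimension 2
# and a codebook with 3 entries.
frames = [[0.9, 0.1], [1.1, -0.1], [1.0, 0.3]]
codebook = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0]]
code = utterance_code(frames, codebook)
```

Mean pooling is just one choice here. Any other way of summarizing the sequence into one vector could be swapped in before the codebook lookup.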

Thanks in advance.


juanilarregui added the "question" label (Further information is requested) on Apr 30, 2024
