❓ Questions

I'm curious about using EnCodec to interpolate between audio clips in latent space. However, model.encode(inputs["input_values"], inputs["padding_mask"]) returns discrete integer codes and not a continuous vector representation. Is interpolation possible?
My understanding of the code is that embedding creation and quantization happen together inside model.encode(), which makes interpolation challenging. I reimplemented encoding and decoding with embedding generation and quantization broken out as separate steps in my repository at https://github.com/jhurliman/music-interpolation. This bypasses quantization entirely, so it is effectively just the SEANet encoder-decoder, but it can still load the pre-trained "facebook/encodec_*khz" models.
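The separated pipeline described above can be sketched as follows. This assumes a `transformers`-style `EncodecModel` whose SEANet encoder and decoder are exposed as `model.encoder` and `model.decoder`; it is an illustrative outline, not the linked repository's code, and both clips must be the same length so their latent frames align.

```python
import torch

def lerp(z_a: torch.Tensor, z_b: torch.Tensor, alpha: float) -> torch.Tensor:
    """Linear interpolation in latent space: alpha=0 gives z_a, alpha=1 gives z_b."""
    return (1.0 - alpha) * z_a + alpha * z_b

def interpolate_clips(model, wav_a: torch.Tensor, wav_b: torch.Tensor,
                      alpha: float = 0.5) -> torch.Tensor:
    """Encode two equal-length clips of shape (batch, channels, samples) to
    continuous SEANet embeddings, blend them, and decode -- skipping the
    quantizer entirely."""
    with torch.no_grad():
        z_a = model.encoder(wav_a)   # continuous latents, not discrete codes
        z_b = model.encoder(wav_b)
        z = lerp(z_a, z_b, alpha)
        return model.decoder(z)      # waveform reconstructed from blended latents
```

Spherical interpolation (slerp) is a common alternative when blending latents, but the straight lerp above is the simplest starting point.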