Skip to content

Question about padding mask, and using model's encoder features #307

Answered by jongwook
jasonppy asked this question in Q&A
Discussion options

You must be logged in to vote

We haven't done any experiments on truncated encoder features, but I'd expect it'd be just fine to do so, e.g. using the first 150 out of 1500 tokens for a 3-second audio.

Replies: 3 comments 2 replies

Comment options

You must be logged in to vote
1 reply
@jasonppy
Comment options

Comment options

You must be logged in to vote
0 replies
Answer selected by jongwook
Comment options

You must be logged in to vote
1 reply
@d2a-raudenaerde
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
5 participants