bias in selfAttention #253

wintersurvival · 2020-12-03T02:43:31Z

When running the transformer, there is no bias in SelfAttention, whereas mesh_tensorflow/bert has a bias in its self-attention.
What is the meaning of relative_attention_type in transformer_layer.SelfAttention?
How can I get a bias in transformer_layer.SelfAttention?
