You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thanks for the code!I found that the code may have the following problems:
The token input to the model decoder at each time step is the same as the output token, so the accuracy of the model is very high. But the correct input to the decoder is the token at time t-1, not the token at time t.
teacher forcing can only be used for model training, but the code uses it when evaluating the model.
The text was updated successfully, but these errors were encountered:
您的代码对我很有帮助,但是我发现代码可能存在如下问题:
Thanks for the code!I found that the code may have the following problems:
The text was updated successfully, but these errors were encountered: