You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Yeah it is strange that is has the log there. I found another implementation that did not use log but softmax only for both Teacher and Student.
As you might have already known it, when we modify the F.log_softmax(out_s/self.T, dim=1) in to F.softmax(out_s/self.T, dim=1), the KD output is negative.
Hi,
I just have a question, why did you use an additional log before the softmax function in st.py loss on one input but not the other?
The text was updated successfully, but these errors were encountered: