Training lt_alksnis and fr_sequoia with a batch size of 8 produces errors. I have not fully diagnosed the issue, but I am fairly confident that the problem comes from reading the oracle actions from the 'gold_actions' list in the metadata instead of from a tensor, as the old code did. See:
koepsala-parser/modules/transition_parser_eud.py
Line 492 in 56ac985
This is because we pop from this list after each action is taken. In most cases this works fine, since we should only see a sentence once per training iteration. However, allennlp appears to sometimes repeat a sentence within a batch (presumably to make sure the batch is full). When that happens, the correct action has already been popped from the list by the second time we see the sentence.
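To illustrate the suspected failure mode, here is a minimal sketch in plain Python. It is not the parser's actual code: the `sentence_metadata` dict, the action names, and `consume_gold_actions` are hypothetical stand-ins, and it assumes that a repeated sentence shares one metadata object across both batch entries.

```python
# Hypothetical stand-in for one instance's metadata; the real structure may differ.
sentence_metadata = {"gold_actions": ["SHIFT", "LEFT-ARC", "SHIFT", "RIGHT-ARC"]}

# Assumption: the same instance appears twice in the batch, so both entries
# refer to the same dict and therefore the same list object.
batch = [sentence_metadata, sentence_metadata]


def consume_gold_actions(metadata):
    """Pop oracle actions one at a time, mutating the shared list in place."""
    taken = []
    while metadata["gold_actions"]:
        taken.append(metadata["gold_actions"].pop(0))
    return taken


first_pass = consume_gold_actions(batch[0])   # gets all four actions
second_pass = consume_gold_actions(batch[1])  # list is already empty

print(first_pass)
print(second_pass)
```

The second pass finds nothing left to pop, which matches the symptom described: the oracle action is missing the second time the sentence is processed.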
The easiest fix is probably to revert to using tensors as gold actions.
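The reason a tensor would avoid the problem is that it is read by index rather than consumed. A rough sketch of that idea follows, using a tuple as a stand-in for the tensor so the example stays dependency-free. `replay_gold_actions` and the action names are hypothetical, not taken from the repository.

```python
from typing import List, Sequence


def replay_gold_actions(gold_actions: Sequence[str]) -> List[str]:
    """Read the oracle action for each step by index, without mutating the input."""
    taken = []
    for step in range(len(gold_actions)):
        taken.append(gold_actions[step])
    return taken


# A tuple stands in for the gold-action tensor: indexable and never popped.
gold = ("SHIFT", "LEFT-ARC", "SHIFT", "RIGHT-ARC")

# Assumption: the same sentence appears twice in the batch.
batch = [gold, gold]
passes = [replay_gold_actions(actions) for actions in batch]

print(passes[0] == passes[1])
```

Because nothing is removed from the sequence, a repeated sentence sees the full oracle sequence on every pass.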