Fix model kwargs #35875
Merged
45 commits
d3c618e
Save state
muellerzr c489527
Make a failing test
muellerzr 8a58190
Better test
muellerzr 4348e36
mpt -> done, many more to go
muellerzr 3b3dfd2
Rm extranious
muellerzr 2bf5390
Bamba
muellerzr 34f9060
Bert
muellerzr 3960502
big_bird
muellerzr a87ed15
biogpt
muellerzr 2705ae6
bloom
muellerzr 33e718b
codegen
muellerzr e215848
ctrl
muellerzr 72459fa
data2vec
muellerzr 212ee51
dbrx
muellerzr 8159793
Through up to Dbrx
muellerzr f5cf781
electra
muellerzr 96e26f6
ernie
muellerzr 1ac07d3
falcon
muellerzr 9666691
Fuyu/persimmon
muellerzr d2d8f8e
Include noop kwargs to base models
muellerzr bf112ca
Rebase
muellerzr 308b91d
Skip musigen
muellerzr ad5e487
Refactor/skip mllama
muellerzr 14c121d
Revert makefile
muellerzr fcf896c
Rm file
muellerzr 24b59bf
Fix PT failing, need to modify rest of loss funcs to not resize
muellerzr 6320ab4
Propagate some
muellerzr 44530b6
Continue
muellerzr 978dbbe
More
muellerzr ea4484e
More options
muellerzr 12627ef
Mostly fixed
muellerzr dc42e65
Proved that it's the same
muellerzr 9f23ae7
Bloom is good
muellerzr 12c00f6
Make ability to override loss func possible
muellerzr b6fb606
Fixup
muellerzr cfb3bcf
Clean
muellerzr f7eda3b
Fix xglm
muellerzr 6d34419
Quality tests
muellerzr c103851
Skip OCR2
muellerzr bde0bef
Make specific loss for xglm
muellerzr 2f951dd
Make order the same/line up 1:1
muellerzr 5204b53
xglm
muellerzr 038dc55
Skip fx output loss bloom model
muellerzr 6033db8
Didn't pass in pad_token_id
muellerzr ff06a1d
Fix quality
muellerzr
@ArthurZucker this needed to be added for a few models that don't need everything the loss func does. The case in question was xglm.