Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Reward model #2

Open
kirudang opened this issue Oct 13, 2024 · 1 comment
Open

Reward model #2

kirudang opened this issue Oct 13, 2024 · 1 comment

Comments

@kirudang
Copy link

Hello there,
I want to test this watermark for two model: Llama2 7B and Mistral 7B, can I use a same reward model, let's say OPT 1.3B?
Thank you

@xiaojunxu
Copy link
Owner

Our implementation does not directly support it, as we are using the same tokenizer for the LLM and the reward model. Since the models you mentioned use different tokenizers, you may need some adaptations in the code to get it work.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants