Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Ask for data recipe to reproduce Medusa-2 #125

Open
Achazwl opened this issue Oct 24, 2024 · 0 comments
Open

Ask for data recipe to reproduce Medusa-2 #125

Achazwl opened this issue Oct 24, 2024 · 0 comments

Comments

@Achazwl
Copy link

Achazwl commented Oct 24, 2024

In the README.md, you mentioned that

The data preparation code for self-distillation can be found in data_generation folder of the current repo.

In that folder, it says

python generate.py --data_path YOUR_DATA_PATH --output_path YOUR_OUTPUT_PATH --num_threads NUM_THREADS --max_tokens YOUR_MAX_TOKENS --temperature YOUR_TEMPERATURE

Which data/tokens/temperature should I use to reproduce existing Medusa-2 results? Should the --chat format be applied for reproduction? Could you list the full recipe for us?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant