train_new_from_iterator()
does not work when pre_tokenizer is null
#35315
Labels
train_new_from_iterator()
does not work when pre_tokenizer is null
#35315
System Info
transformers version 4.47.1
Ubuntu 20.04.6 LTS
Python 3.10
Who can help?
@ArthurZucker, @itazap
Information
Tasks
examples
folder (such as GLUE/SQuAD, ...)Reproduction
Follow the steps listed in https://huggingface.co/learn/nlp-course/chapter6/2, however use
microsoft/Phi-3.5-mini-instruct
as the model instead ofgpt2
.Expected behavior
The tokenizer should be trained.
The text was updated successfully, but these errors were encountered: