
Releases: belladoreai/llama-tokenizer-js

v1.2.2

27 Jun 17:59

Minor fixes to decode. These should have no effect except when the user passes invalid parameter combinations:

  • When decoding with add_bos_token set to True, we previously cut out the first token unconditionally, assuming it was the BOS token. We now check whether the first token actually is the BOS token, and leave it in place if it is something else.
  • When decoding with add_preceding_space set to True, we previously assumed the first character of the decoded text was a space and cut it out unconditionally. We now check whether the first character actually is a space, and leave it in place if it is something else.
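The guard logic described above can be sketched roughly as follows. This is an illustrative sketch, not the library's actual internals; the BOS token id of 1 is an assumption (it matches LLaMA's vocabulary).

```javascript
const BOS_TOKEN_ID = 1; // LLaMA's BOS id (assumption for this sketch)

// Strip a leading BOS token only if the first token actually is BOS.
function stripBos(tokenIds) {
  return tokenIds[0] === BOS_TOKEN_ID ? tokenIds.slice(1) : tokenIds;
}

// Strip a leading space only if the decoded text actually starts with one.
function stripPrecedingSpace(text) {
  return text.startsWith(" ") ? text.slice(1) : text;
}
```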

v1.2.1

24 Mar 18:40

TypeScript fix

v1.2.0

24 Mar 18:17
  • Add TypeScript types definition file
  • Refactor tokenizer into a Class
  • Allow passing custom vocab and merge data to tokenizer
  • Allow passing custom tests to tokenizer test runner
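A class-based tokenizer that accepts custom vocab and merge data might take roughly the following shape. The names and constructor signature here are illustrative assumptions, not the library's actual API; consult the project README for the real usage.

```javascript
// Minimal sketch of a class-based tokenizer with pluggable vocab and merges.
class Tokenizer {
  constructor({ vocab, merges }) {
    this.vocab = vocab;   // token string -> id
    this.merges = merges; // "left right" pair -> merge priority
    // Build the reverse mapping once so decode is a simple lookup.
    this.idToToken = new Map(
      Object.entries(vocab).map(([tok, id]) => [id, tok])
    );
  }

  // Decode token ids back to text by concatenating token strings.
  decode(tokenIds) {
    return tokenIds.map((id) => this.idToToken.get(id) ?? "").join("");
  }
}

// Constructing an instance with custom data:
const tokenizer = new Tokenizer({
  vocab: { he: 0, llo: 1 },
  merges: { "he llo": 0 },
});
```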

v1.1.3

07 Aug 21:50
  • Fix a bug in an unused function (no effect on tokenizer results)
  • Support very large inputs (the previous version was not guaranteed to produce correct results for inputs longer than 100 000 characters, although in practice it almost always did)

v1.1.2

01 Aug 07:22

Bugfix to support Next.js and other environments where performance.now() is not available.
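A common way to handle environments that lack performance.now() is a feature-detected fallback to Date.now(). The snippet below shows that general technique; it is a sketch, not necessarily the exact fix the library shipped.

```javascript
// Prefer the high-resolution timer when available, fall back to Date.now()
// in environments (e.g. some server-side rendering contexts) that lack it.
const now =
  typeof performance !== "undefined" && typeof performance.now === "function"
    ? () => performance.now()
    : () => Date.now();

const start = now();
// ... timed work ...
const elapsedMs = now() - start;
```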

v1.1.1

24 Jun 19:07

Bugfix affecting results in extremely rare cases: merges with equal priority are now always performed left-to-right.
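The tie-breaking rule can be illustrated with a small helper that scans for the next BPE merge to apply: among candidate pairs with the lowest (best) priority, it picks the leftmost occurrence. This is an illustrative sketch, not the library's actual implementation.

```javascript
// Find the next merge to perform: the pair with the lowest priority value,
// breaking ties by position (leftmost wins).
function pickNextMerge(tokens, mergePriorities) {
  let best = null;
  for (let i = 0; i < tokens.length - 1; i++) {
    const pair = tokens[i] + " " + tokens[i + 1];
    const prio = mergePriorities[pair];
    if (prio === undefined) continue;
    // Strict "<" means an equal-priority pair found later never replaces
    // an earlier one, so equal-priority merges resolve left-to-right.
    if (best === null || prio < best.prio) {
      best = { index: i, prio };
    }
  }
  return best; // { index, prio } or null if no merge applies
}
```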

v1.1.0

16 Jun 13:44

Add support for different runtimes

v1.0.1

13 Jun 20:27
Release 1.0.1

v1.0.0

12 Jun 16:33
Release v1.0.0