
Fix panic in DecodeStream::step due to incorrect index usage #1699

Merged: 3 commits into huggingface:main from fix-decode-stream-overflow on Jan 9, 2025

Conversation

n0gu-furiosa (Contributor)

When calling DecodeStream::step multiple times, it eventually panics with "attempt to subtract with overflow" in the following lines of code:

```rust
let new_prefix_index = ids.len() - *prefix_index;
*ids = ids.drain(*read_index..).collect();
```

The panic can be easily reproduced, and I have added a test case to demonstrate the issue.
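For illustration, here is a minimal, self-contained simulation of the index bookkeeping. It is a hypothetical model, not the real DecodeStream internals (it assumes every step produces output), but it shows how draining at read_index eventually leaves ids shorter than prefix_index, at which point the subtraction above overflows:

```rust
// Hypothetical stand-alone model of the bookkeeping in DecodeStream::step,
// assuming every call produces output. It is NOT the real implementation;
// it only reproduces the index arithmetic from the snippet above.
fn main() {
    let mut ids: Vec<u32> = Vec::new();
    let mut prefix_index: usize = 0;
    let mut read_index: usize = 0;

    for token in 0u32..16 {
        ids.push(token);

        // In the real code this is `ids.len() - *prefix_index`, which panics in
        // debug builds with "attempt to subtract with overflow" once the buggy
        // shrink has left `ids` shorter than `prefix_index`.
        assert!(
            ids.len() >= prefix_index,
            "invariant broken at token {token}: ids.len()={} < prefix_index={}",
            ids.len(),
            prefix_index
        );
        let new_prefix_index = ids.len() - prefix_index;

        // Buggy shrink: keeps `ids.len() - read_index` tokens, which does not
        // match the `new_prefix_index` the next call will rely on.
        let kept: Vec<u32> = ids.drain(read_index..).collect();
        ids = kept;
        // The fix keeps the invariant by draining at `prefix_index` instead:
        // let kept: Vec<u32> = ids.drain(prefix_index..).collect();

        read_index = prefix_index;
        prefix_index = new_prefix_index;
    }
    println!("never reached with the buggy shrink");
}
```

With the drain at read_index the assertion trips after only a handful of tokens, mirroring the overflow panic; draining at prefix_index instead leaves ids.len() equal to the updated prefix_index, so the subtraction can never underflow.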

Upon inspecting the code, I found that the shrinking of the token buffer references read_index instead of prefix_index. This PR corrects the issue by using the correct index.

However, this change makes read_index unused, so I am not entirely certain if it aligns with the intended logic of the original implementation. Please let me know if further adjustments or clarifications are needed, or if there is additional context regarding the intended use of read_index.

Narsil (Collaborator) commented on Jan 9, 2025:

> However, this change makes read_index unused, so I am not entirely certain if it aligns with the intended logic of the original implementation. Please let me know if further adjustments or clarifications are needed, or if there is additional context regarding the intended use of read_index.

Re-reading my own implementation, I'm quite confused by it. Your fix is definitely correct, and we can remove that internal pointer.
The prefix/read difference stems from another implementation I had, which tried to keep all side effects contained in the read segment (which is never compared) and keep the prefix segment distinct.
However, in this implementation, because we recalculate the prefix (*prefix = tokenizer.decode(..)), we re-encapsulate the side effect, which is simpler.
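To make that concrete, here is a condensed, hypothetical sketch of the post-fix flow. The decode call is abstracted as a closure and the field names are taken from the snippet above; the actual method returns a Result and treats an invalid prefix as an error rather than returning None:

```rust
// Condensed, hypothetical sketch of the stepping logic after the fix; not the
// crate's exact code. With the buffer shrunk at `prefix_index` and the prefix
// recomputed from the kept ids, no separate `read_index` is needed.
fn step_sketch(
    decode: impl Fn(&[u32]) -> String, // stand-in for tokenizer.decode(..)
    ids: &mut Vec<u32>,
    prefix: &mut String,
    prefix_index: &mut usize,
    id: u32,
) -> Option<String> {
    ids.push(id);
    let string = decode(ids.as_slice());
    // Only emit once the decoded text has grown and does not end in the
    // replacement character '\u{FFFD}' ("�"), i.e. we are not mid-character.
    if string.len() > prefix.len() && !string.ends_with('\u{FFFD}') {
        if !string.starts_with(prefix.as_str()) {
            // The real method reports an error in this case.
            return None;
        }
        let new_text = string[prefix.len()..].to_string();
        let new_prefix_index = ids.len() - *prefix_index;
        // The fix from this PR: shrink the buffer at `prefix_index`, not `read_index`.
        *ids = ids.drain(*prefix_index..).collect();
        // Recomputing the prefix from the shrunk buffer re-encapsulates the
        // side effect, which is what makes `read_index` redundant.
        *prefix = decode(ids.as_slice());
        *prefix_index = new_prefix_index;
        Some(new_text)
    } else {
        None
    }
}
```

Each emission is the difference between the newly decoded string and the previously decoded prefix, and the buffer only ever keeps the tokens that back that prefix.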

What's weird is that I remember having only that in my first iteration of this. I have no idea why I re-added that read_index in a half-broken way.

Thanks a lot for the fix.

Note: I will rewrite your unit tests. They are good, but I find them not really readable in a linear way, and panicking within a test is perfectly fine (it simply counts as a failure).

Narsil merged commit 862d1a3 into huggingface:main on Jan 9, 2025 (22 of 29 checks passed).
n0gu-furiosa deleted the fix-decode-stream-overflow branch on January 10, 2025, 01:36.