Refactor streaming and g-sampling FSM logics #239

jeffreymeetkai · 2024-08-08T09:31:03Z

prompt_template.initialize_fsm_gen_state initializes prompt template's FSM gen state. Both streaming and gsampling will call this method at the start.
In streaming, the generator will repeatedly call prompt_template.stream_delta_text method that streams specific delta texts and updates the gen state at every iteration
In gsampling, monkey-patched async_llm_engine will call prompt_template.grammar_sample method that grammar samples the tokens and updates the gen state at every iteration
prompt_template.update_fsm_gen_state updates the gen_state.
If both gsampling is enabled and the request is a streaming request, grammar sampling will first create and maintain a FSM to produce the grammar-sampled token to yield to the wrapping generator. The wrapping streaming generator will also create and maintain a separate FSM to stream the grammar-sampled tokens.

khai-meetkai · 2024-08-12T01:47:01Z

functionary/prompt_template/base_template.py

+        gen_state["func_name"] = func_name
+        gen_state["func_index"] += 1
+        gen_state["call_id"] = prompt_utils.get_random_tool_call_id()
+        gen_state["first_time_func"] = True


Oh why this is always True?

If this is True, it means we need to stream an empty chunk before streaming the chunks containing the function name and arguments. Thereafter, we will set this to False.

khai-meetkai · 2024-08-12T04:20:55Z

functionary/prompt_template/llama3_prompt_template_v3.py

-                    empty_response = prompt_utils.get_text_delta_response(
-                        "", True, finish_reason
+        # Form the options for the following stages
+        options = []


I think we can create a function for getting options, this is kind of duplicate in function: stream_delta_text and grammar_sample

Sure will do

…mbine-streaming-gsampling-fsm

jeffreymeetkai added 5 commits August 7, 2024 15:30

refactor fsm logic for v3.0 wip

3323257

refactor fsm logic for v3.0

a258956

refactor fsm logic for v2.5

7452251

refactor fsm logic for v3.1

6312758

refactor fsm logic for v2

c5c4cd1

jeffreymeetkai requested review from musab-mk and khai-meetkai August 8, 2024 09:31

jeffreymeetkai added 2 commits August 8, 2024 09:43

fix unittests

403293c

fix unittest

fe924b9

khai-meetkai reviewed Aug 13, 2024

View reviewed changes

jeffrey-fong added 3 commits August 13, 2024 15:53

Merge branch 'main' of https://github.com/MeetKai/functionary into co…

de5e3b8

…mbine-streaming-gsampling-fsm

edit based on comments

5bc68ea

fix

a6d1974

khai-meetkai approved these changes Aug 14, 2024

View reviewed changes

jeffreymeetkai merged commit 2041dad into main Aug 14, 2024
3 checks passed

jeffreymeetkai deleted the combine-streaming-gsampling-fsm branch August 14, 2024 07:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor streaming and g-sampling FSM logics #239

Refactor streaming and g-sampling FSM logics #239

jeffreymeetkai commented Aug 8, 2024 •

edited

Loading

khai-meetkai Aug 12, 2024

jeffreymeetkai Aug 13, 2024

khai-meetkai Aug 12, 2024

jeffreymeetkai Aug 13, 2024

Refactor streaming and g-sampling FSM logics #239

Refactor streaming and g-sampling FSM logics #239

Conversation

jeffreymeetkai commented Aug 8, 2024 • edited Loading

khai-meetkai Aug 12, 2024

Choose a reason for hiding this comment

jeffreymeetkai Aug 13, 2024

Choose a reason for hiding this comment

khai-meetkai Aug 12, 2024

Choose a reason for hiding this comment

jeffreymeetkai Aug 13, 2024

Choose a reason for hiding this comment

jeffreymeetkai commented Aug 8, 2024 •

edited

Loading