Implementing more CodeLMs #41
Merged · 3 commits · Apr 22, 2023
Conversation

yilunzhao (Member)

Working on #30 for this PR

niansong1996 (Contributor) left a comment

Good work! Some questions:

  • Do you have to update the huggingface transformers library version?
  • On what device/GPU did you test the mathqa setting?

    # for llama-based model
    return output.lstrip().split("\n\n")[0].strip()
else:
    return output.lstrip().split(tokenizer_eos_token)[0].split("\n\n")[0].strip()
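For context, a self-contained sketch of this truncation logic (the function name, signature, and is_llama guard are assumptions; only the two return branches appear in the diff):

def truncate_generation(output: str, tokenizer_eos_token: str, is_llama: bool) -> str:
    if is_llama:
        # llama-based models: cut at the first blank line only
        return output.lstrip().split("\n\n")[0].strip()
    # other models: cut at the eos token first, then at the first blank line
    return output.lstrip().split(tokenizer_eos_token)[0].split("\n\n")[0].strip()

For example, truncate_generation("answer = 42\n\nexplanation", "</s>", is_llama=True) returns "answer = 42".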
niansong1996 (Contributor)

So LLAMA does not have an eos token?

yilunzhao (Member, Author) commented Apr 14, 2023

  • Do you have to update the huggingface transformers library version?

Yes. There has not been an official release of transformers that supports LLAMA, so I installed the library from source (a quick sanity check is sketched at the end of this comment):

pip install git+https://github.com/huggingface/transformers
  • On what device/GPU did you test the mathqa setting?

The experimental settings are recorded in this Google Doc, which I will keep updated.
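A quick sanity check that the source install includes LLAMA support (a sketch; the class names follow the transformers main branch at the time):

import transformers
print(transformers.__version__)  # a ".dev0"-style version indicates a source install
# these imports raise ImportError on releases that predate LLAMA support
from transformers import LlamaForCausalLM, LlamaTokenizer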

niansong1996 (Contributor)

To reiterate the action items discussed in the meeting:

  • add the command to install the library so we can replicate the results

Also, I was wondering if this PR is ready to merge? You mentioned that some edge cases haven't been handled yet?

yilunzhao (Member, Author)

  • add the command to install the library so we can replicate the results

I just updated the requirements.txt in the second commit.
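For reference, a pip source pin in requirements.txt generally takes this form (the exact line in the commit is an assumption):

transformers @ git+https://github.com/huggingface/transformers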

Also, I was wondering if this PR is ready to merge? You mentioned that some edge cases haven't been handled yet?

Yes. LLAMA, Alpaca, and santacoder should work fine using the config file: /home/lily/yl2465/code/NLP4Code/finetuning/training_configs/few_shot/mathqa-8_fixed_mathqa_shots_llama.yaml

niansong1996 (Contributor)

@yilunzhao Can you resolve the conflicts and also merge from main to run the CI tests?

niansong1996 (Contributor)

Great, merging this PR now

niansong1996 (Contributor)

@yilunzhao Check my comments on #46

Since we can't reopen a merged PR, can you submit a new PR and point it to this PR instead?

Let me know if you have any questions. Sorry about the confusion.

yilunzhao (Member, Author)

Hi @niansong1996, I have submitted a new PR, #48; could you please take a look?
This time, I added an instruction to the README on how to install the transformers library from source to run LLAMA-based models, rather than modifying requirements.txt.
