usage

format conversion

Given the network issues, the tutorial here requires the MBPP dataset on Huggingface to be downloaded. I have already downloaded it, it is test-00000-of-00001.parquet. the download address is below:

https://huggingface.co/datasets/google-research-datasets/mbpp/tree/main/sanitized

In order to adapt to the now mainstream jsonl representation (since I won't be using parquet (つω`.)), you need to convert parquet to jsonl

The conversion file is par_jsonl.py and the result of the test-00000-of-00001.parquet conversion is MBPP.jsonl

$ python par_jsonl.py

Answer Generation

Call the API to evaluate the LLM (you need to fill in the API_key before that)

$ python MBPP_completion.py

The sample result is MBPP_samples.jsonl

Evaluation of results

Run the python code on your own system and calculate Pass@1

$ python evaluate_MBPP.py

The generated result is MBPP_results.jsonl

Sample Error Summary

$ python all_false.py

Error cases are summarized in failed_tasks.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

usage

format conversion

Answer Generation

Evaluation of results

Sample Error Summary

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
prompt		prompt
.gitattributes		.gitattributes
MBPP.jsonl		MBPP.jsonl
MBPP_completion.py		MBPP_completion.py
MBPP_results.jsonl		MBPP_results.jsonl
MBPP_samples.jsonl		MBPP_samples.jsonl
README.md		README.md
all_false.py		all_false.py
evaluate_MBPP.py		evaluate_MBPP.py
failed_tasks.txt		failed_tasks.txt
gpt-4o_samples.jsonl		gpt-4o_samples.jsonl
par_jsonl.py		par_jsonl.py
test-00000-of-00001.parquet		test-00000-of-00001.parquet

Qlalq/MBPP

Folders and files

Latest commit

History

Repository files navigation

usage

format conversion

Answer Generation

Evaluation of results

Sample Error Summary

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages