xFasterTransformer uses a different model format than Hugging Face, compatible with NVIDIA FasterTransformer's format. The conversion tools dump the Hugging Face model's parameters, layer by layer, into binary files that the xFasterTransformer CPU code can load.
After that, convert the model into the xFasterTransformer format using the corresponding script. Once it finishes, you will see many .bin files in the output directory.
python chatglm_convert.py -i ${HF_DATASET_DIR} -o ${OUTPUT_DIR}
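For example, a typical end-to-end invocation might look like the following sketch; the directory paths are placeholders for illustration, not part of the repository:

export HF_DATASET_DIR=/data/chatglm-6b-hf    # placeholder: local directory with the downloaded Hugging Face checkpoint
export OUTPUT_DIR=/data/chatglm-6b-xft       # placeholder: destination directory for the converted model
python chatglm_convert.py -i ${HF_DATASET_DIR} -o ${OUTPUT_DIR}
ls ${OUTPUT_DIR}                             # after conversion, this should contain the generated .bin files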