why llmc do not support asymmetric quantization? #258

hensiesp32 · 2024-12-13T03:07:13Z

Hi, thanks for your wonderful work. I wonder why llmc's quantization doesn't support asymmetric quantization when I set save_vllm = True.
My config is below:

base:
    seed: &seed 42
model:
    type: Qwen2
    path: /mnt/public/lingo-engine/model_info/multidoc_masterthemev11_qwen2_7b_bf16_240904
    tokenizer_mode: slow
    torch_dtype: auto
calib:
    name: pileval
    download: False
    path: /mnt/public/lingo-engine/data/pileval_dataset/
    n_samples: 128
    bs: -1
    seq_len: 512
    preproc: general
    seed: *seed
quant:
    method: Awq
    weight:
        bit: 8
        symmetric: False
        granularity: per_channel
        group_size: -1
    act:
        bit: 8
        symmetric: False
        granularity: per_token
    special:
        trans: True
        trans_version: v2
        weight_clip: True
    quant_out: True
save:
    save_vllm: True
    save_path: /mnt/public/daixin/masterthemev11_qwen2_7b_awq_w8a8_unsym

Then I get this error:

[rank0]: Traceback (most recent call last):
[rank0]:   File "/mnt/user/daixin/llmc/llmc/__main__.py", line 317, in <module>
[rank0]:     main(config)
[rank0]:   File "/mnt/user/daixin/llmc/llmc/__main__.py", line 194, in main
[rank0]:     assert w.symmetric, 'Only symmetric quant is supported.'

llmc/llmc/__main__.py

Line 194 in 5e3361c

assert w.symmetric, 'Only symmetric quant is supported.'
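The assertion quoted above can be mirrored as a pre-flight check, so a config is rejected before model loading and calibration start. This is a minimal sketch, assuming the YAML has already been loaded into a plain dict with the same keys as the config in this issue. `check_vllm_export` is a hypothetical helper and not part of llmc. It checks only the weight setting, because that is the only check visible in the traceback.

```python
def check_vllm_export(config: dict) -> None:
    """Raise early if a vLLM export is requested with asymmetric weights.

    Hypothetical helper for illustration; llmc itself uses an assert in
    llmc/__main__.py instead.
    """
    save_cfg = config.get("save", {})
    if not save_cfg.get("save_vllm", False):
        return  # no vLLM export requested, nothing to check

    weight_cfg = config.get("quant", {}).get("weight", {})
    # Assumption of this sketch: a missing key is treated as asymmetric.
    if not weight_cfg.get("symmetric", False):
        raise ValueError(
            "save_vllm: True requires quant.weight.symmetric: True "
            "(llmc asserts 'Only symmetric quant is supported.')"
        )


# The relevant part of the config from this issue, as a dict.
config = {
    "quant": {"weight": {"bit": 8, "symmetric": False, "granularity": "per_channel"}},
    "save": {"save_vllm": True},
}

try:
    check_vllm_export(config)
except ValueError as err:
    print(f"config rejected: {err}")
```

Setting `symmetric: True` under `quant.weight`, or dropping `save_vllm: True`, makes this check pass.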


gushiqiao · 2024-12-13T03:15:05Z

This is because the vLLM backend only supports inference with symmetric quantization, so llmc rejects asymmetric settings when save_vllm is True.
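For readers unfamiliar with the distinction, here is a generic sketch of the two schemes in pure Python. It is not llmc's or vLLM's implementation; the function names and sample values are made up for illustration. Symmetric quantization stores only a scale, with the zero point fixed at 0. Asymmetric quantization also stores a zero point, which an inference kernel has to account for.

```python
def quantize_symmetric(values, bits=8):
    """Map floats to signed integers in [-qmax, qmax]; zero point is always 0."""
    qmax = 2 ** (bits - 1) - 1  # 127 for 8 bits
    max_abs = max(abs(v) for v in values)
    scale = max_abs / qmax if max_abs > 0 else 1.0
    q = [max(-qmax, min(qmax, round(v / scale))) for v in values]
    return q, scale


def quantize_asymmetric(values, bits=8):
    """Map floats to unsigned integers in [0, 2**bits - 1] using a zero point."""
    qmin, qmax = 0, 2 ** bits - 1  # 0..255 for 8 bits
    lo, hi = min(values), max(values)
    scale = (hi - lo) / (qmax - qmin) if hi > lo else 1.0
    zero_point = round(qmin - lo / scale)  # integer that represents float 0.0
    q = [max(qmin, min(qmax, round(v / scale) + zero_point)) for v in values]
    return q, scale, zero_point


def dequantize(q, scale, zero_point=0):
    """Recover approximate floats; symmetric data uses zero_point=0."""
    return [(x - zero_point) * scale for x in q]


weights = [-0.5, 0.0, 0.25, 1.5]  # made-up sample values

q_sym, s_sym = quantize_symmetric(weights)
q_asym, s_asym, zp = quantize_asymmetric(weights)

print("symmetric :", q_sym, "scale =", s_sym)
print("asymmetric:", q_asym, "scale =", s_asym, "zero_point =", zp)
print("sym  recon:", dequantize(q_sym, s_sym))
print("asym recon:", dequantize(q_asym, s_asym, zp))
```

The asymmetric variant uses the full integer range for a skewed value distribution, but the extra zero point is what a symmetric-only backend cannot consume.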

hensiesp32 changed the title ~~why llmc not support asymmetric quantization?~~ why llmc do not support asymmetric quantization? Dec 13, 2024
