Quantize: use --pure, --output-tensor-type and --token-embedding-type at the same time #11074
