OPEA
/

MiniMax-Text-01-int4-sym-inc-preview

4-bit precision

intel/auto-round

Model card Files Files and versions Community

cicdatopea commited on 13 days ago

Commit

d26a0e7

·

verified ·

1 Parent(s): 5e8f238

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -24,7 +24,7 @@ pip3 install git+https://github.com/intel/auto-round.git@bf16_inference
 pip3 install auto-gptq
 ```
-**This model is prone to overflow when running with FP16 int4 kernel dtype** and does not support CPU execution, as it explicitly relies on CUDA operations in the model files. While we have implemented several workarounds to ensure functionality, **some prompts may still produce unexpected and random outputs**.
 ~~~python
 from auto_round import AutoRoundConfig  ##must import for autoround format

 pip3 install auto-gptq
 ```
+**This model is prone to overflow when running with int4 kernel with FP16 computation dtype** and does not support CPU, as it explicitly relies on CUDA operations in the model files. While we have implemented several workarounds to ensure functionality, **some prompts may still produce unexpected and random outputs**.
 ~~~python
 from auto_round import AutoRoundConfig  ##must import for autoround format