Summary

  • Weight-only post-training quantization (PTQ).
  • Incoherence processing using the Randomized Hadamard Transform (RHT).
  • Vector quantization.
  • Fine-tuning (lite-QAT).
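
To make the incoherence-processing item concrete, here is a minimal NumPy sketch of a randomized Hadamard transform applied to both sides of a weight matrix. It is an illustration under stated assumptions, not the summarized method's actual implementation: the function names (`hadamard`, `rht`), the two-sided form U W V^T, and the dense Sylvester construction (instead of a fast O(n log n) transform) are choices made here for clarity, and both dimensions are assumed to be powers of two.

```python
import numpy as np

def hadamard(n):
    """Orthonormal Hadamard matrix via the Sylvester construction.

    Assumption for this sketch: n is a power of two.
    """
    if n < 1 or n & (n - 1):
        raise ValueError("n must be a power of two")
    H = np.array([[1.0]])
    while H.shape[0] < n:
        H = np.block([[H, H], [H, -H]])
    return H / np.sqrt(n)  # scaled so that H @ H.T is the identity

def rht(W, rng):
    """Two-sided randomized Hadamard transform of W (hypothetical helper).

    Returns the transformed matrix together with the orthogonal
    matrices U and V, so the transform can be undone exactly.
    """
    m, n = W.shape
    s_left = rng.choice([-1.0, 1.0], size=m)   # random sign vector
    s_right = rng.choice([-1.0, 1.0], size=n)  # random sign vector
    U = hadamard(m) * s_left    # equals H_m @ diag(s_left)
    V = hadamard(n) * s_right   # equals H_n @ diag(s_right)
    return U @ W @ V.T, U, V

rng = np.random.default_rng(0)
W = 0.01 * rng.standard_normal((8, 16))
W[0, 0] = 10.0                      # one large outlier weight

W_t, U, V = rht(W, rng)
W_back = U.T @ W_t @ V              # inverse: U and V are orthogonal

print("max |W|   before:", np.abs(W).max())
print("max |W_t| after: ", np.abs(W_t).max())
```

Because U and V are orthogonal, the transform preserves the Frobenius norm and is exactly invertible; the single outlier is spread across all entries, so the largest magnitude in the transformed matrix is much smaller than in the original. A flatter magnitude profile like this is what makes the subsequent quantization step easier.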