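Summary
Static channel-wise weight pruning that exploits computational invariance. PCA (eigendecomposition) is applied to the activations of a calibration dataset to obtain an orthogonal transformation matrix $Q_l$ for each layer $l$. $Q_l$ is merged into the adjacent linear weights, so the low-variance channels can then be pruned. LayerNorm is converted to RMSNorm, because RMSNorm commutes with orthogonal transformations. Since $Q$ is orthogonal ($Q^\top Q = I$), it preserves the norm of every vector:

$$\|x\|^2 = \sum_i x_i^2 = x^\top x \;\Longrightarrow\; \|Qx\|^2 = x^\top Q^\top Q x = \|x\|^2 \;\Longrightarrow\; \mathrm{RMSNorm}(XQ)\,Q^\top = \mathrm{RMSNorm}(X)$$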
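The invariance and the pruning step can be checked numerically. The following is a minimal NumPy sketch, not the original implementation. It assumes a scale-free RMSNorm (any learned scale already folded into the next linear layer) and an uncentered second-moment matrix $X^\top X$ for the PCA. The names `rms_norm`, `pca_rotation`, and `slice_linear` are hypothetical helpers introduced only for illustration.

```python
import numpy as np


def rms_norm(x, eps=1e-6):
    # Scale-free RMSNorm (assumption): divide each row by its root-mean-square.
    return x / np.sqrt(np.mean(x ** 2, axis=-1, keepdims=True) + eps)


def pca_rotation(calib):
    # Eigendecomposition of the uncentered second-moment matrix X^T X.
    # eigh returns eigenvalues in ascending order, so reorder to descending.
    eigvals, eigvecs = np.linalg.eigh(calib.T @ calib)
    order = np.argsort(eigvals)[::-1]
    return eigvecs[:, order]


def slice_linear(x, w, q, k):
    # Rotate the activations and merge Q^T into the weights, then keep
    # only the first k channels (the k largest principal directions).
    x_rot = (x @ q)[:, :k]
    w_rot = (q.T @ w)[:k, :]
    return x_rot, w_rot


rng = np.random.default_rng(0)
d, d_out, n = 16, 8, 64

# Synthetic "calibration activations" with unequal variance per channel.
X = rng.normal(size=(n, d)) * np.linspace(3.0, 0.1, d)
W = rng.normal(size=(d, d_out))
Q = pca_rotation(X)

# Q is orthogonal, so RMSNorm(XQ) Q^T equals RMSNorm(X).
orthogonality_error = np.abs(Q.T @ Q - np.eye(d)).max()
invariance_error = np.abs(rms_norm(X @ Q) @ Q.T - rms_norm(X)).max()

# Without slicing (k = d) the layer output is unchanged.
x_full, w_full = slice_linear(X, W, Q, d)
full_error = np.abs(x_full @ w_full - X @ W).max()

# With slicing (k < d) the output is only approximated.
x_half, w_half = slice_linear(X, W, Q, d // 2)
sliced_error = np.abs(x_half @ w_half - X @ W).max()

print("orthogonality error:", orthogonality_error)
print("invariance error:   ", invariance_error)
print("full-width error:   ", full_error)
print("sliced error (k=8): ", sliced_error)
```

The first three errors should sit at floating-point noise, while the sliced error reflects the information discarded with the pruned channels. How much can be removed in practice depends on how quickly the eigenvalues of the real activations decay.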