I think it's currently used by ExecuTorch. We should deprecate the APIs in GPTQ.py and move to the new APIs.