[2307.03738] QIGen: Generating Efficient Kernels for Quantized Inference on Large Language Models