(Translated by https://www.hiragana.jp/)
[2406.07007] Crayon: Customized On-Device LLM via Instant Adapter Blending and Edge-Server Hybrid Inference