(Translated by https://www.hiragana.jp/)
[2405.16136] C3LLM: Conditional Multimodal Content Generation Using Large Language Models