(Translated by https://www.hiragana.jp/)
[2406.18485] LoongTrain: Efficient Training of Long-Sequence LLMs with Head-Context Parallelism