說明

Llama 3.3 70B training is optimized for AMD GPUs