Fine-tuning LLM on multi GPUs

Fine-tuning LLM on multi GPUs