Diffusion Transformer Training Pipeline #5410

L2_Megatron_GPT_SFT_Eval_inference_seq_len_greaterThan_training_seq_len  /  main

succeeded Oct 13, 2024 in 1m 53s