Explicitly pass expert_tensor_parallel_size to initialize_model_parallel #537
Merged