Explicitly pass expert_tensor_parallel_size to initialize_model_parallel
#537