Tensor Parallelism

Megatron-style tensor parallelism for models too large to fit on a single device.
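As a rough illustration of the idea, here is a minimal sketch in NumPy, assuming the usual Megatron-style split of a two-layer MLP: the first weight matrix is partitioned by columns and the second by rows, so each device computes a partial output and a single all-reduce combines them. The devices are simulated with a Python list, the sum stands in for the all-reduce, and the function names (`mlp_reference`, `mlp_tensor_parallel`) and shapes are hypothetical, chosen only for this example.

```python
import numpy as np


def gelu(x):
    # Tanh approximation of GELU, applied elementwise.
    return 0.5 * x * (1.0 + np.tanh(np.sqrt(2.0 / np.pi) * (x + 0.044715 * x**3)))


def mlp_reference(x, w1, w2):
    # Unsharded two-layer MLP: Y = GELU(X W1) W2.
    return gelu(x @ w1) @ w2


def mlp_tensor_parallel(x, w1, w2, n_shards):
    # Column-split W1 and row-split W2; each simulated device holds one pair.
    w1_shards = np.split(w1, n_shards, axis=1)
    w2_shards = np.split(w2, n_shards, axis=0)

    # Each device computes its partial output independently. GELU is
    # elementwise, so it can be applied to each column shard locally.
    partials = [gelu(x @ a) @ b for a, b in zip(w1_shards, w2_shards)]

    # Summing the partial outputs stands in for the all-reduce that a real
    # multi-device implementation would perform here.
    return np.sum(partials, axis=0)


rng = np.random.default_rng(0)
x = rng.standard_normal((4, 8))    # (batch, hidden)
w1 = rng.standard_normal((8, 32))  # (hidden, 4 * hidden)
w2 = rng.standard_normal((32, 8))  # (4 * hidden, hidden)

out_ref = mlp_reference(x, w1, w2)
out_tp = mlp_tensor_parallel(x, w1, w2, n_shards=4)
print(np.allclose(out_ref, out_tp))
```

Splitting the first matrix by columns is what lets the nonlinearity run without communication; a row split there would require a sum before GELU could be applied.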