I observed that I could allocate approximately 16 GB of parameters per device, leading me to infer that the total memory across the TPU v3-8 is around 128 GB (16 GB per device × 8 devices).

```python
import jax
import jax.numpy as jnp
from jax_smi import initialise_tracking

initialise_tracking()

dtype = jnp.float32
bytes_per_gb = 1024**3
dtype_size = jnp.dtype(dtype).itemsize  # 4 bytes for float32
# Number of elements in a 0.25 GB array.
elements_per_quarter_gb = bytes_per_gb // dtype_size // 4

allocated_arrays = []
total_size_gb = 0.
while True:
    try:
        arr = jnp.ones(elements_per_quarter_gb, dtype=dtype)
        allocated_arrays.append(arr)
        total_size_gb += 0.25
    except RuntimeError:
        break
print(f'Finally load {total_size_gb} GB to one device')
```

The code above prints "Finally load 15.25 GB to one device", and we can also check the capacity with jax-smi. However, the official documentation (https://cloud.google.com/tpu/docs/system-architecture-tpu-vm#tpu_v3) says that "Each v3 TPU chip contains two TensorCores" and that "HBM2 capacity and bandwidth: 32 GiB, 900 GBps". What is the meaning of "TensorCores", and how does the 32 GiB capacity relate to the 16 GB I measured?
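A quick sanity check of the chunk-size arithmetic in the snippet above (plain Python; the 4-byte float32 element size is assumed):

```python
bytes_per_gb = 1024**3
dtype_size = 4  # bytes per float32 element
elements = bytes_per_gb // dtype_size // 4  # same computation as in the snippet
chunk_gb = elements * dtype_size / bytes_per_gb
print(chunk_gb)  # 0.25 -> each jnp.ones allocation is a quarter of a GB
```

So the loop grows memory in 0.25 GB steps, which is why `total_size_gb` is incremented by 0.25 per iteration.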
That's correct. "TensorCore" in this terminology is the "dense compute" core of a TPU (*). To JAX, these appear as "devices". In a TPU v3-8 you have 4 chips, 2 cores per chip, and each core has 16 GB of HBM, so this looks to JAX like 8 devices with 16 GB each. The 32 GiB in the documentation is the per-chip figure: two TensorCores × 16 GiB each.

(*) "TensorCore" here is unrelated to NVIDIA's hardware unit with the same name.
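The topology arithmetic above can be sketched in plain Python (numbers taken from the reply; GB vs GiB rounding is ignored here):

```python
# TPU v3-8 topology as described in the reply.
chips = 4
cores_per_chip = 2        # "TensorCores"; each appears to JAX as one device
hbm_gb_per_core = 16

devices = chips * cores_per_chip
print(devices)                           # 8 JAX devices
print(cores_per_chip * hbm_gb_per_core)  # 32 GiB per chip, matching the docs
print(devices * hbm_gb_per_core)         # 128 GB total across the v3-8
```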