Why is Flax Linear not identical to matrix multiplication? #4020

Closed · Answered by thijs-vanweezel
thijs-vanweezel asked this question in Q&A

The answer has been provided on Stack Overflow.

The matmul should be transposed relative to the usual W @ x convention: nnx.Linear stores its kernel with shape (in_features, out_features) and computes

y = x.squeeze() @ layer.kernel.value + layer.bias.value
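
A minimal sketch to verify this convention (the layer sizes, input, and rngs seed here are illustrative assumptions, not from the original post):

import jax
import jax.numpy as jnp
from flax import nnx

# Hypothetical square layer, so it is also invertible below
layer = nnx.Linear(in_features=3, out_features=3, rngs=nnx.Rngs(0))
x = jnp.ones((1, 3))

y = layer(x)  # nnx.Linear computes x @ kernel + bias
y_manual = x.squeeze() @ layer.kernel.value + layer.bias.value
assert jnp.allclose(y.squeeze(), y_manual)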

So, to invert an nnx.Linear operation: since y = x @ kernel + bias, each example satisfies kernel.T @ x = y - bias, which is exactly the a @ x = b form that jnp.linalg.tensorsolve expects:

import jax
import jax.numpy as jnp

# Batch the solve over the leading (example) axis
solve_batched = jax.vmap(jnp.linalg.tensorsolve)
# Solves kernel.T @ x = y - bias for each example, recovering x
x_recovered = solve_batched(
    a=jnp.broadcast_to(
        layer.kernel.value.T,  # note the transposition
        (y.shape[0], *layer.kernel.value.shape)),
    b=y - layer.bias.value)
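
As a round-trip check, continuing the illustrative setup above (the batch size and PRNG seed are assumptions; the kernel must be square and invertible for the solve to exist):

x_batch = jax.random.normal(jax.random.key(1), (4, 3))  # hypothetical batch of 4
y = layer(x_batch)
x_recovered = solve_batched(
    a=jnp.broadcast_to(layer.kernel.value.T, (y.shape[0], *layer.kernel.value.shape)),
    b=y - layer.bias.value)
assert jnp.allclose(x_batch, x_recovered, atol=1e-5)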
