[Relax] Batch norm correctness on eval mode #17752

hugolatendresse · 2025-03-16T05:54:06Z

Batch_norm is a different operator in training and eval. The previous interface defaulted to the training mode and required changing an ingested pytorch program itself to use the eval mode. This is sub-ideal, especially since torch.export explicitely communicates whether batch_norm should be in training or eval in a given torch program.

This PR automates the selection of training/eval mode in the exported program translator, and achieves correctness for eval mode.

Future TODO: there is something wrong with batch_norm on training mode. It does not pass a correctness test when taken straight from the main branch (there's an issue with tensor dimensions). I added a note to address later as training mode is probably not high priority.

…ials

hugolatendresse added 18 commits March 10, 2025 13:22

trying to understand why batchnorm returns all zeros

8b5a81d

debugging training vs non-training batch norm

99373ae

merge main

4f93317

added training in attrs

b0e1154

training False

dde7872

training argument in nn.py

dff60db

little cleanup before building

f1986d9

fix copy-paste errors

1545b99

builds, but should probably just update nn.h instead

77cc1d8

batch_norm build

0dbf8fe

first batchnorm test passes with .eval(), but not without, and copy f…

1164d21

…ials

copy failing

a72ce6e

todo

42728f7

cleanup

9ee0672

training failing

e3f0236

no need to pass center and scale since default ok

3f68087

cleanup

5cd314d

cleanup

d5d30b7

hugolatendresse changed the title ~~[Relax] Fix batch norm ingestion~~ [Relax] Batch norm correctness on eval mode Mar 16, 2025

hugolatendresse added 11 commits March 16, 2025 13:17

reformat

125a9a6

Merge branch 'main' into batch_norm

281fb53

batch norm default and print torch version

3f0eaea

whitespace

79e3ec6

remove dummy test

79c4a0e

Merge branch 'main' of https://github.com/apache/tvm into batch_norm

2dc643e

getting a tuple as output of batchnorm

b9697f3

output now of the right dimension, and close! but is not exactly equal

bc18182

still not the same with 2 1 2 2

e2e7263

missing eps

4cdb05a

last small test passes, but most tests still fail

b256163

hugolatendresse added 4 commits March 21, 2025 15:51

passes

ab8d75c

passes

7cb5a56

need to fix test_batch_norm7

536310a

commented out tests that pass

4c55f20

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Relax] Batch norm correctness on eval mode #17752

[Relax] Batch norm correctness on eval mode #17752

hugolatendresse commented Mar 16, 2025 •

edited

Loading

[Relax] Batch norm correctness on eval mode #17752

Are you sure you want to change the base?

[Relax] Batch norm correctness on eval mode #17752

Conversation

hugolatendresse commented Mar 16, 2025 • edited Loading

hugolatendresse commented Mar 16, 2025 •

edited

Loading