
Flops regularizer looks odd #41

Open
dangkhoasdc opened this issue Mar 20, 2025 · 1 comment
Comments

@dangkhoasdc
Contributor

dangkhoasdc commented Mar 20, 2025

This implementation does not match the formula in the SPLADE paper. Also, to minimize this loss, the second operand needs to be equal to the threshold, which is not the goal of FLOPS.

```python
input=torch.mean(input=torch.abs(input=activations), dim=0) ** 2, dim=0
```

Here is another implementation that is closer to the formula:

https://github.com/thongnt99/learned-sparse-retrieval/blob/d702026aacf1ab7c47011f55edcb2646a6bb646d/lsr/losses/regularizer.py#L56
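For reference, a minimal sketch of the FLOPS regularizer as defined in the SPLADE paper: sum over vocabulary dimensions of the squared mean absolute activation across the batch. The function name and the assumption that `activations` is a `(batch, vocab)` tensor are mine, not the repository's:

```python
import torch

def flops_loss(activations: torch.Tensor) -> torch.Tensor:
    """FLOPS regularizer from the SPLADE paper:
    sum_j (mean_i |a_ij|)^2, where i indexes the batch
    and j indexes vocabulary dimensions."""
    return torch.sum(torch.mean(torch.abs(activations), dim=0) ** 2)

activations = torch.tensor([[1.0, 0.0], [3.0, 0.0]])
# mean over batch: [2.0, 0.0]; squared: [4.0, 0.0]; sum: 4.0
print(flops_loss(activations))  # tensor(4.)
```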

@raphaelsty
Owner

Hi @dangkhoasdc, I wrote it this way so it acts as a margin-based FLOPS loss: the model is asked to achieve a certain FLOPS budget.
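A margin-based variant along these lines could be sketched as follows. This is a guess at the intent, not the repository's actual code; the `threshold` parameter and the absolute-distance penalty are assumptions:

```python
import torch

def margin_flops_loss(activations: torch.Tensor, threshold: float = 1.0) -> torch.Tensor:
    # Hypothetical margin-based FLOPS loss: instead of pushing FLOPS to
    # zero, penalize the distance between the FLOPS value and a target
    # threshold, so the model converges to a given sparsity budget.
    flops = torch.sum(torch.mean(torch.abs(activations), dim=0) ** 2)
    return torch.abs(flops - threshold)

activations = torch.tensor([[1.0, 0.0], [3.0, 0.0]])
# flops = 4.0, threshold = 1.0 -> loss = |4.0 - 1.0| = 3.0
print(margin_flops_loss(activations))  # tensor(3.)
```

The loss is minimized when the FLOPS term equals the threshold, which matches the behavior dangkhoasdc describes above.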

Feel free to open a PR that provides the correct FLOPS loss by default :)
