Commit 6452b4a
authored
mxfp8 inference roofline: add fusion to observed (#3223)
Summary:
Adds option to benchmark with relu -> linear to capture the impact
of fusing the activation to the quant kernel
Test Plan:
```bash
(pt_nightly_312_2) [[email protected] ~/local/ao
(20251021_inference_fusion_modeling)]$ python
benchmarks/float8/float8_inference_roofline.py ~/local/tmp/test.csv
--recipe_name mxfp8_cublas --shape_gen_name pow2_extended
--enable_fusion_modeling True
```
Reviewers:
Subscribers:
Tasks:
Tags:1 parent 13595c5 commit 6452b4a
1 file changed
+9
-3
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
168 | 168 | | |
169 | 169 | | |
170 | 170 | | |
| 171 | + | |
171 | 172 | | |
172 | 173 | | |
173 | 174 | | |
| |||
176 | 177 | | |
177 | 178 | | |
178 | 179 | | |
| 180 | + | |
179 | 181 | | |
180 | 182 | | |
181 | 183 | | |
| |||
184 | 186 | | |
185 | 187 | | |
186 | 188 | | |
| 189 | + | |
187 | 190 | | |
188 | 191 | | |
189 | 192 | | |
| |||
194 | 197 | | |
195 | 198 | | |
196 | 199 | | |
| 200 | + | |
197 | 201 | | |
198 | 202 | | |
199 | 203 | | |
| |||
287 | 291 | | |
288 | 292 | | |
289 | 293 | | |
290 | | - | |
291 | | - | |
292 | | - | |
| 294 | + | |
| 295 | + | |
| 296 | + | |
| 297 | + | |
| 298 | + | |
293 | 299 | | |
294 | 300 | | |
295 | 301 | | |
| |||
0 commit comments