sparse benchmarking numbers #303

jcaip · 2024-06-03T13:13:01Z

Updated benchmark script for standalone sparse numbers.
Switched from segment-anything to segment-anything-fast
Updated README with results for segment-anything and BERT

pytorch-bot · 2024-06-03T13:13:04Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/303

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 1fbea59 with merge base 8a4e693 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

msaroufim · 2024-06-03T14:31:39Z

benchmarks/benchmark_sam.py

 import pandas as pd
+from segment_anything_fast import sam_model_registry


I missed that sam fast isn't included in our benchmarks - suggestion is maybe to put a sam folder under benchmarjs with a README on custom dependencies and how to install them or just add a comment above this line as to how people can install sam fast

msaroufim · 2024-06-03T14:33:25Z

torchao/sparsity/README.md

+
+#### BERT
+
+We were able to accelerate BERT 1.23x with a negligible accuracy drop on SQuAD.


which hardware?

msaroufim · 2024-06-03T14:36:21Z

torchao/sparsity/README.md

+
+#### segment-anything
+We applied 2:4 sparsity to accelerate segment-anything, as part of [segment-anything-fast](https://github.com/pytorch-labs/segment-anything-fast).
+The results mentioned in the REAADME of the repo compose sparsity with a suite of other inference acceleration techniques.


Suggested change

The results mentioned in the REAADME of the repo compose sparsity with a suite of other inference acceleration techniques.

The results mentioned in the README of the repo compose sparsity with a suite of other inference acceleration techniques.

msaroufim · 2024-06-03T14:38:22Z

torchao/sparsity/README.md

+From our benchmarking, we see a 1.1x speedup when running with SEGMENT_ANYTHING_FAST_USE_FLASH_4 enabled.
+
+```
+python benchmarks/benchmark_sam.py


Put a direct link to the benchmarks script

msaroufim · 2024-06-03T14:40:38Z

torchao/sparsity/README.md

+```
+python benchmarks/benchmark_sam.py
+
+   block_only  batchsize           dtype  compile                  qkv                 proj                 lin1                 lin2         time     memory      img/s


Can you format this as a table it's a bit hard to read because I have to scroll all the way to the right? Also what does lin1 and lin2 mean? Presumably memory is in GB?

Also I feel like a quick sentence before this table along the lines of we have support for SparseSemiStructuredTensor we can apply it to each of the qkv of attention or just the proj and here's how to apply it would be helpful

- Updated benchmark script for standalone sparse numbers. - Switched from segment-anything to segment-anything-fast - Updated README with results for segment-anything and BERT

jcaip added 2 commits June 3, 2024 05:38

update sam script

8f589a4

updated readme with results

243479c

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jun 3, 2024

fix formatting

94c2ecb

msaroufim requested changes Jun 3, 2024

View reviewed changes

jcaip added 2 commits June 3, 2024 07:51

cr feedback

a530f78

more cr feedback

e555fca

msaroufim approved these changes Jun 3, 2024

View reviewed changes

jcaip added 4 commits June 3, 2024 07:56

more cr feedback

6df5456

more cr feedback

de20761

add code ticks

1f265c4

sigfigs

1fbea59

jcaip merged commit e5d27a3 into main Jun 3, 2024
13 checks passed

yanbing-j pushed a commit to yanbing-j/ao that referenced this pull request Dec 9, 2024

handle path is None (pytorch#303)

21834b9

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

sparse benchmarking numbers #303

sparse benchmarking numbers #303

jcaip commented Jun 3, 2024

pytorch-bot bot commented Jun 3, 2024 •

edited

Loading

msaroufim Jun 3, 2024

msaroufim Jun 3, 2024

msaroufim Jun 3, 2024

msaroufim Jun 3, 2024

msaroufim Jun 3, 2024

		import pandas as pd
		from segment_anything_fast import sam_model_registry


		#### BERT

		We were able to accelerate BERT 1.23x with a negligible accuracy drop on SQuAD.

	The results mentioned in the REAADME of the repo compose sparsity with a suite of other inference acceleration techniques.
	The results mentioned in the README of the repo compose sparsity with a suite of other inference acceleration techniques.

sparse benchmarking numbers #303

sparse benchmarking numbers #303

Conversation

jcaip commented Jun 3, 2024

pytorch-bot bot commented Jun 3, 2024 • edited Loading

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/303

✅ No Failures

msaroufim Jun 3, 2024

Choose a reason for hiding this comment

msaroufim Jun 3, 2024

Choose a reason for hiding this comment

msaroufim Jun 3, 2024

Choose a reason for hiding this comment

msaroufim Jun 3, 2024

Choose a reason for hiding this comment

msaroufim Jun 3, 2024

Choose a reason for hiding this comment

pytorch-bot bot commented Jun 3, 2024 •

edited

Loading