Skip to content

Conversation

@jeffra
Copy link
Collaborator

@jeffra jeffra commented Jan 4, 2021

When installing deepspeed with torch-cpu installed we sometimes (incorrectly) detected sparse attention was compatible when it is not. This would trigger a triton install which would fail. This came from ITP when trying to install deepspeed elasticity SDK. In the future I think we might need to add a new deepspeed-elastic repo/package since the SDK also does not need torch at all and we're getting complaints about that too. This is a bandaid to get it working for them though.

@jeffra jeffra merged commit 24e0739 into master Jan 4, 2021
@jeffra jeffra deleted the jeffra/sparse-attn-check branch January 4, 2021 20:11
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants