Skip to content

Conversation

@Ronnoc
Copy link

@Ronnoc Ronnoc commented Jun 20, 2023

In get_compute_capability, the last item of ccs (ccs[-1]) will be selected as the maximum compute capability.
But ccs are not sorted in get_compute_capabilities
This PR adds sorting to get_compute_capabilities

@Ronnoc Ronnoc marked this pull request as ready for review June 20, 2023 06:57
@Ronnoc
Copy link
Author

Ronnoc commented Jun 20, 2023

@TimDettmers Please help review this Pull Request. Thank you.

Copy link
Contributor

@Dessix Dessix left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Try sorting using sort-by functionality.

@github-actions
Copy link

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

@github-actions github-actions bot closed this Dec 29, 2023
@TimDettmers
Copy link
Collaborator

Thank you for catching this! I think a question is if we should select the highest or lowest compute capability. The lowest would give the most broad support for multiple GPU. However, it is common that one uses an old GPU for monitors and a new GPU for deep learning work. As such, one would want to select the highest compute capability. @Titus-von-Koeller @younesbelkada any thoughts?

@TimDettmers TimDettmers reopened this Jan 1, 2024
@TimDettmers
Copy link
Collaborator

I will merge #703 instead as it is more comprehensive, but lets chat a bit about which how we should sort the compute capability

@Ronnoc
Copy link
Author

Ronnoc commented Jan 18, 2024

closed this PR as it's included in another one

@Ronnoc Ronnoc closed this Jan 18, 2024
@Titus-von-Koeller
Copy link
Collaborator

I think a question is if we should select the highest or lowest compute capability. The lowest would give the most broad support for multiple GPU. However, it is common that one uses an old GPU for monitors and a new GPU for deep learning work. As such, one would want to select the highest compute capability

I agree, highest is a good choice. We'll see if this is contrary to people's needs or expectations. If that is the case, we can add some logic to make it configurable.

Thanks a lot @Ronnoc for your contribution!

akx added a commit to akx/bitsandbytes that referenced this pull request Jan 30, 2024
The "sort compute capabilities" fix from bitsandbytes-foundation#703 (bitsandbytes-foundation#527) would actually do nothing due to this.
@akx akx mentioned this pull request Jan 30, 2024
Titus-von-Koeller pushed a commit that referenced this pull request Feb 4, 2024
* Diagnostics: streamline debug printing code

* CUDA setup: Remove unused `backup_paths`

* CUDA setup: DRY OS detection

* CUDA setup: Streamline `manual_override()`

* CUDA setup: Use comment instead of string literal, simplify

* CUDA setup: remove duplicate sort

The "sort compute capabilities" fix from #703 (#527) would actually do nothing due to this.

* CUDA setup: make version number replacement logic more obvious
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants