-
-
Notifications
You must be signed in to change notification settings - Fork 873
BitsandBytes Enablement on ROCm #1207
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Titus-von-Koeller
merged 132 commits into
bitsandbytes-foundation:multi-backend-refactor
from
ROCm:device_abstraction
May 24, 2024
+6,028
−54
Merged
Changes from 131 commits
Commits
Show all changes
132 commits
Select commit
Hold shift + click to select a range
2e10f67
hipify the csrc repo
Lzy17 1928960
hipify pythoninterface
Lzy17 8ca0b5c
copy from agrocylo
Lzy17 8acbcf2
hipify cuparse and cublas calls
Lzy17 e80a60c
fix compile error and Makefile
Lzy17 fb780a0
fixed runtime error (low accuracy)
Lzy17 1048264
FIX LOW ACCURACY
Lzy17 c330020
Update README.md
Lzy17 fcee2d6
add benchmarks
Lzy17 4c0ca08
Update README.md
Lzy17 c798616
First draft, getting error
jpvillam-amd 37045e5
Small transform fix, still errors on igemm
jpvillam-amd 524fa57
create HIP_ENVIRONMENT variable
pnunna93 d7f7a82
Skip failing tests on rocm
pnunna93 28b8056
Add default value for HIP_ENVIRONMENT
pnunna93 9dca4fa
Merge pull request #1 from ROCmSoftwarePlatform/skip_rocm_failing_tests
amathews-amd 38c934e
skip failing triton tests on rocm
pnunna93 71bf2df
Merge pull request #2 from ROCmSoftwarePlatform/skip_triton
amathews-amd 657ca4b
Enable col to row transformation
pnunna93 a390e0c
Add make functions for row to col transformation
pnunna93 99ad6b5
Update get_transform_buffer for row to col in HIP
pnunna93 039b808
Update igemmlt for col format
pnunna93 1a052ee
Unskip test_igemmlt_int on ROCm
pnunna93 b7ca5cf
Update igemmlt_int test for col inputs
pnunna93 a2cd90d
Skip transpose igemmlt test on ROCm
pnunna93 5b6c5ac
Revert "Update igemmlt_int test for col inputs"
pnunna93 218bf66
Return nvidia_transform from transform for HIP
pnunna93 8bb5c2f
Fix syntax error
pnunna93 eb2edf7
Add comment for shape change
pnunna93 a38ea0f
Enable nvidia_transform tests
pnunna93 fbacd7a
Merge branch 'fix_igemmlt_int' of https://github.com/pnunna93/bitsand…
pnunna93 67c383b
Enable igemmlt_half tests
pnunna93 42b860f
Revert col32 check in nvidia_transform test
pnunna93 7198d6b
Merge pull request #3 from pnunna93/fix_igemmlt_int
amathews-amd b1d484a
Merge remote-tracking branch 'upstream/main' into IFU-master-2024-01-24
pnunna93 c36085d
Update README.md
Lzy17 0e91e48
Update hip files with upstream changes
pnunna93 1295d53
Skip failing tests for now
pnunna93 48b7fa9
Merge pull request #4 from ROCm/IFU-master-2024-01-24
amathews-amd f1a0b8b
ops.hip: adapt to enum naming changes in ROCm/hipBLASLt@95131d6 and R…
iiisak a84c369
fix wmma api parity
Lzy17 b044010
hipify wmma datatype
Lzy17 7aa42be
Enable estimate quantile tests
pnunna93 85377e1
Merge pull request #5 from iiisak/rocm_enabled
pnunna93 ffb0c5d
Merge pull request #7 from ROCm/fix_estimate_quantiles
amathews-amd 2b77380
Merge pull request #6 from ROCm/rocwmma_merge
Lzy17 fad7918
Enable transpose flag for row to col transform
pnunna93 e3021ee
Update descriptors for transpose flag
pnunna93 8c3476f
revert nvidia_transform to transform
pnunna93 5e1b152
update changes
386e16c
Merge pull request #8 from ROCm/enable_transform_with_transpose
pnunna93 389bb7d
fixed minor mistakes
b6770bf
Merge pull request #9 from ROCm/rocm_enabled_fix_bfloat16
pnunna93 fa28828
remove blocksize 64 on rocm
pnunna93 d86d24c
remove block size 64 and enable remaining tests
pnunna93 cf4a506
Fix cuda build errors
pnunna93 7077195
remove workspace in igemmlt
pnunna93 ec32fc1
Enabled igemmlt in matmul
pnunna93 4536b25
Fix shape issue in transform function
pnunna93 66e34c1
Enable igemmlt int8 output
pnunna93 7e5e223
Add col format for extract outliers
pnunna93 2e42adb
Enable dequant_mm
pnunna93 e32d277
Enable matmullt tests
pnunna93 8206bd1
Enabled linear_serialization tests
pnunna93 973a9f8
fix error with dequant_mm change
pnunna93 387a9b7
Enable extract outliers test
pnunna93 93dfb51
Enable test overflow
pnunna93 90bbdc6
Skip overflow and linear serialization for now
pnunna93 9890d5d
Merge pull request #10 from ROCm/remove_blocksize_64
pnunna93 1b6dd48
Merge pull request #11 from ROCm/fix_cuda_build_errs
pnunna93 fc9bf4d
Merge pull request #12 from ROCm/igemm_workspace
pnunna93 f30dc38
Merge pull request #13 from ROCm/enable_matmul
pnunna93 3dc14e8
improve the gemv 4bit accuracy by forcing the hipcub to 32
f4ac9ac
Merge pull request #14 from ROCm/fix_gemv_4bit
Lzy17 485ba8f
Update skip comment
pnunna93 a36bd1d
Merge pull request #15 from ROCm/gemv_skip_comment
pnunna93 a551c16
Merge remote-tracking branch 'upstream/main' into IFU-master-2024-03-28
pnunna93 a267221
update instructions
bcdcc0b
Merge pull request #19 from ROCm/updated_readme
amathews-amd ff33371
Update README.md
pnunna93 1157e73
Merge branch 'rocm_enabled' into IFU-master-2024-03-28
pnunna93 702ca1a
fix PEP errors
pnunna93 8c23dc0
Fix typos
pnunna93 971f4b1
Merge branch 'IFU-master-2024-03-28' of https://github.com/ROCm/bitsa…
pnunna93 4d6408a
Fix formatting in README file
pnunna93 79cb554
Update gpu arch setting
pnunna93 5c0414e
Add ROCM_PATH variable
pnunna93 47795f5
Add HIP_VERSION variable
pnunna93 6d90452
Add BNB_HIP_VERSION variable
pnunna93 049a2dc
Update supports igemmlt based on HIP version
pnunna93 47a0bc3
Skip failing tests based on HIP version
pnunna93 1b2a095
pre-commit fixes
pnunna93 4515a21
Update README file
pnunna93 e7ef75f
Update default arch list
pnunna93 c0d244c
update readme
pnunna93 c037a30
Merge pull request #17 from ROCm/IFU-master-2024-03-28
lcskrishna 73f4f05
Merge remote-tracking branch 'TD_BnB/multi-backend-refactor' into dev…
pnunna93 79652a5
update igemmlt for hip
pnunna93 aedfa8f
Update mm_dequant for hip
pnunna93 7835282
Update transform function for hip
pnunna93 60d7560
adding arch detection for test_gemv_eye_4bit
cae33c3
implement get_rocm_gpu_arch
da53f39
fixing lint
ae4dcec
fixing lint
21d5ff6
correct lint error
5bada9b
Merge pull request #21 from ROCm/rocm_enabled_arch_detect
pnunna93 01abfde
Merge branch 'rocm_enabled' into device_abstraction
pnunna93 765bfc8
update extract_outliers, quantize_4bit, dequantize_4bit
lcskrishna d00c026
minor fixes for extract_outliers
lcskrishna e5574bd
update blocksizes for quantize and dequantize
lcskrishna a00bd1f
Merge branch 'rocm_enabled' of https://github.com/ROCm/bitsandbytes i…
7ab3a05
update reg expression for detecting arch
lcskrishna 9cd1d8c
linter updates
lcskrishna 62f8ed9
Merge branch 'device_abstraction' into cl/update-device-abs
lcskrishna d9e4803
Merge pull request #23 from ROCm/cl/update-device-abs
pnunna93 2af8568
Merge remote-tracking branch 'upstream/multi-backend-refactor' into d…
pnunna93 06f6b25
skip linear no igemmlt test
pnunna93 2359452
Remove archive functional file
pnunna93 f76d6ab
Sync README with upstream
pnunna93 576b62c
Remove bnb_accuracy file
pnunna93 dfb531b
Remove cuda_setup
pnunna93 31b1cbc
Remove test_delete_later.c
pnunna93 ed77476
Sync with upstream
pnunna93 943c57a
Sync files with upstream
pnunna93 71d1702
Fix lint errors
pnunna93 6886bc8
Exclude hip files from typo checks
pnunna93 0d445f4
update ops.hip
pnunna93 bc6d0b7
Merge pull request #27 from ROCm/dev_abs_IFU
lcskrishna 15c7f77
Add install steps for ROCm
pnunna93 d62c835
Fix lint error
pnunna93 8aae7c9
Merge pull request #28 from ROCm/dev_abs_add_install_steps
lcskrishna 410f499
Add comments for HIP changes
pnunna93 File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
|
|
@@ -21,3 +21,4 @@ repos: | |
| rev: v1.18.2 | ||
| hooks: | ||
| - id: typos | ||
| exclude: ^.*\.hip$ | ||
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is there a short explanation we can add here to explain why this is the default, and likewise below why 64 is not supported?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Its because of warpsize difference between AMD and NVIDIA GPUs. I have added comments - 410f499