Skip to content

ROCm 2.9 merge develop into master#690

Merged
amcamd merged 44 commits into
ROCm:masterfrom
amcamd:master
Sep 11, 2019
Merged

ROCm 2.9 merge develop into master#690
amcamd merged 44 commits into
ROCm:masterfrom
amcamd:master

Conversation

@amcamd
Copy link
Copy Markdown
Contributor

@amcamd amcamd commented Sep 11, 2019

No description provided.

amcamd and others added 30 commits August 7, 2019 17:00
merge master into develop for ROCm 2.8
* Changes to CMake to support changes in Tensile

* Fixing transposed arguments.

* Adding missing dependency

* Adding additional required dependencies and CMake flags.

* Fixing transposed arguments.
1. complex gemm(_ex) and gemm_strided_batched(ex) implementation
2. rocblas test and benchmark for complex
…all script (ROCm#646)

* Adding OpenMP and pthreads to rocBLAS cmake files

* added llvm to dockerfile
add tuned logic files for resnet and inception sizes
* Remove \0s from xargs input, to avoid corrupted characters
a separate client package can be built optionally with make package_clients
Pinning Tensile version to allow for stable Jenkins runs
* Enable unit tests for gemv_batched and gemv_strided_batched

- Add function templates for gemv_strided_batched and gemv_batched (in rocblas_gemv.hpp)
  to enable correct calls of these functions from other functions or from outside rocblas.
- Add batch and strides checks and quick return in rocblas_gemv_batched.cpp and rocblas_gemv_strided_batched.cpp
- Add unit tests testing_gemv_batched.hpp and testing_gemv_strided_batched.hpp
- Add new class device_batch_vector in rocblas_vector.hpp. Needed for the batched case.
- Add new template headers to rocblas.hpp
- Add new template header and especializations for norm_check_general to work with the batched case (in norm.hpp and norm.cpp)
- Add new template and espcializations for unit_check_general to work with the batched case (in unit.hpp)
- Add new arguments, stride_x and stride_y (needed to test gemv_strided_batched) in rocblas_arguments.hpp and
  rocblas_common.yaml. Set stride_x and stride_y defaults to zero in rocblas_common.yaml to correctly generate the tests of
  those functions that do not need these arguments
- Include the new tests in client.cpp as well as a description of the new arguments
- Add the new functions in rocblas_template.yaml to process YAML from log files
- Add batched and strided_batched template test cases in gemv_gtest.cpp
- Add new yaml test-data files gemv_batched_gtest.yaml and gemv_strided_batched_gtest.yaml
- Include the new yaml files in rocblas_gtest.yaml
- Add the new yaml files to the list of dependencies for rocblas_gtest.data in CMakeLists.txt

* Clang formatting

* Resolve merge conflicts

* clang formating

* Correct bugs in gemv complex
* Updating to new Tensile cmake

* Updating to latest Tensile tag
* Addition of rocblas_half and rocblas_bfloat16 precisions for dot.
Switching to use BLIS as the CPU reference library and reduce test duration significantly
Added rot, rotg, rotm, rotmg and test code, real and complex.
…s_abs() function (ROCm#678)

* Replace explicit conversion of bloat16 to float and double with implicit conversion of bfloat16 to float

* Fix std::abs for rocblas_bfloat16

* Change to using rocblas_abs instead of std::abs for when __device__ and __host__ are both required
@amcamd amcamd merged commit 5a107fa into ROCm:master Sep 11, 2019
mlse-lib-jenkins pushed a commit that referenced this pull request May 21, 2021
* Removing g_ prefix.

* Adding w_ prefix.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.