This repository has been archived by the owner on Nov 17, 2023. It is now read-only.

First batch of cherry-picks to 1.6 branch (#16711, #16737, #16408) #16763

Merged 4 commits on Nov 8, 2019

Commits on Nov 8, 2019

  1. 3026bae
  2. [MKLDNN] use dim_t instead of int in slice/transpose operators (apache#16737)
    
    * use dim_t instead of int
    
    * fix same issue in pooling
    
    * rebase code
    
    * trigger CI
    wuxun-zhang authored and ptrendx committed Nov 8, 2019
    f93f69c
  3. Add MXNet Ops for fast multihead attention (apache#16408)

    * add MXNet Ops for fast multihead attention
    
    * add cutlass as 3rdparty dependency
    
    * add cutlass to compilation flags
    
    * remove all cutlass stuff
    
    * add better error message and description and remove cutlass from compilation flags
    
    * change credit for the approach since the code has changed
    
    * fix typos
    
    * correct another typo
    
    * Add all the cuda/cublas helper functions
    
    * remove tests using kAddTo
    
    * only use cublasStridedBatchedGemm if CUDA >= 9.1
    
    * add equivalent mxnet code in description of mha ops
    
    * remove a wrong copy-paste
    
    * add _contrib for namespace and add GPU only on description
    
    * add warning in bwd_ignore_zero_init description, also test with fp32
    
    * add error return if bwd_ignore_zero_init is used without MXNET_EXEC_ENABLE_ADDTO
    
    * remove std::move for clang
    
    * remove bwd_ignore_zero_init flag
    
    * remove bwd_ignore_zero_init in test_operator_gpu.py
    
    * fix typo
    
    * fix another typo
    Caenorst authored and ptrendx committed Nov 8, 2019
    0af550a
  4. Removed unrelated test

    ptrendx committed Nov 8, 2019
    2df187c
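
Two of the commit notes above ("use dim_t instead of int" and "only use cublasStridedBatchedGemm if CUDA >= 9.1") can be illustrated with a small standalone sketch. This is not the code from these commits. It assumes dim_t is a 64-bit signed integer (aliased locally here), and the helper names fits_in_int, num_elements, cuda_version_code and use_strided_batched_gemm are hypothetical, introduced only for illustration. The first half shows why a shape computation kept in a wide type avoids truncation to int; the second shows a version check of the kind the CUDA >= 9.1 note describes, using the major * 1000 + minor * 10 encoding of CUDA_VERSION.

```cpp
#include <cassert>
#include <cstdint>
#include <limits>
#include <vector>

// Assumption: dim_t is a 64-bit signed integer; this alias is local to the sketch.
using dim_t = std::int64_t;

// True if a dimension value can be narrowed to int without losing information.
constexpr bool fits_in_int(dim_t v) {
  return v >= std::numeric_limits<int>::min() &&
         v <= std::numeric_limits<int>::max();
}

// Element count accumulated in dim_t, so large shapes are not truncated to int.
inline dim_t num_elements(const std::vector<dim_t>& shape) {
  dim_t n = 1;
  for (dim_t d : shape) {
    assert(d >= 0);  // negative dimensions are treated as a caller error here
    n *= d;
  }
  return n;
}

// CUDA_VERSION in cuda.h is encoded as major * 1000 + minor * 10 (9.1 -> 9010).
constexpr int cuda_version_code(int major, int minor) {
  return major * 1000 + minor * 10;
}

// Hypothetical helper: decide whether the strided-batched GEMM path is allowed
// for a given encoded CUDA version, mirroring the "CUDA >= 9.1" condition.
constexpr bool use_strided_batched_gemm(int cuda_version) {
  return cuda_version >= cuda_version_code(9, 1);
}
```

In real code the version check would more likely be a preprocessor guard such as #if CUDA_VERSION >= 9010 around the cuBLAS call. The constexpr form is used here only so the sketch compiles without the CUDA toolkit.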