- 
                Notifications
    You must be signed in to change notification settings 
- Fork 13.4k
ggml : implement REGLU/GEGLU/SWIGLU ops #14158
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
          
     Merged
      
        
      
    
  
     Merged
                    Changes from all commits
      Commits
    
    
            Show all changes
          
          
            23 commits
          
        
        Select commit
          Hold shift + click to select a range
      
      76c9bc1
              
                implement unary REGLU/GEGLU/SWIGLU cpu ops
              
              
                CISC 5a490f0
              
                relax constraints
              
              
                CISC 56c7993
              
                duplicate shape of source
              
              
                CISC 21c4963
              
                fix ggml_vec_geglu_f16
              
              
                CISC bb2fda7
              
                special case gated ops
              
              
                CISC a1a7b6d
              
                implement unary REGLU/GEGLU/SWIGLU cuda ops
              
              
                CISC f8c2080
              
                tighten constraints again
              
              
                CISC a341aa3
              
                refactor into GGML_GLU_OP
              
              
                CISC d9ddeb9
              
                metal : add glu kernels
              
              
                ggerganov cfa9c7a
              
                add CUDA_GLU_BLOCK_SIZE [no ci]
              
              
                CISC 70e8b48
              
                more constraints and use 64bit ints
              
              
                CISC f8705a2
              
                64bit multiplication [no ci]
              
              
                CISC 0b2703f
              
                implement swapped variants (cpu/cuda)
              
              
                CISC d593429
              
                update comment [no ci]
              
              
                CISC 34d1aed
              
                Vulkan: Add GLU ops and shaders
              
              
                0cc4m a9aedf4
              
                SYCL: Implement fused kernel GEGLU, SWIGLU and REGLU for single up+gate
              
              
                qnixsynapse 35dacd1
              
                ggml : implement GLU for split up/gate (#14181)
              
              
                CISC a234e09
              
                GGML: increase OP count in assertion
              
              
                qnixsynapse ab46d11
              
                Refactor: Optimize SYCL element-wise operations with unary function i…
              
              
                qnixsynapse 650d398
              
                vulkan: Increase workgroup size for GLU, for performance (#14345)
              
              
                jeffbolznv 8bca3e2
              
                merge fix
              
              
                CISC a6b641e
              
                Merge branch 'master' into cisc/unary-reglu-geglu-swiglu
              
              
                CISC 9054f1a
              
                metal : add support for split and swap
              
              
                ggerganov File filter
Filter by extension
Conversations
          Failed to load comments.   
        
        
          
      Loading
        
  Jump to
        
          Jump to file
        
      
      
          Failed to load files.   
        
        
          
      Loading
        
  Diff view
Diff view
There are no files selected for viewing
  
    
      This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
      Learn more about bidirectional Unicode characters
    
  
  
    
              
  
    
      This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
      Learn more about bidirectional Unicode characters
    
  
  
    
              
      
      Oops, something went wrong.
        
    
  
      
      Oops, something went wrong.
        
    
  
  Add this suggestion to a batch that can be applied as a single commit.
  This suggestion is invalid because no changes were made to the code.
  Suggestions cannot be applied while the pull request is closed.
  Suggestions cannot be applied while viewing a subset of changes.
  Only one suggestion per line can be applied in a batch.
  Add this suggestion to a batch that can be applied as a single commit.
  Applying suggestions on deleted lines is not supported.
  You must change the existing code in this line in order to create a valid suggestion.
  Outdated suggestions cannot be applied.
  This suggestion has been applied or marked resolved.
  Suggestions cannot be applied from pending reviews.
  Suggestions cannot be applied on multi-line comments.
  Suggestions cannot be applied while the pull request is queued to merge.
  Suggestion cannot be applied right now. Please check back later.
  
    
  
    
Uh oh!
There was an error while loading. Please reload this page.