Popular repositories
-
nnvm-vision-demo (Public): Demos interesting image-in, image-out networks running on both NVIDIA and AMD GPUs, with NNVM
-
tvm-winograd (Public): Test winograd convolution written in TVM for CUDA and AMDGPU
-
mxnet-cpp-inference (Public): Test MXNet C++ API for doing inference, given a trained model
95 contributions in the last year
Contribution activity
April 2025
Created 1 commit in 1 repository
Created a pull request in triton-lang/triton that received 42 comments
[TritonGPU] Enable accum-init optimization for unconditionally zero-ed accumulators
Currently, the pass doesn't fire when there is no explicit op that conditionally clears the accumulator. Thus, it misses the simplest case where th…
+136 −78 lines changed • 42 comments
Reviewed 7 pull requests in 1 repository
triton-lang/triton
7 pull requests
-
[Blackwell] Support DescriptorLoadOp when deciding to use shared memory for scales
This contribution was made on Apr 24
-
[Blackwell] Decouple tcgen05.commit from tcgen05.mma and tcgen05.cp ops
This contribution was made on Apr 12
-
[TUTORIAL] Replace legacy host side TMA with TensorDescriptor
This contribution was made on Apr 11
-
[Feature] Support different packing formats in dot_scaled op
This contribution was made on Apr 8
-
[TritonGPU] Enable accum-init optimization for unconditionally zero-ed accumulators
This contribution was made on Apr 7
-
[NVWS] Add initial version of Aref Lowering
This contribution was made on Apr 5
-
[TUTORIAL] Cleanup persistent matmul tutorial
This contribution was made on Apr 4