-
Notifications
You must be signed in to change notification settings - Fork 3k
Pull requests: microsoft/onnxruntime
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[js/node] allow arenaExtendStrategy and gpuMemLimit option for CUDA EP
#23176
opened Dec 21, 2024 by
nomagick
Loading…
[js/web] Add package export tests for onnxruntime-web
#23175
opened Dec 20, 2024 by
fs-eire
Loading…
[QNN EP] Fix multithread sync bug in ETW callback
#23156
opened Dec 19, 2024 by
adrianlizarraga
Loading…
[WebNN] Support SkipSimplifiedLayerNormalization op
ep:WebNN
WebNN execution provider
#23151
opened Dec 19, 2024 by
Honry
Loading…
[webgpu] support Pad operator
ep:WebGPU
ort-web webgpu provider
#23141
opened Dec 18, 2024 by
xhcao
Loading…
Make MultiHeadAttention op return attention probabilities
#23125
opened Dec 16, 2024 by
amancini-N
Loading…
[QNN EP] [DRAFT] Make QNN EP a shared library
ep:QNN
issues related to QNN exeution provider
#23120
opened Dec 16, 2024 by
adrianlizarraga
•
Draft
Implements Slice Operator for WebGPU Native
ep:WebGPU
ort-web webgpu provider
#23106
opened Dec 13, 2024 by
prathikr
Loading…
[js/webgpu] Optimize matmulnbits with M > 1
ep:WebGPU
ort-web webgpu provider
#23092
opened Dec 12, 2024 by
qjia7
Loading…
Implement some missing element wise Add/Sub/Mul/Div/Neg operations for CPU and CUDA EPs
#23090
opened Dec 12, 2024 by
Zyrin
Loading…
[WebNN EP] Automatically move input CPU tensors to ml-tensor
ep:WebNN
WebNN execution provider
#23073
opened Dec 11, 2024 by
egalli
Loading…
Improves 2d tiled matmulnbits by repeating A, loads N times for each B load
ep:WebGPU
ort-web webgpu provider
#23071
opened Dec 10, 2024 by
sushraja-msft
Loading…
Upgrade Java version from react-native/android to Java 17
#23066
opened Dec 10, 2024 by
jchen351
Loading…
Previous Next
ProTip!
Follow long discussions with comments:>50.