Create Highlevel Bindings for FP8 Datatype #788

isVoid · 2026-02-11T22:11:55Z

This PR adds high level exposure of fp8 data type to Numba-CUDA.

Supported features include:

FP8 constructors from existing data type (elementwise and packed)
Conversion intrinsics that provide finer control of saturation type

Supported FP8 variants (element wise):

fp8_[e5m2, e4m3, e8m0]

Supported packed FP8 variants:

fp8[x2, x4]_[e5m2, e4m3, e8m0]

This PR also adds tests for packed type bindings introduced in #686.

closes #200

greptile-apps · 2026-02-11T22:11:59Z

Automatic reviews are disabled for this repository.

copy-pr-bot · 2026-02-11T22:11:59Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

isVoid · 2026-02-11T22:39:10Z

numba_cuda/numba/cuda/fp8.py

+from numba.cuda.extending import register_jitable
+
+
+@register_jitable


The use of @register_jittable is recommended by code agents. Is this a good choice in today's Numba?

It only makes sense if you want to call the function as a pure Python function.

(So register_jitable just makes no sense in the context of Numba-CUDA)

This PR shows that register_jittable may enable jitting function that's not directly callable in pure python. As shown by

@register_jitable def bfloat16_to_e8m0(x, saturate, rounding): return _cvt_bfloat16raw_to_e8m0( _bfloat16_as_bfloat16_raw(x), saturate, rounding )

Where _bfloat16_as_bfloat16_raw is written as numba intrinsics and may not be called with these arguments as-is.

Proposing updating the docstring of register_jittable and keeping the function.

isVoid · 2026-02-12T01:54:57Z

/ok to test 894b79d

isVoid added 5 commits February 10, 2026 20:42

initial

1689211

add additional packed type

3f7a3fe

add packed type test and doc

f3210ff

update high level enum name and docs

76ffbde

exposes bfloat16 to fp8 conversion directly

894b79d

isVoid commented Feb 11, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Create Highlevel Bindings for FP8 Datatype #788

Create Highlevel Bindings for FP8 Datatype #788

isVoid commented Feb 11, 2026 •

edited

Loading

Uh oh!

greptile-apps bot commented Feb 11, 2026

Uh oh!

copy-pr-bot bot commented Feb 11, 2026

Uh oh!

isVoid Feb 11, 2026

Uh oh!

gmarkall Feb 12, 2026

Uh oh!

gmarkall Feb 12, 2026

Uh oh!

isVoid Feb 12, 2026 •

edited

Loading

Uh oh!

isVoid commented Feb 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		from numba.cuda.extending import register_jitable


		@register_jitable

Create Highlevel Bindings for FP8 Datatype #788

Are you sure you want to change the base?

Create Highlevel Bindings for FP8 Datatype #788

Conversation

isVoid commented Feb 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

greptile-apps bot commented Feb 11, 2026

Uh oh!

copy-pr-bot bot commented Feb 11, 2026

Uh oh!

isVoid Feb 11, 2026

Choose a reason for hiding this comment

Uh oh!

gmarkall Feb 12, 2026

Choose a reason for hiding this comment

Uh oh!

gmarkall Feb 12, 2026

Choose a reason for hiding this comment

Uh oh!

isVoid Feb 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

isVoid commented Feb 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

isVoid commented Feb 11, 2026 •

edited

Loading

isVoid Feb 12, 2026 •

edited

Loading