Add NANOO FP8 Support #2695

ScXfjiang · 2024-09-30T20:28:55Z

This PR adds tf.experimental.float8_e4m3fnuz and tf.experimental.float8_e5m2fnuz as public tensorflow data types.

similar work:

wenchenvincent · 2024-10-03T19:05:42Z

tensorflow/core/framework/tensor.cc

@@ -563,6 +563,14 @@ struct ProtoHelper<float8_e5m2> : public Float8ProtoHelper<float8_e5m2> {};
 template <>
 struct ProtoHelper<float8_e4m3fn> : public Float8ProtoHelper<float8_e4m3fn> {};

+template <>


Nit: Format the code the same way as that for ocp fp8?

I believe it's auto-formatted by clang-format.
https://github.com/ROCm/tensorflow-upstream/blob/develop-upstream/tensorflow/.clang-format

wenchenvincent · 2024-10-03T19:08:59Z

tensorflow/python/framework/tensor_util_test.py

@@ -271,8 +271,8 @@ def testBfloat16(self):
  def testFloat8e5m2(self):
    test_type = dtypes.float8_e5m2.as_numpy_dtype
    t = tensor_util.make_tensor_proto(np.array([10.0, 20.0], dtype=test_type))
-    # 10.0: "I" = 73 = 10010 01: 2^(18 - 15) * (1 + 1/4)
-    # 20.0: "M" = 77 = 10011 01: 2^(19 - 15) * (1 + 1/4)
+    # 10.0: "I" = 73 = 0 10010 01: 2^(18 - 15) * (1 + 1/4)


Nit: Why do we need the extra 0 here? For more clarity?

Updated. (I just wanted to display all bit positions of FP8. But I agree that our repo should be consistent with the upstream repo as possible as we can, for less pain in the weekly sync.)

wenchenvincent

LGTM.

Add NANOO FP8 Support To OPs

ScXfjiang force-pushed the dev_nanoo_fp8 branch from 7c84244 to 77f6214 Compare September 30, 2024 21:10

ScXfjiang added 11 commits October 1, 2024 20:40

tensorflow/core/

1129ff1

revert version

7308cb2

tensorflow/go/

5d7c925

tensorflow/c/

44c2e03

other files

c46fb94

fix typo

929b637

tensorflow/python/

1086598

update tensorflow/python/

069a82c

update in third_party xla

18fc45a

tensorflow/compiler/

b4b5462

update comment and revert format

8ecf02f

ScXfjiang force-pushed the dev_nanoo_fp8 branch from 6aa7078 to 8ecf02f Compare October 1, 2024 20:42

format

7a6169d

ScXfjiang marked this pull request as ready for review October 1, 2024 21:16

ScXfjiang requested a review from wenchenvincent October 1, 2024 21:17

ScXfjiang added 5 commits October 2, 2024 12:58

fix dtypes_test.py

30a1cdb

switch the order of "e4m3fnuz" and "e5m2fnuz"

324dd30

fix tensor_util_test

c1b4b5c

fix typo

c73c18f

update comments

614a507

wenchenvincent reviewed Oct 3, 2024

View reviewed changes

wenchenvincent approved these changes Oct 3, 2024

View reviewed changes

ScXfjiang added 4 commits October 4, 2024 10:36

remove sign bit

ecc342f

remove unnecessary op support in this PR

5f3a6d7

cast op

fb6246a

Merge pull request #2702 from ROCm/dev_nanoo_fp8_ops

b5fb570

Add NANOO FP8 Support To OPs

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add NANOO FP8 Support #2695

Add NANOO FP8 Support #2695

ScXfjiang commented Sep 30, 2024 •

edited

Loading

wenchenvincent Oct 3, 2024

ScXfjiang Oct 4, 2024

wenchenvincent Oct 3, 2024 •

edited

Loading

ScXfjiang Oct 4, 2024

wenchenvincent left a comment

Add NANOO FP8 Support #2695

Are you sure you want to change the base?

Add NANOO FP8 Support #2695

Conversation

ScXfjiang commented Sep 30, 2024 • edited Loading

wenchenvincent Oct 3, 2024

Choose a reason for hiding this comment

ScXfjiang Oct 4, 2024

Choose a reason for hiding this comment

wenchenvincent Oct 3, 2024 • edited Loading

Choose a reason for hiding this comment

ScXfjiang Oct 4, 2024

Choose a reason for hiding this comment

wenchenvincent left a comment

Choose a reason for hiding this comment

ScXfjiang commented Sep 30, 2024 •

edited

Loading

wenchenvincent Oct 3, 2024 •

edited

Loading