nanoGPT FP8 compilation failed #353

yizhuoz004 · 2024-11-08T20:41:39Z

Error:

(t5799)error: result at index 0 has type 'tensor<1x?x768xf8E4M3FN>', but decomposition has type 'tensor<?x?x?xf8E4M3FN>'

christopherbate · 2024-11-16T08:08:35Z

This one is a simple fix. I'll make sure it gets synced up here tomorrow.

pranavm-nvidia · 2024-12-13T19:21:51Z

I have a draft PR to enable float8, but I'm still seeing the same error:

    (t4723)error: result at index 0 has type 'tensor<1x?x768xf8E4M3FN>', but decomposition has type 'tensor<?x?x?xf8E4M3FN>'

    This error occured while trying to compile the following FlatIR expression:
          |
          | t4723: [rank=(3), shape=((-1, -1, -1)), dtype=(float8), loc=(gpu:0)] = ConvertOp(t_inter1106)
          | 


    Note: This originated from the following expression:

    --> /tripy/tripy/frontend/module/linear.py:136 in __call__()
          |
      136 |                 q_x = quantize(x, self.input_scale, self.quant_dtype)
          |                       ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
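
For readers comparing the two types in the message: the result type `1x?x768` is a more specific version of the decomposition type `?x?x?`, so the two are compatible but not identical. The sketch below only illustrates that distinction in plain Python. It is not the compiler's actual check, and the helper names `parse_dims` and `is_refinement` are hypothetical. It assumes the error comes from a check that requires the two types to be exactly equal, which the thread does not confirm.

```python
from typing import List, Optional

# None stands for a dynamic dimension ('?' in the MLIR types, -1 in the FlatIR dump).
Dim = Optional[int]


def parse_dims(shape: str) -> List[Dim]:
    """Parse a shape string such as '1x?x768' into [1, None, 768] (hypothetical helper)."""
    return [None if d == "?" else int(d) for d in shape.split("x")]


def is_refinement(specific: List[Dim], general: List[Dim]) -> bool:
    """Return True if `specific` agrees with `general` wherever `general` is static."""
    if len(specific) != len(general):
        return False
    return all(g is None or g == s for s, g in zip(specific, general))


result = parse_dims("1x?x768")      # type reported for the op result
decomposition = parse_dims("?x?x?")  # type reported for the decomposition

# An exact-equality check rejects the pair...
print(result == decomposition)
# ...even though the result shape is a valid refinement of the decomposition shape.
print(is_refinement(result, decomposition))
```

Under that assumption, the mismatch could be resolved either by refining the decomposition's type to the static one or by inserting a cast between the two types. Which of these the actual fix uses is not stated in this thread.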

yizhuoz004 added the mlir-tensorrt label (Pull request for the mlir-tensorrt project) Nov 8, 2024

christopherbate self-assigned this Nov 16, 2024
