fix: torchtrtc precision setting logic #3883

yeetypete · 2025-11-03T19:46:24Z

Description

Ensures that the torchtrtc precision settings no longer always include a default fp32 precision when a precision is passed explicitly as an argument. This matters most when compiling a model to run on the DLA, which does not allow fp32 precision. As things stand, it appears this could not be done with the torchtrtc CLI.

Bug example:

torchtrtc ssd_traced.jit.pt ssd_trt_dla.ts "(1,3,300,300)@f16%contiguous" -p fp16 --device-type=dla -v
...
INFO: Settings requested for TensorRT engine:
    Enabled Precisions: Float32 Float16
    TF32 Floating Point Computation Enabled: 1
    Truncate Long and Double: 0
    Make Refittable Engine: 0
    Debuggable Engine: 0
    GPU ID: 0
    Allow GPU Fallback (if running on DLA): 0
    Avg Timing Iterations: 1
    Max Workspace Size: 0
    DLA SRAM Size: 1048576
    DLA Local DRAM Size: 1073741824
    DLA Global DRAM Size: 536870912
    Device Type: DLA
    GPU ID: 0
    DLACore: 0
    Engine Capability: standard
    Calibrator Created: 0

This should report only Float16 under Enabled Precisions.
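A minimal sketch of the behavior described above, written in Python for brevity rather than in the C++ of torchtrtc itself. It assumes the cause is a precision set that is seeded with fp32 by default and then only added to when `-p` is passed. The function names and the string mapping are hypothetical and are not taken from the torchtrtc source.

```python
from typing import Iterable, Set

# Hypothetical mapping from CLI precision strings to the names printed in the log.
_PRECISION_NAMES = {
    "fp32": "Float32",
    "fp16": "Float16",
    "int8": "Int8",
}

DEFAULT_PRECISION = "Float32"


def _parse(precision: str) -> str:
    """Translate a CLI precision string, rejecting anything unknown."""
    try:
        return _PRECISION_NAMES[precision.lower()]
    except KeyError:
        raise ValueError(f"unsupported precision: {precision!r}")


def resolve_precisions_buggy(requested: Iterable[str]) -> Set[str]:
    """Assumed old behavior: the default is kept and explicit precisions are added to it."""
    enabled = {DEFAULT_PRECISION}
    for precision in requested:
        enabled.add(_parse(precision))
    return enabled


def resolve_precisions_fixed(requested: Iterable[str]) -> Set[str]:
    """Assumed fixed behavior: the default applies only when nothing was requested."""
    enabled = {_parse(precision) for precision in requested}
    return enabled if enabled else {DEFAULT_PRECISION}
```

With `-p fp16`, the first function yields both Float32 and Float16, matching the log in the bug example, while the second yields Float16 alone and still falls back to Float32 when no precision is given.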

Type of change

Please delete options that are not relevant and/or add your own.

Bug fix (non-breaking change which fixes an issue)

Checklist:

My code follows the style guidelines of this project (You can use the linters)
I have performed a self-review of my own code
I have commented my code, particularly in hard-to-understand areas and hacks
I have made corresponding changes to the documentation
I have added tests to verify my fix or my feature
New and existing unit tests pass locally with my changes
I have added the relevant labels to my PR so that relevant reviewers are notified

meta-cla · 2025-11-03T19:46:30Z

Hi @yeetypete!

Thank you for your pull request and welcome to our community.

Action Required

In order to merge any pull request (code, docs, etc.), we require contributors to sign our Contributor License Agreement, and we don't seem to have one on file for you.

Process

In order for us to review and merge your suggested changes, please sign at https://code.facebook.com/cla. If you are contributing on behalf of someone else (eg your employer), the individual CLA may not be sufficient and your employer may need to sign the corporate CLA.

Once the CLA is signed, our tooling will perform checks and validations. Afterwards, the pull request will be tagged with CLA signed. The tagging process may take up to 1 hour after signing. Please give it that time before contacting us about it.

If you have received this in error or have any questions, please contact us at [email protected]. Thanks!

meta-cla · 2025-11-03T19:57:44Z

Thank you for signing our Contributor License Agreement. We can now accept your code for this (and any) Meta Open Source project. Thanks!

narendasan · 2025-11-05T17:20:44Z

Thanks for the PR. It seems fine; we will test locally since there are some issues in CI.
One question, somewhat of an aside:

Is there some limitation in the Dynamo / Exported Program workflow that makes it hard to use for DLA? If so this is something we would like to fix so that users can port off TorchScript since the overall PyTorch ecosystem is moving in that direction.

yeetypete · 2025-11-05T17:56:44Z

@narendasan thanks for the quick response.

> Is there some limitation in the Dynamo / Exported Program workflow that makes it hard to use for DLA?

We haven't yet tried the Dynamo / Exported Program workflow, since we are interested in a Pythonless deployment with ahead-of-time compilation. I did notice that the AOT-Inductor Pythonless deployment is in beta, but it seems that you currently need Python to perform the export. This isn't a huge obstacle, but it would be great to know whether the export step can be performed without Python, using only the libtorch and TensorRT C++ APIs. Maybe this is something torchtrtc could support in the future?

narendasan · 2025-11-13T22:46:20Z

AFAIK there isn't currently a way to do torch_tensorrt + aot_compile in C++, but we can look into it and raise the use case in the PyTorch ecosystem. That said, you should still be able to deploy in C++ through AOTInductor + Torch-TensorRT today, provided the export / compile step is done in Python.

yeetypete · 2025-11-17T08:08:44Z

In the end we were able to do the model conversion with Python in a Docker container, so C++-only aot_compile isn't a big blocker for us anymore. We will do some more testing to see if anything else comes up, but so far it looks good.

fix: ensure compiler_settings do not contain any default precision if…

750dae9

… a precision is specified in torchtrtc

github-actions bot added the component: api [C++] Issues re: C++ API label Nov 3, 2025

meta-cla bot added the cla signed label Nov 3, 2025

github-actions bot requested a review from narendasan November 3, 2025 19:57
