- 
                Notifications
    You must be signed in to change notification settings 
- Fork 352
Pull requests: pytorch/ao
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
      Update config.py to negate the dimension issue for FP8 support for AMD GPUs
      
    
      
  
        
          #3246
            opened Oct 24, 2025  by
            kailashg26
            
        
        
            
    
  
    Loading…
 
        
        
      
    
      Only convert to int4 preshuffled tensor in H100
        
              
                CLA Signed
  This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. 
              
                topic: improvement
  Use this tag if this PR is an improvement (doesn't fit into any of the other categories) 
              
                topic: not user facing
  Use this tag if you don't want this PR to show up in release notes 
        
      
    
      
  
        
          #3245
            opened Oct 24, 2025  by
            jerryzh168
            
        
        
            
    
  
    Loading…
 
        
        
      
    
      add accelerate to dev-requirements to fix CI
        
              
                CLA Signed
  This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. 
              
                topic: not user facing
  Use this tag if you don't want this PR to show up in release notes 
        
      
    
      
  
        
          #3244
            opened Oct 24, 2025  by
            jcaip
            
        
        
            
    
  
    Loading…
 
        
        
      
    
      fix the outdated end2end training examples of moe+torchtitan
        
              
                CLA Signed
  This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. 
        
      
    
      
  
        
          #3242
            opened Oct 24, 2025  by
            rakkit
            
        
        
            
    
  
    Loading…
 
        
        
      
    
      introduce new int8 quantization API
        
              
                CLA Signed
  This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. 
        
      
    
      
  
        
          #3241
            opened Oct 24, 2025  by
            namgyu-youn
            
        
        
            
    
  
    Loading…
 
        
        
      
    
      nvfp4: support inference_mode and rank 3
        
              
                CLA Signed
  This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. 
              
                topic: improvement
  Use this tag if this PR is an improvement (doesn't fit into any of the other categories) 
        
      
    
      
  
        
          #3240
            opened Oct 24, 2025  by
            vkuzo
            
        
        
            
    
  
    Loading…
 
        
        
      
    
      mx: support inference_mode and rank 3+
        
              
                CLA Signed
  This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. 
              
                topic: improvement
  Use this tag if this PR is an improvement (doesn't fit into any of the other categories) 
        
      
    
      
  
        
          #3238
            opened Oct 24, 2025  by
            vkuzo
            
        
        
            
    
  
    Loading…
 
        
        
      
    
      Support int8 output for scaled_embedding_bag
        
              
                CLA Signed
  This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. 
              
                topic: new feature
  Use this tag if this PR adds a new feature 
        
      
    
      
  
        
          #3231
            opened Oct 23, 2025  by
            shiyang-weng
            
        
        
            
    •
    
      Draft
    
  
        
        
      
    
      [CPU] Improve INT8 SDPA template
        
              
                CLA Signed
  This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. 
              
                topic: not user facing
  Use this tag if you don't want this PR to show up in release notes 
        
      
    
      
  
        
          #3230
            opened Oct 23, 2025  by
            Xia-Weiwen
            
        
        
            
    •
    
      Draft
    
  
        
        
      
    
      Add per tensor fp8 quantization support for conv3d
        
              
                CLA Signed
  This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. 
        
      
    
      
  
        
          #3215
            opened Oct 20, 2025  by
            jerryzh168
            
        
        
            
    •
    
      Draft
    
  
        
        
      
    
      [mxfp8 moe training] bench and profile mxfp8 a2a fwd and bwd separately
        
              
                CLA Signed
  This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. 
              
                moe
              
                mx
              
                topic: not user facing
  Use this tag if you don't want this PR to show up in release notes 
        
      
    
      
  
        
          #3203
            opened Oct 17, 2025  by
            danielvegamyhre
            
        
        
            
    
  
    Loading…
 
        
        
      
    
      [wip] speed up nvfp4 triton kernel
        
              
                CLA Signed
  This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. 
        
      
    
      
  
        
          #3202
            opened Oct 17, 2025  by
            vkuzo
            
        
        
            
    
  
    Loading…
 
        
        
      
    
      [mxfp8 moe training] improve unit tests
        
              
                CLA Signed
  This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. 
        
      
    
      
  
        
          #3201
            opened Oct 17, 2025  by
            danielvegamyhre
            
        
        
            
    
  
    Loading…
 
        
        
      
    
      [mxfp8 moe training] integrate triton quant/dequant kernels into mxfp8 all to all
        
              
                CLA Signed
  This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. 
              
                moe
              
                mx
              
                topic: not user facing
  Use this tag if you don't want this PR to show up in release notes 
        
      
    
      
  
        
          #3197
            opened Oct 17, 2025  by
            danielvegamyhre
            
        
        
            
    
  
    Loading…
 
        
        
      
    
      [mxfp8 moe training] add triton kernel for mxfp8 dequantization
        
              
                CLA Signed
  This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. 
              
                moe
              
                mx
              
                topic: not user facing
  Use this tag if you don't want this PR to show up in release notes 
        
      
    
      
  
        
          #3195
            opened Oct 16, 2025  by
            danielvegamyhre
            
        
        
            
    
  
    Loading…
 
        
        
      
    
      [Intel GPU] enable use of dinov2 models for offload benchmark_low_bit_adam
        
              
                CLA Signed
  This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. 
              
                topic: improvement
  Use this tag if this PR is an improvement (doesn't fit into any of the other categories) 
        
      
    
      
  
        
          #3191
            opened Oct 16, 2025  by
            arlesniak
            
        
        
            
    
  
    Loading…
 
        
        
      
    
      Fix setup.py to skip CPU kernels on Windows
        
              
                CLA Signed
  This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. 
              
                topic: not user facing
  Use this tag if you don't want this PR to show up in release notes 
        
      
    
      
  
        
          #3187
            opened Oct 16, 2025  by
            Xia-Weiwen
            
        
        
            
    
  
    Loading…
 
        
        
      
    
      [not4land] Some fixes for MXFP8
        
              
                CLA Signed
  This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. 
        
      
    
      
  
        
          #3183
            opened Oct 15, 2025  by
            jerryzh168
            
        
        
            
    •
    
      Draft
    
  
        
        
      
    
      [WIP] Move float8 cutlass sparse layout to Float8SemiSparseTensor
        
              
                CLA Signed
  This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. 
        
      
    
      
  
        
          #3182
            opened Oct 15, 2025  by
            bbeckca
            
        
        
            
    
  
    Loading…
 
        
        
      
    
      [Intel GPU] Extend TestQAT module with xpu testcases
        
              
                CLA Signed
  This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. 
        
      
    
      
  
        
          #3177
            opened Oct 15, 2025  by
            agrabow
            
        
        
            
    
  
    Loading…
 
        
        
      
    
      Optional: use inv_scale for WeightTensorWithLinearActivationScaleMetadata
        
              
                CLA Signed
  This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. 
              
                topic: not user facing
  Use this tag if you don't want this PR to show up in release notes 
        
      
    
      
  
        
          #3176
            opened Oct 15, 2025  by
            Xia-Weiwen
            
        
        
            
    
  
    Loading…
 
        
        
      
    
      Add NPU (Ascend) backend support for INT4 weight-only quantization workflow
        
              
                CLA Signed
  This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. 
        
      
    
      
  
        
          #3172
            opened Oct 14, 2025  by
            orangeH25
            
        
        
            
    
  
    Loading…
 
        
        
      
    
      [draft] Update Float8Tensor for GRPO training in unsloth
        
              
                CLA Signed
  This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. 
              
                topic: improvement
  Use this tag if this PR is an improvement (doesn't fit into any of the other categories) 
        
      
    
      
  
        
          #3158
            opened Oct 12, 2025  by
            andrewor14
            
        
        
            
    •
    
      Draft
    
  
        
        
      
    
      Introduce SINQ quantization algorithm
        
              
                CLA Signed
  This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. 
        
      
    
      
  
        
          #3156
            opened Oct 11, 2025  by
            namgyu-youn
            
        
        
            
    
  
    Loading…
 
        
        
      
    
      Testing subclassing
        
              
                CLA Signed
  This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. 
        
      
    
      
  
        
          #3153
            opened Oct 10, 2025  by
            Krishn1412
            
        
        
            
    
  
    Loading…
 
        
        
      
    Previous Next
  
  
  ProTip!
  Mix and match filters to narrow down what you’re looking for.