-
Notifications
You must be signed in to change notification settings - Fork 610
Pull requests: pytorch/FBGEMM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Open source two simplical attention kernels
cla signed
fb-exported
#4445
opened Jul 3, 2025 by
choutim
Loading…
Migrate jagged tensor kernels to
FBGEMM_LAUNCH_KERNEL
, pt 4
cla signed
fb-exported
#4441
opened Jul 3, 2025 by
q10
Loading…
kv embedding dram delta loading in predictor
cla signed
fb-exported
#4438
opened Jul 2, 2025 by
EddyLXJ
Loading…
Support get/set the whole row of metaheader+weight+optimizer from backend for checkpoint saving/loading
cla signed
fb-exported
#4435
opened Jul 2, 2025 by
bobbyliujb
Loading…
Optimize tbe_input_combine_with_length_cuda on AMD
cla signed
fb-exported
#4430
opened Jul 1, 2025 by
JChunX
Loading…
Support get/set the whole row of metaheader+weight+optimizer from backend for checkpoint saving/loading
cla signed
fb-exported
#4429
opened Jul 1, 2025 by
bobbyliujb
Loading…
Use static functions/variables if possible (#4423)
cla signed
fb-exported
#4427
opened Jul 1, 2025 by
q10
Loading…
Call torchrec cpu tests from fbgemm test gha
cla signed
fb-exported
#4424
opened Jul 1, 2025 by
nipung90
Loading…
Invoke AMD specific kernel reorder_batched_ad_indices_kernel_vec
cla signed
fb-exported
#4412
opened Jun 27, 2025 by
ghq24int
Loading…
Migrate jagged tensor kernels to
FBGEMM_LAUNCH_KERNEL
, pt 3
cla signed
fb-exported
#4411
opened Jun 27, 2025 by
q10
Loading…
Add Manifold wrapper (attemp 2) Part1 Backend
cla signed
fb-exported
#4410
opened Jun 27, 2025 by
gchalump
Loading…
Support optimizer state offloading for partial rowwise adam optimizer
cla signed
fb-exported
#4405
opened Jun 26, 2025 by
q10
Loading…
Add Manifold wrapper (attemp 2) Part2 Frontend
cla signed
fb-exported
#4404
opened Jun 26, 2025 by
gchalump
Loading…
Fix CQS signal facebook-unused-include-check in fbcode/deeplearning/fbgemm/fbgemm_gpu/src/input_combine_ops
cla signed
fb-exported
#4402
opened Jun 26, 2025 by
q10
Loading…
Fix CQS signal facebook-unused-include-check in fbcode/deeplearning/fbgemm/src [B] [B] [A]
cla signed
fb-exported
#4400
opened Jun 25, 2025 by
q10
Loading…
reorder_batched_ad_indices_kernel on IFR/CFR shape results
cla signed
fb-exported
#4393
opened Jun 23, 2025 by
ghq24int
Loading…
Revert D76866940
ci-no-td
cla signed
fb-exported
#4391
opened Jun 23, 2025 by
PatriceVignola
Loading…
Back out "Add manifold wrapper"
cla signed
fb-exported
#4387
opened Jun 20, 2025 by
gchalump
Loading…
add meta impl for int4 preshuffle kernels
cla signed
fb-exported
#4384
opened Jun 20, 2025 by
garroud
Loading…
add monitroing metrics for dram cache perf
cla signed
fb-exported
#4383
opened Jun 20, 2025 by
kathyxuyy
Loading…
Previous Next
ProTip!
Follow long discussions with comments:>50.