NVIDIA / Fuser Public

Notifications You must be signed in to change notification settings
Fork 56
Star 317

Code
Issues 222
Pull requests 156
Actions
Projects
Wiki
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Projects
Wiki
Security
Insights

Pull requests: NVIDIA/Fuser

Labels 46 Milestones 0

New pull request New

156 Open 3,271 Closed

Author

Filter by author

Label

Filter by label

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Milestones

Filter by milestone

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Fix scheduling of split-K with smem_epilogue on Hopper Matmuls

#4257 opened Apr 16, 2025 by jacobhinkle

Loading…

Pm/preseg reorder sharded

#4256 opened Apr 16, 2025 by Priya2698 • Draft

Remove several uses of NVFUSER_DISTRIBUTED

#4255 opened Apr 15, 2025 by wujingyue

Loading…

shardAllLike accepts a list of parallel types

#4254 opened Apr 15, 2025 by Priya2698

Loading…

[Stream Lowering] second milestone: collective-based comm+compute pipelines

#4252 opened Apr 15, 2025 by samnordmann • Draft

Create short-circuit in persistent outer for-loop to minimize cost of wave quantization. Matmuls

#4249 opened Apr 14, 2025 by rdspring1

Loading…

Enable TensorIndexer with the transpose tests

#4246 opened Apr 12, 2025 by naoyam • Draft

Add support for 2d grid swizzle in hopper matmul scheduler. Matmuls

#4243 opened Apr 11, 2025 by rdspring1

Loading…

warp specializied tma persistent kernel, step-2, use TMA load

#4240 opened Apr 11, 2025 by liqiangxl

Loading…

InsertReshardingsPass decomposes matmul/linear+ReduceScatter.

#4239 opened Apr 11, 2025 by wujingyue

Loading…

Use TensorIndexer for the view tests

#4237 opened Apr 11, 2025 by naoyam

Loading…

Add missing non-divisible predicates in TensorIndexer idmodel

#4236 opened Apr 11, 2025 by naoyam • Draft

[WIP] Move edge ownership to SegmentedGroup using shared_ptr

#4235 opened Apr 11, 2025 by csarofeen • Draft

check ID coverage for reference_tv in reduction scheduler

#4223 opened Apr 10, 2025 by jjsjann123 • Draft

2 tasks done

Add segmentation helper functions for edge processing

#4222 opened Apr 9, 2025 by csarofeen • Draft

WIP: ScanOp

#4211 opened Apr 8, 2025 by jacobhinkle • Draft

wip

#4200 opened Apr 7, 2025 by liqiangxl • Draft

[DRAFT] Refactor python build

#4193 opened Apr 4, 2025 by rdspring1 • Draft

DID loop split for scatter

#4191 opened Apr 4, 2025 by Priya2698

Loading…

fix register spills in thread local outer reduction

#4184 opened Apr 3, 2025 by liqiangxl

Loading…

Stream lowering (latest branch)

#4179 opened Apr 3, 2025 by samnordmann • Draft

[WIP] Support scalar outputs from fusions #3947

#4162 opened Apr 1, 2025 by csarofeen • Draft

Create Statement, Expr, and Val bindings Direct Bindings

Python extension with direct mapping to NvFuser CPP objects.

Python API

Issues related to the Python API

#4157 opened Mar 30, 2025 by rdspring1

Loading…

Create direct_bindings_api extension Direct Bindings

Python extension with direct mapping to NvFuser CPP objects.

Python API

Issues related to the Python API

#4156 opened Mar 30, 2025 by rdspring1 • Draft

[Host Ir] stream lowering, first milestone: single device fusion

#4148 opened Mar 26, 2025 by samnordmann • Draft

Previous 1 2 3 4 5 6 7 Next

Previous Next

ProTip! Mix and match filters to narrow down what you’re looking for.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly