Modular Co-Design Interpolants #554

Closed

wants to merge 21 commits

Conversation

nvdreidenbach
Collaborator

Release of v1.0 of BioNeMo Modular Co-Design (MoCo)

Introduces modular interpolants for various popular generative model frameworks including continuous and discrete diffusion and flow matching.

Summary

Introduces MoCo.

Details

See documentation.md for details.

Usage

```bash
pip install bionemo-moco
```

```python
from bionemo.moco.interpolants import ContinuousFlowMatcher
from bionemo.moco.distributions.time import UniformTimeDistribution
from bionemo.moco.distributions.prior import GaussianPrior

uniform_time = UniformTimeDistribution()
moon_prior = GaussianPrior()
sigma = 0.1
cfm = ContinuousFlowMatcher(time_distribution=uniform_time,
                            prior_distribution=moon_prior,
                            sigma=sigma,
                            prediction_type="velocity")
```

See the examples directory for notebook tutorials.
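For intuition, the conditional flow-matching interpolation that `ContinuousFlowMatcher` wraps can be sketched in plain NumPy. This illustrates the underlying math only, not the library's API; all names below are illustrative:

```python
import numpy as np

def interpolate(x0, x1, t, sigma=0.1, rng=None):
    """Conditional flow-matching interpolant:
    x_t = t * x1 + (1 - t) * x0 + sigma * eps,
    with velocity regression target u_t = x1 - x0.
    """
    rng = rng or np.random.default_rng(0)
    eps = rng.standard_normal(x0.shape)
    # Broadcast the per-sample time over feature dimensions.
    t = np.asarray(t).reshape(-1, *([1] * (x0.ndim - 1)))
    xt = t * x1 + (1.0 - t) * x0 + sigma * eps
    ut = x1 - x0  # target for a prediction_type="velocity" model
    return xt, ut

rng = np.random.default_rng(42)
x0 = rng.standard_normal((8, 2))   # samples from a Gaussian prior
x1 = rng.standard_normal((8, 2))   # data samples (e.g. two-moons points)
t = rng.uniform(size=8)            # uniform time distribution
xt, ut = interpolate(x0, x1, t)
```

A model trained to regress `ut` from `(xt, t)` can then be integrated from prior samples to data at inference time.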

Testing

Unit tests for all key functions.

Tests for these changes can be run via:

```bash
pytest -v tests
```

@nvdreidenbach nvdreidenbach added the SKIP_CI Completely skips the CI pipeline label Dec 24, 2024
@nvdreidenbach nvdreidenbach force-pushed the moco branch 2 times, most recently from 70fc9ce to 0af104d Compare December 24, 2024 18:29
@nvdreidenbach nvdreidenbach changed the title initial commit Modular Co-Design Interpolants Dec 24, 2024
@nvdreidenbach nvdreidenbach force-pushed the moco branch 3 times, most recently from 8c29d92 to 146985a Compare December 24, 2024 20:30
@jstjohn jstjohn removed the SKIP_CI Completely skips the CI pipeline label Dec 24, 2024
@jstjohn
Collaborator

jstjohn commented Dec 24, 2024

/build-ci

@jstjohn
Collaborator

jstjohn commented Jan 2, 2025

/build-ci

@nvdreidenbach
Collaborator Author

/build-ci

nvdreidenbach and others added 21 commits January 2, 2025 15:55
Signed-off-by: Danny <[email protected]>
Signed-off-by: Danny <[email protected]>
When NvFaidx was used on Fasta files containing duplicate sequence ids,
which violates the FASTA spec, it would silently fail and use the
last-seen sequence as an entry.

This PR fails by default and exposes a parameter to ignore sequence_ids
and use integer indexing instead.
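The fail-by-default behavior described above can be sketched as follows (hypothetical helper and parameter names; the real NvFaidx API may differ):

```python
def index_fasta_ids(ids, ignore_sequence_ids=False):
    """Map sequence ids to record positions.

    Duplicate ids violate the FASTA spec, so raise by default instead of
    silently keeping the last-seen record. With ignore_sequence_ids=True,
    fall back to integer indexing.
    """
    if ignore_sequence_ids:
        return {i: i for i in range(len(ids))}
    index = {}
    for pos, seq_id in enumerate(ids):
        if seq_id in index:
            raise ValueError(f"Duplicate sequence id {seq_id!r} at record {pos}")
        index[seq_id] = pos
    return index
```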

Signed-off-by: Danny <[email protected]>
Update DDP config to speed up ESM-2 15B pretraining

Turn off `grad_reduce_in_fp32` in the mixed precision plugin (default is True) to reduce memory consumption, and turn on `overlap_grad_reduce` and `average_in_collective` to improve performance.

Pause `overlap_param_gather=True` until NeMo's fix lands.
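As a sketch, the flags above map onto a Megatron-style DDP/precision configuration roughly like this (illustrative class and field names; consult the actual NeMo/Megatron config classes before use):

```python
from dataclasses import dataclass

@dataclass
class DDPSpeedupConfig:
    # Mixed precision plugin: keep grad reduction in bf16 to save memory.
    grad_reduce_in_fp32: bool = False
    # Overlap gradient reduction with the backward pass.
    overlap_grad_reduce: bool = True
    # Average gradients inside the collective instead of dividing afterwards.
    average_in_collective: bool = True
    # Paused pending NeMo's fix.
    overlap_param_gather: bool = False

cfg = DDPSpeedupConfig()
```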

Signed-off-by: Danny <[email protected]>
Pins mistune to fix a Jupyter notebook build issue introduced in 3.1.0

lepture/mistune#403

Bypassing review rules to fix CI due to holiday OOO

Signed-off-by: Danny <[email protected]>
Bumps [3rdparty/Megatron-LM](https://github.com/NVIDIA/Megatron-LM) from
`99f23d2` to `2da43ef`.
<details>
<summary>Commits</summary>
<ul>
<li><a
href="https://github.com/NVIDIA/Megatron-LM/commit/2da43ef4c1b9e76f03b7567360cf7390e877f1b6"><code>2da43ef</code></a>
Merge branch 'mmodal_eval_in_folder' into 'main'</li>
<li><a
href="https://github.com/NVIDIA/Megatron-LM/commit/e51a3ac1dcd366f51bcb0339ecca31790c3cfcd1"><code>e51a3ac</code></a>
ADLR/megatron-lm!2491 - Move mmodal evaluation code to its own
folder</li>
<li><a
href="https://github.com/NVIDIA/Megatron-LM/commit/d3c585e90ebd5937243c8d4c9d5d5cf9d61665d6"><code>d3c585e</code></a>
Merge branch 'jbarker/pp_unfreeze' into 'main'</li>
<li><a
href="https://github.com/NVIDIA/Megatron-LM/commit/1468ab01c079d5e14888dda97d1c99d2cb62afb2"><code>1468ab0</code></a>
ADLR/megatron-lm!2285 - Support --freeze-LM and --freeze-ViT with ranks
that ...</li>
<li><a
href="https://github.com/NVIDIA/Megatron-LM/commit/cf25d44037af4e9d5ea723918823de9b2416a30c"><code>cf25d44</code></a>
Merge branch 'boxin/nvlm_ckpt_release' into 'main'</li>
<li><a
href="https://github.com/NVIDIA/Megatron-LM/commit/1da9dad62b97917caacb1fd271abaed403581caa"><code>1da9dad</code></a>
ADLR/megatron-lm!2494 - Add model checkpoint links</li>
<li><a
href="https://github.com/NVIDIA/Megatron-LM/commit/25b1f33035ad55eeae6b9a4367f987f1fac804dd"><code>25b1f33</code></a>
Merge branch 'helenn-rope-fusion-mem-layout' into 'main'</li>
<li><a
href="https://github.com/NVIDIA/Megatron-LM/commit/7bb53792831d80007789ff5c60bc1798cbd34548"><code>7bb5379</code></a>
ADLR/megatron-lm!2469 - Correct strides for bshd layout and revert RoPE
tests...</li>
<li><a
href="https://github.com/NVIDIA/Megatron-LM/commit/b8420a1909980aa3b6750f75b2d7ab8b23338948"><code>b8420a1</code></a>
Merge branch 'group_topk' into 'main'</li>
<li><a
href="https://github.com/NVIDIA/Megatron-LM/commit/d0df563d8739e4dfe2b0e90ba190ac389f165157"><code>d0df563</code></a>
ADLR/megatron-lm!1934 - Support Device-Limited Routing and Sequence
Auxiliary...</li>
<li>Additional commits viewable in <a
href="https://github.com/NVIDIA/Megatron-LM/compare/99f23d2f111d12b73b1fbf386c60517101ff8abe...2da43ef4c1b9e76f03b7567360cf7390e877f1b6">compare
view</a></li>
</ul>
</details>
<br />

Dependabot will resolve any conflicts with this PR as long as you don't
alter it yourself. You can also trigger a rebase manually by commenting
`@dependabot rebase`.

[//]: # (dependabot-automerge-start)
[//]: # (dependabot-automerge-end)

---

<details>
<summary>Dependabot commands and options</summary>
<br />

You can trigger Dependabot actions by commenting on this PR:
- `@dependabot rebase` will rebase this PR
- `@dependabot recreate` will recreate this PR, overwriting any edits
that have been made to it
- `@dependabot merge` will merge this PR after your CI passes on it
- `@dependabot squash and merge` will squash and merge this PR after
your CI passes on it
- `@dependabot cancel merge` will cancel a previously requested merge
and block automerging
- `@dependabot reopen` will reopen this PR if it is closed
- `@dependabot close` will close this PR and stop Dependabot recreating
it. You can achieve the same result by closing it manually
- `@dependabot show <dependency name> ignore conditions` will show all
of the ignore conditions of the specified dependency
- `@dependabot ignore this major version` will close this PR and stop
Dependabot creating any more for this major version (unless you reopen
the PR or upgrade to it yourself)
- `@dependabot ignore this minor version` will close this PR and stop
Dependabot creating any more for this minor version (unless you reopen
the PR or upgrade to it yourself)
- `@dependabot ignore this dependency` will close this PR and stop
Dependabot creating any more for this dependency (unless you reopen the
PR or upgrade to it yourself)

</details>

Signed-off-by: dependabot[bot] <[email protected]>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Signed-off-by: Danny <[email protected]>
Signed-off-by: Danny <[email protected]>
Attempts to update the base image to the most recent 24.12 release

Signed-off-by: Danny <[email protected]>
## Summary
Un-xfails a geneformer H100 test.

## Details
After the base image upgrade to PyTorch framework 24.12
(NVIDIA#553), the H100 Geneformer
issue is fixed.

## Usage and Testing
```bash
pytest ./sub-packages/bionemo-geneformer/tests/bionemo/geneformer/test_model.py::test_geneformer_nemo1_v_nemo2_inference_golden_values
```

Signed-off-by: Danny <[email protected]>
The new Ubuntu base container contains a couple of changes that break
the (untested in CI) base container:
1. it now has a default 1000:1000 `ubuntu` user we can use, instead of
creating a new bionemo user.
2. it uses Python 3.12, which changes some of our copy paths.

---------

Signed-off-by: Peter St. John <[email protected]>
Signed-off-by: Danny <[email protected]>