
MNPE class similar to MNLE #1362

Open · wants to merge 14 commits into main
Conversation

@dgedon (Collaborator) commented Jan 10, 2025

Implementation of mixed NPE where we have some continuous parameters theta followed by one (or, with PR #1269, multiple) discrete parameters. The observation space is fully continuous.

Deprecated mnle.py in net_builders and unified MNLE/MNPE as mixed_nets.py.
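For context, a minimal usage sketch. It assumes MNPE is exposed under sbi.inference analogously to MNLE and that MultipleIndependent accepts a Bernoulli component for the discrete parameter; exact names and signatures may differ from this PR:

```python
import torch
from torch.distributions import Bernoulli

from sbi.inference import MNPE  # assumed import path for the class added by this PR
from sbi.utils import BoxUniform, MultipleIndependent

# Prior: one continuous parameter followed by one discrete (binary) parameter.
prior = MultipleIndependent(
    [BoxUniform(torch.zeros(1), torch.ones(1)), Bernoulli(probs=0.5 * torch.ones(1))],
    validate_args=False,
)

theta = prior.sample((1000,))
# Fully continuous observations from a toy simulator mixing both parameters.
x = theta[:, :1] + theta[:, 1:] + 0.1 * torch.randn(1000, 1)

trainer = MNPE(prior=prior)
trainer.append_simulations(theta, x).train()
posterior = trainer.build_posterior()
samples = posterior.sample((100,), x=x[:1])
```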

@dgedon (Collaborator, Author) commented Jan 17, 2025

Update:

  • MNPE and tests are implemented
  • for the test with a Bernoulli prior, I had to change mcmc_transforms to handle discrete distributions; by default, we just compute mean/std for discrete distributions
  • currently, MNPE with embedding nets does not work yet; it raises an in-place operation error during the backward pass that I couldn't solve yet

@janfb (Contributor) commented Feb 25, 2025

@dgedon #1269 is now merged 🙌

@dgedon (Collaborator, Author) commented Mar 18, 2025

Updates:

  • bug fix so everything works now; essentially, normalization needs to be handled with care when switching from MNLE to MNPE
  • removed unnecessary GPU handling (hackathon task); this means MultipleIndependent does not support a device argument yet

@janfb (Contributor) left a comment:

Looks good overall, except one central question about the call signature of MNPE.

@@ -11,10 +11,12 @@


 class MixedDensityEstimator(ConditionalDensityEstimator):
-    """Class performing Mixed Neural Likelihood Estimation.
+    """Class performing Mixed Neural Density Estimation.
janfb (Contributor): 👍

Comment on lines +134 to +138
assert isinstance(
    density_estimator, MixedDensityEstimator
), f"""net must be of type MixedDensityEstimator but is {
    type(density_estimator)
}."""
janfb (Contributor):
we could also change the type above to be MixedDensityEstimator to have a static check.

dgedon (Collaborator, Author):
Good point. I'll add this in addition to the assertion.
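A minimal sketch of what the statically checked version could look like; the class stand-in, the enclosing method, and the import path are assumptions for illustration, not the actual code in this PR:

```python
from typing import Optional

from sbi.neural_nets.estimators import MixedDensityEstimator  # assumed import path


class MNPE:  # stripped-down stand-in for the class in this PR
    def __init__(self, density_estimator: Optional[MixedDensityEstimator] = None) -> None:
        # The annotation lets static type checkers catch wrong estimator types,
        # while the runtime assert still guards calls from untyped code.
        if density_estimator is not None:
            assert isinstance(density_estimator, MixedDensityEstimator), (
                f"net must be of type MixedDensityEstimator but is "
                f"{type(density_estimator)}."
            )
        self._neural_net = density_estimator
```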

Comment on lines 16 to 19
This estimator combines a Categorical net and a neural density estimator to model
data with mixed types (discrete and continuous), e.g., as they occur in
decision-making models. It can be used for both likelihood and posterior estimation
of mixed data.
janfb (Contributor):
Suggested change:
-    This estimator combines a Categorical net and a neural density estimator to model
-    data with mixed types (discrete and continuous), e.g., as they occur in
-    decision-making models. It can be used for both likelihood and posterior estimation
-    of mixed data.
+    This estimator combines a categorical mass estimator and a density estimator to model
+    variables with mixed types (discrete and continuous). It can be used for both likelihood
+    estimation (e.g., for discrete decisions and continuous reaction times in decision-making
+    models) or posterior estimation (e.g., for models that have both discrete and continuous
+    parameters).

"""The forward method is not implemented for MNLE, use '.sample(...)' to
generate samples though a forward pass."""
"""The forward method is not implemented for mixed neural density
estimation,use '.sample(...)' to generate samples though a forward
janfb (Contributor):
Suggested change:
-        estimation,use '.sample(...)' to generate samples though a forward
+        estimation, use '.sample(...)' to generate samples though a forward

Comment on lines 228 to 257
def build_mnle(
    batch_x: Tensor,
    batch_y: Tensor,
    **kwargs,
) -> MixedDensityEstimator:
    """Returns a mixed neural likelihood estimator.

    This estimator models p(x|theta) where x contains both continuous and discrete data.

    Args:
        batch_x: Batch of xs (data), used to infer dimensionality.
        batch_y: Batch of ys (parameters), used to infer dimensionality.
        **kwargs: Additional arguments passed to _build_mixed_density_estimator.

    Returns:
        MixedDensityEstimator for MNLE.
    """
    return _build_mixed_density_estimator(
        batch_x=batch_x, batch_y=batch_y, mode="mnle", **kwargs
    )


def build_mnpe(
    batch_x: Tensor,
    batch_y: Tensor,
    **kwargs,
) -> MixedDensityEstimator:
    """Returns a mixed neural posterior estimator.

    This estimator models p(theta|x) where x contains both continuous and discrete data.
janfb (Contributor):
I am confused by these two builder functions. Maybe I am missing something, but couldn't we just call _build_mixed_density_estimator with batch_x and batch_y swapped for MNPE and MNLE?

To me it seems this swapping is not happening, i.e., we need to make sure that in MNPE we only embed x and not theta, and vice versa in MNLE. Let's discuss tomorrow.

dgedon (Collaborator, Author):
True, we could remove both functions and just use _build_mixed_density_estimator.

The swapping of x/theta does happen, because the builder is called once via likelihood_nn and once via posterior_nn.
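For illustration, a hedged sketch of where the swap happens; the wrapper names below are stand-ins for how the likelihood_nn/posterior_nn factories forward their arguments, not code from this PR, and _build_mixed_density_estimator is assumed to come from the surrounding module:

```python
# How the two factories could map (theta, x) onto the generic
# (batch_x, batch_y) arguments of _build_mixed_density_estimator.

def likelihood_builder(batch_theta, batch_x, **kwargs):
    # MNLE models p(x | theta): x is the modeled variable, theta the condition.
    return _build_mixed_density_estimator(batch_x=batch_x, batch_y=batch_theta, **kwargs)


def posterior_builder(batch_theta, batch_x, **kwargs):
    # MNPE models p(theta | x): theta is the modeled variable, x the condition.
    return _build_mixed_density_estimator(batch_x=batch_theta, batch_y=batch_x, **kwargs)
```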

@dgedon (Collaborator, Author) commented Mar 19, 2025

Update:

  • simplify _build_mixed_density_estimator by removing the mode='mnpe'/'mnle' argument
  • add log_transform_x as a kwarg with a default value to build_mnle and build_mnpe

@janfb (Contributor) left a comment:

Some more comments on the tests and the refactoring.

I am suggesting a toy example with a ground-truth posterior for the MNPE scenario to test accuracy.

Comment on lines 729 to 741
try:
    prior_mean = prior.mean.to(device)
    prior_std = prior.stddev.to(device)
except (NotImplementedError, AttributeError):
    warnings.warn(
        "The passed discrete prior has no mean or stddev attribute, "
        "estimating them from samples to build affine standardizing "
        "transform.",
        stacklevel=2,
    )
    theta = prior.sample(torch.Size((num_prior_samples_for_zscoring,)))
    prior_mean = theta.mean(dim=0).to(device)
    prior_std = theta.std(dim=0).to(device)
janfb (Contributor):
move this into a small function to avoid code duplication?
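A possible shape for such a helper, as a sketch only; the function name and its exact location in the utils module are assumptions:

```python
import warnings

import torch


def _get_prior_mean_std(prior, device, num_samples: int = 1000):
    """Return the prior's mean/std, falling back to sample-based estimates for
    priors (e.g., discrete ones) that do not expose .mean / .stddev."""
    try:
        return prior.mean.to(device), prior.stddev.to(device)
    except (NotImplementedError, AttributeError):
        warnings.warn(
            "The passed prior has no mean or stddev attribute, estimating them "
            "from samples to build the affine standardizing transform.",
            stacklevel=2,
        )
        samples = prior.sample(torch.Size((num_samples,)))
        return samples.mean(dim=0).to(device), samples.std(dim=0).to(device)
```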

x = mixed_param_simulator(theta)

# Build estimator manually
theta_embedding = FCEmbedding(1, 1) # simple embedding net, 1 continuous parameter
janfb (Contributor):
this should be an x_embedding to avoid confusion

    flow_model=flow_model,
    z_score_theta=z_score_theta,
    embedding_net=theta_embedding if use_embed_net else torch.nn.Identity(),
    log_transform_x=False,
janfb (Contributor): 👍

    log_transform_x=False,
)
trainer = MNPE(density_estimator=density_estimator)
trainer.append_simulations(theta, x).train(max_num_epochs=5)
janfb (Contributor):
max_num_epochs=1 to speed up tests.

janfb (Contributor):
Please remove all diffs here. We probably want to have some kind of tutorial or how-to guide for MNPE, but let's wait for the new documentation setup.

Comment on lines +20 to +33
def mixed_param_simulator(theta: Tensor) -> Tensor:
    """Simulator for continuous data with mixed parameters.

    Args:
        theta: Parameters with mixed types - continuous and discrete.
    Returns:
        x: Continuous observation.
    """
    device = theta.device

    # Extract parameters
    a, b = theta[:, 0], theta[:, 1]
    noise = 0.05 * torch.randn(a.shape, device=device).reshape(-1, 1)
    return (a + 2 * b).reshape(-1, 1) + noise
janfb (Contributor):
Can we come up with a simulator for which we have a ground-truth posterior, so we can test accuracy?

E.g., a mixture of Gaussians where the discrete parameter just selects the component:

z ~ Categorical
x ~ N(mu_z, 1)

with Gaussian priors on mu_z with different prior means, e.g., -1 and 1.

Then, for a fixed z, the posterior is just a Gaussian composed from the likelihood and the prior Gaussian, just like in our standard linear Gaussian example.
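A hedged sketch of such a toy problem; the parameter layout (continuous means first, discrete component index last), the prior means, and all variances are illustrative choices rather than code from this PR:

```python
import torch
from torch.distributions import Normal

# Gaussian priors on the two component means: mu_0 ~ N(-1, 1), mu_1 ~ N(1, 1).
prior_means = torch.tensor([-1.0, 1.0])
prior_std = 1.0
likelihood_std = 1.0


def simulator(theta: torch.Tensor) -> torch.Tensor:
    """theta = (mu_0, mu_1, z) with discrete z in {0, 1}; returns x ~ N(mu_z, 1)."""
    mu, z = theta[:, :2], theta[:, 2].long()
    mu_z = mu.gather(1, z.unsqueeze(1))
    return mu_z + likelihood_std * torch.randn_like(mu_z)


def true_posterior_given_z(x_o: torch.Tensor, z: int) -> Normal:
    """For fixed z, the posterior over mu_z is the standard conjugate Gaussian,
    exactly as in the linear Gaussian reference example."""
    post_var = 1.0 / (1.0 / prior_std**2 + 1.0 / likelihood_std**2)
    post_mean = post_var * (prior_means[z] / prior_std**2 + x_o / likelihood_std**2)
    return Normal(post_mean, post_var**0.5)
```

The fixed-z check already gives an exact reference; the mixture over z could additionally be recovered by weighting these per-component posteriors.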
