
[ENH] Add the Option to Choose Number of Draws in PPD #871

Open · wants to merge 5 commits into base: main
Conversation

jgyasu

@jgyasu jgyasu commented Jan 20, 2025

Closes #739

This PR modifies the predict method in the Model class so that it accepts an optional argument num_draws and then randomly selects that many draws from the posterior predictive distribution, i.e., num_draws out of the total draws, without replacement. If num_draws exceeds the total number of draws, it emits a warning and falls back to using all draws.

  • Passes pylint
  • Passes black

I would appreciate help with testing the method with this new parameter.
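A minimal sketch of the selection logic described above (the helper name and the use of the stdlib random module are illustrative only; the actual PR subsets the InferenceData object directly):

```python
import random
import warnings


def select_draw_indices(total_draws, num_draws, random_seed=None):
    """Pick `num_draws` draw indices without replacement (illustrative helper)."""
    if num_draws > total_draws:
        # Mirror the PR's fallback: warn and use every available draw.
        warnings.warn(
            f"num_draws ({num_draws}) exceeds the available draws "
            f"({total_draws}); falling back to all draws."
        )
        return list(range(total_draws))
    rng = random.Random(random_seed)
    return sorted(rng.sample(range(total_draws), num_draws))
```

The chosen indices would then be applied to the draw dimension of the posterior (e.g. via something like `idata.isel(draw=indices)` in ArviZ), so every later computation runs only on that subset.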

@tomicapretto
Collaborator

@jgyasu thanks for the contribution! I want to check that I understand the logic. You're sub-setting the InferenceData object before doing any computation. Thus, everything that happens later, like computation of likelihood parameters and generation of predictive draws, is based on that number of draws. Is that correct?

@tomicapretto
Copy link
Collaborator

Two suggestions:

  • Add the option to pass a random_seed for reproducibility
  • Raise an error if num_draws is not None and inplace is True. If users do that, they will be dropping random draws that they won't be able to recover.
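The second suggestion could look something like the guard below at the top of predict (a sketch only; the function name is hypothetical and not Bambi's actual code):

```python
def check_num_draws_args(num_draws, inplace):
    # Dropping random draws from the InferenceData in place would discard
    # samples the user cannot recover, so forbid that combination outright.
    if num_draws is not None and inplace:
        raise ValueError(
            "`num_draws` cannot be used together with `inplace=True`; "
            "it would irrecoverably drop draws from the InferenceData."
        )
```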

@jgyasu
Author

jgyasu commented Jan 20, 2025

> @jgyasu thanks for the contribution! I want to check that I understand the logic. You're sub-setting the InferenceData object before doing any computation. Thus, everything that happens later, like computation of likelihood parameters and generation of predictive draws, is based on that number of draws. Is that correct?

Yes, that's right. All the computations remain the same; they just run on the subset of the total draws.

@jgyasu
Author

jgyasu commented Jan 20, 2025

> Two suggestions
>
>   • Add the option to pass a random_seed for reproducibility
>   • Raise an error if num_draws is not None and inplace is True. If users do that, they will be dropping random draws that they won't be able to recover.

Sure, thanks! I will apply these changes and make a new commit.

bambi/models.py — review thread (outdated, resolved)
@tomicapretto
Collaborator

@jgyasu thanks for the additional changes. I asked for another change, and I also realized we should add a test for this case. Just let me know if you need any guidance. What we want is a test for this num_draws argument, to make sure it works as expected (using fewer draws when it's smaller than the number of samples, raising an error when num_draws is not None and inplace is True, etc.).

@jgyasu
Author

jgyasu commented Jan 21, 2025

> @jgyasu thanks for the additional changes. I asked for another change, and I also realized we should add a test for this case. Just let me know if you need any guidance. What we want is a test for this num_draws argument, to make sure it works as expected (using fewer draws when it's smaller than the number of samples, raising an error when num_draws is not None and inplace is True, etc.).

Thanks, I will make the requested changes. And yes, this is what I was thinking! I'll try to write the tests, and if I face any problems, I'll let you know!

@jgyasu
Author

jgyasu commented Jan 22, 2025

> @jgyasu thanks for the additional changes. I asked for another change, and I also realized we should add a test for this case. Just let me know if you need any guidance. What we want is a test for this num_draws argument, to make sure it works as expected (using fewer draws when it's smaller than the number of samples, raising an error when num_draws is not None and inplace is True, etc.).

Hi @tomicapretto, I made the requested change. Going through the tests directory, my understanding is that the test should go in test_models.py — please correct me if I am wrong. There, I would add tests similar to:

```python
class FitPredictParent:
    def fit(self, model, **kwargs):
        return model.fit(tune=TUNE, draws=DRAWS, **kwargs)

    def predict_oos(self, model, idata, data=None):
        # Reuse the original data
        if data is None:
            data = model.data.head()
        return model.predict(idata, kind="response", data=data, inplace=False)
```

Should I create an additional class, or add an additional method that changes the arguments passed in predict_oos?

@tomicapretto
Collaborator

@jgyasu thanks!

You don't need to create a new class; the one you shared is a parent class used by the classes for specific model families. You can create one or two functions like this one:

https://github.com/bambinos/bambi/blob/6b66691ed08d88d98acbb5a229f5aeb258e7ab44/tests/test_models.py#L1319C1-L1340C42

In the .predict() call you pass the different options for the parameters you implemented and check that the behavior is as intended. For example, you need to catch an error when num_draws is not None and inplace is True. For that you can use a pattern like:

```python
with pytest.raises(ValueError, match="Model is not built yet"):
```

For num_draws, you can check that the number of draws in the returned idata matches the one you passed, etc.
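Put together, tests along these lines could work (a sketch only: predict_stub is a hypothetical stand-in for a fitted Bambi model, since the real tests would call model.predict on a fitted model):

```python
import pytest


def predict_stub(num_draws=None, inplace=False, total_draws=1000):
    # Stand-in for Model.predict with the guard discussed above.
    if num_draws is not None and inplace:
        raise ValueError("num_draws cannot be used with inplace=True")
    # Return how many draws the prediction would be based on.
    return min(num_draws, total_draws) if num_draws is not None else total_draws


def test_num_draws_with_inplace_raises():
    with pytest.raises(ValueError, match="inplace"):
        predict_stub(num_draws=10, inplace=True)


def test_num_draws_subsets_posterior():
    assert predict_stub(num_draws=100) == 100
```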

@jgyasu
Author

jgyasu commented Jan 26, 2025

> @jgyasu thanks!
>
> You don't need to create a new class; the one you shared is a parent class used by the classes for specific model families. You can create one or two functions like this one:
>
> https://github.com/bambinos/bambi/blob/6b66691ed08d88d98acbb5a229f5aeb258e7ab44/tests/test_models.py#L1319C1-L1340C42
>
> In the .predict() call you pass the different options for the parameters you implemented and check that the behavior is as intended. For example, you need to catch an error when num_draws is not None and inplace is True. For that you can use a pattern like with pytest.raises(ValueError, match="Model is not built yet"):. For num_draws, you can check that the number of draws in the returned idata matches the one you passed, etc.

Hi, I have written the tests, but when I tried to run them on my system, I got errors while building pytensor after installing all the dependencies.

I am on Linux, in an isolated conda environment; the error is here: https://gist.github.com/jgyasu/0c767099cfa201169a703b7b886d76d9

I wonder if I am missing some installation steps. I would appreciate any help. Thanks!

@GStechschulte
Collaborator

Hey @jgyasu thanks for the error output. Sorry for the naive recommendation, but have you tried creating a new environment and installing everything from scratch?

@tomicapretto
Collaborator

@jgyasu it's usually painful to work with PyMC and PyTensor, especially when installing new things.

What environment manager are you using? If you're using conda, and assuming you are at the root of the bambi repo, it should be as easy as:

```shell
conda create --name bambi-dev python=3.11
conda activate bambi-dev
conda install -c conda-forge pymc
pip install -e .        # install bambi in editable mode
pip install -e .[dev]   # dev dependencies
pip install -e .[jax]   # jax dependencies
```

Unfortunately it's not possible to do something like pip install -e .[jax, dev] (as far as I know).

Successfully merging this pull request may close these issues:

  • Possibility to pick number of samples for calls to posterior predictive?