Implementing tension statistics #333

DilyOng · 2023-08-24T11:27:13Z

Description

This is a work-in-progress pull request that aims to address #325 and also serves as a learning exercise in how to make a pull request.

Checklist:

I have performed a self-review of my own code
My code is PEP8 compliant (flake8 anesthetic tests)
My code contains compliant docstrings (pydocstyle --convention=numpy anesthetic)
New and existing unit tests pass locally with my changes (python -m pytest)
I have added tests that prove my fix is effective or that my feature works
I have appropriately incremented the semantic version number in both README.rst and anesthetic/_version.py

codecov · 2023-08-24T11:35:08Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 100.00%. Comparing base (a4c8521) to head (ce82e13).

Additional details and impacted files

@@            Coverage Diff            @@
##            master      #333   +/-   ##
=========================================
  Coverage   100.00%   100.00%           
=========================================
  Files           36        37    +1     
  Lines         3069      3097   +28     
=========================================
+ Hits          3069      3097   +28

williamjameshandley · 2023-08-24T12:25:16Z

Hi @DilyOng, many thanks for taking charge of incorporating this. Let's get it plumbed into anesthetic first, and then get feedback from others on whether anything is missing.

At the moment, this code is specialised to a specific naming scheme (which is what the union and intersection functions are doing), and for a wider grid.

I think we should re-organise this so that in the first instance it is more similar to @AdamOrmondroyd's suspiciousness package, but retaining the class/caching structure of tension_calculator.

Tasks:

Create a class TensionCalculator in a new file anesthetic/tension.py (note CamelCase rather than under_score naming).
This class should __init__ with A, B and AB, which are assumed to be NestedSamples, and cache self.A = A.stats(nsamples) (and the same for B and AB), which computes the nested sampling statistics (see also the docs).
It should then implement logR, logS, d, D_KL and p.
You should then create a tests/test_tension.py in the same style as the other test files, which uses the anesthetic.examples.perfect_ns.correlated_gaussian function to create mock A, B and AB, alongside equations 14 to 25 from 1902.04029, to test that the tension statistics code gets the correct answer.
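A minimal sketch of the class structure described in the tasks above, not the code that was eventually merged. It assumes each input exposes `.stats(nsamples)` returning a mapping with `logZ`, `D_KL`, `logL_P` and `d_G` entries, and it takes logS to be the combination of posterior-averaged log-likelihoods; both are assumptions of this sketch. `MockRun` is a hypothetical stand-in for a nested sampling run, and `p` is left out because it needs a chi-squared survival function.

```python
import numpy as np


class TensionCalculator:
    """Hypothetical sketch: cache the statistics of A, B and AB once."""

    def __init__(self, A, B, AB, nsamples=None):
        # stats() is called exactly once per run and the result is cached
        self.A = A.stats(nsamples)
        self.B = B.stats(nsamples)
        self.AB = AB.stats(nsamples)

    def _combine(self, key):
        # quantity(AB) - quantity(A) - quantity(B)
        return (np.asarray(self.AB[key]) - np.asarray(self.A[key])
                - np.asarray(self.B[key]))

    @property
    def logR(self):
        return self._combine('logZ')

    @property
    def logS(self):
        # assumption: built from posterior-averaged log-likelihoods
        return self._combine('logL_P')

    @property
    def logI(self):
        # D_KL(A) + D_KL(B) - D_KL(AB)
        return -self._combine('D_KL')

    @property
    def d(self):
        # d(A) + d(B) - d(AB)
        return -self._combine('d_G')


class MockRun:
    """Hypothetical stand-in for a run; counts calls to stats()."""

    def __init__(self, **values):
        self.values = values
        self.calls = 0

    def stats(self, nsamples=None):
        self.calls += 1
        return self.values


A = MockRun(logZ=-5.0, D_KL=3.0, logL_P=-2.0, d_G=2.0)
B = MockRun(logZ=-6.0, D_KL=4.0, logL_P=-2.0, d_G=3.0)
AB = MockRun(logZ=-12.0, D_KL=5.0, logL_P=-7.0, d_G=3.0)
calc = TensionCalculator(A, B, AB)
print(calc.logR, calc.logS, calc.logI, calc.d)
```

Accessing several properties reuses the cached statistics, which is the point of keeping the class structure.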

williamjameshandley · 2023-08-24T13:05:07Z

I think after that it would also be good to implement a function in addition to (or possibly in place of!) the class for producing a Samples object containing columns of logR, D_KL, d, S, p, as an analogue to the output of NestedSamples.stats.

tension_calculator.py

AdamOrmondroyd · 2023-08-31T20:51:21Z

Please remember to remove (git rm) the .DS_Store files you've added (they are to do with macOS file management, so not relevant to the repo).

…ation and testing it with correlated gaussian likelihoods. Found a problem with the function anesthetic.examples.perfect_ns.correlated_gaussian. The generated likelihood gaussian in the parameters is not normalised and the evidence is not unity. Need to take into account the LogLmax.

…ed_gaussian. Within the correlated_gaussian function, changed logLike function. Changed the function's description to match the fact that evidence is not unity.
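A rough sketch of why logLmax matters here, assuming a Gaussian likelihood and a uniform prior on a box that comfortably contains it. The function names `gaussian_loglike` and `gaussian_logZ` are hypothetical and are not the anesthetic API.

```python
import numpy as np


def gaussian_loglike(x, mean, cov, logLmax=0.0):
    """logL(x) = logLmax - (x - mean)^T cov^{-1} (x - mean) / 2."""
    dx = np.asarray(x, dtype=float) - np.asarray(mean, dtype=float)
    return logLmax - 0.5 * dx @ np.linalg.solve(cov, dx)


def gaussian_logZ(cov, bounds, logLmax=0.0):
    """Approximate log-evidence for a uniform prior on the box `bounds`.

    Assumes the Gaussian sits well inside the box, so the integral of the
    likelihood over the prior volume V is Lmax * sqrt(det(2 pi cov)) / V.
    """
    cov = np.atleast_2d(np.asarray(cov, dtype=float))
    bounds = np.asarray(bounds, dtype=float)
    logV = np.log(np.diff(bounds, axis=1)).sum()
    _, logdet = np.linalg.slogdet(2 * np.pi * cov)
    return logLmax + 0.5 * logdet - logV


cov = np.array([[0.01, 0.0], [0.0, 0.01]])
bounds = [[-1, 1], [-1, 1]]
print(gaussian_loglike([0.1, 0.3], [0.1, 0.3], cov, logLmax=-2.0))
print(gaussian_logZ(cov, bounds, logLmax=-2.0))
```

Under these assumptions the evidence is unity only for one particular combination of logLmax, covariance and prior volume, so a test has to carry logLmax through explicitly.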

…kelihood test case with the tests folder.

…nction tension_stats() for calculating tension statistics. Rewrote the test_tension_stats.py in tests to match the format of other files. It tests mock datasets with Gaussian likelihood. Both compatible and incompatible datasets have passed the test.

…dd a file for datasets pairwise_comparison, but not completed

williamjameshandley · 2024-02-28T13:41:53Z

It would be good to get this finalised and merged now that #348 is complete. Any thoughts, @DilyOng?

…the theoretical logR, logS and logI values sit within 3 std of the numerical solution's distribution from anesthetic, instead of testing between minimum and maximum values of the distribution.
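The change of test criterion described in this commit message can be sketched as below. The helper names `within_n_sigma` and `within_range` are hypothetical, and the sample values are made up for illustration only.

```python
from statistics import fmean, pstdev


def within_n_sigma(samples, exact, n=3):
    """True if `exact` lies within n standard deviations of the sample mean."""
    return abs(fmean(samples) - exact) < n * pstdev(samples)


def within_range(samples, exact):
    """Older criterion: `exact` lies between the sample minimum and maximum."""
    return min(samples) < exact < max(samples)


# Illustrative stand-in for a column such as logR from a stats table
logR_samples = [-1.1, -1.0, -0.9]
print(within_n_sigma(logR_samples, -1.05), within_range(logR_samples, -1.05))
print(within_n_sigma(logR_samples, -1.15), within_range(logR_samples, -1.15))
```

The min/max criterion depends on the most extreme draws and so tightens or loosens with the number of samples, whereas the standard-deviation criterion is tied to the width of the distribution.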

anesthetic/tension.py

anesthetic/tension_pvalue.py

williamjameshandley

Hi @DilyOng,

I'm happy for this to be merged now. The tests are 'failing' due to the numpy<2.0 flag, which won't be resolved until we merge #388, which needs a fastkde upgrade/deprecation.

Please press 'squash and merge'. Congrats on your first PR.

anesthetic/tension.py

williamjameshandley · 2024-09-19T10:05:20Z

@AdamOrmondroyd needs to approve the changes in order for this to be merged

AdamOrmondroyd

Comments inline

AdamOrmondroyd · 2024-09-19T10:42:17Z

tests/test_tension.py

+    covAB = inv(inv(covA) + inv(covB))
+    meanAB = covAB@(solve(covA, meanA)+solve(covB, meanB))
+    dmeanAB = np.array(meanA)-np.array(meanB)
+    logLmaxAB = -1/2 * dmeanAB@solve(covA+covB, dmeanAB) + logLmaxA + logLmaxB


Missing spaces around @
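For reference, the same four lines wrapped as a standalone function with the requested spacing around `@`. The function name `combine_gaussians` and its signature are hypothetical; the arithmetic is copied from the diff above.

```python
import numpy as np
from numpy.linalg import inv, solve


def combine_gaussians(meanA, covA, meanB, covB, logLmaxA=0.0, logLmaxB=0.0):
    """Mean, covariance and peak log-likelihood of the product of two Gaussians."""
    meanA = np.asarray(meanA, dtype=float)
    meanB = np.asarray(meanB, dtype=float)
    covA = np.asarray(covA, dtype=float)
    covB = np.asarray(covB, dtype=float)
    # Precisions add, and the combined mean is the precision-weighted mean
    covAB = inv(inv(covA) + inv(covB))
    meanAB = covAB @ (solve(covA, meanA) + solve(covB, meanB))
    # The peak of the joint likelihood is lowered by the mismatch of the means
    dmeanAB = meanA - meanB
    logLmaxAB = (-1/2 * dmeanAB @ solve(covA + covB, dmeanAB)
                 + logLmaxA + logLmaxB)
    return meanAB, covAB, logLmaxAB


meanAB, covAB, logLmaxAB = combine_gaussians(
    [0.0, 0.0], np.eye(2), [2.0, 0.0], np.eye(2))
print(meanAB, covAB, logLmaxAB)
```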

AdamOrmondroyd · 2024-09-19T10:42:33Z

tests/test_tension.py

+
+    logS_std = samples_stats.logS.std()
+    logS_mean = samples_stats.logS.mean()
+    logS_exact = d/2 - 1/2*dmeanAB@solve(covA+covB, dmeanAB)


AdamOrmondroyd · 2024-09-19T10:43:43Z

tests/test_tension.py

+    bounds = [[-1, 1], [0, 3], [0, 1]]
+
+    meanA = [0.1, 0.3, 0.5]
+    covA = np.array([[.01, 0.009, 0],


Missing 0 in 0.01 (there are a few of these, I'm technically on holiday and using my phone so you can find the rest)

… computation to save computing time for high-nsamples runs

…ng stats to tension stats

lukashergt · 2024-09-27T09:26:40Z

anesthetic/tension.py

+    samples['logI'] = statsA['D_KL'] + statsB['D_KL'] - statsAB['D_KL']
+    samples.set_label('logI', r'$\ln\mathcal{I}$')


@williamjameshandley: The notation with logI matches the one in eq. (9) of Quantifying tensions in cosmological parameters. However, the Shannon information I_S = log(P/pi) already carries a logarithm in its notation, so I find this logI notation confusing. Wouldn't just I be more appropriate?

The notation logR = logS - I also matches better the Occam equation logZ = logL_P - D_KL...

Thoughts?

Added Will's example Class for tension calculator

c0946bd

Changed Version from 2.3.0 to 2.4.0

70c0aa4

williamjameshandley assigned williamjameshandley and DilyOng Aug 24, 2023

williamjameshandley added the enhancement (New feature or request) and good first issue (Good for newcomers) labels Aug 24, 2023

AdamOrmondroyd requested changes Aug 24, 2023

tension_calculator.py (outdated)

Added Class tension

ee1db5b

lukashergt and others added 12 commits September 29, 2023 17:20

Merge branch 'master' into tension

ffd5fae

version bump to 2.5.0

a3d06b5

Added logLmax to the function anesthetic.examples.perfect_ns.correlat…

38d31c2

…ed_gaussian. Within the correlated_gaussian function, changed logLike function. Changed the function's description to match the fact that evidence is not unity.

Clean up old tension statistics files. Put the correlated Gaussian li…

932a201

…kelihood test case with the tests folder.

Merge branch 'tension' of github.com:handley-lab/anesthetic into tension

4046bc3

Updated logLmax

de17f04

remove DS_Store

6ab8e4f

Merge branch 'master' into tension

3698669

Updated the tests/test_tension_stats.py file for flake8 compliance. A…

2f472b5

…dd a file for datasets pairwise_comparison, but not completed

Updated tests/test_tension_stats.py

5209ad2

DilyOng and others added 3 commits March 4, 2024 18:42

Updated anesthetic/tests/test_tension_stats.py. Now it tests whether …

6fb5fe0

…the theoretical logR, logS and logI values sit within 3 std of the numerical solution's distribution from anesthetic, instead of testing between minimum and maximum values of the distribution.

Merge branch 'master' into tension

151a91d

bump version to 2.9.0

b0994b3

AdamOrmondroyd reviewed Mar 5, 2024

anesthetic/tension.py (outdated)

anesthetic/tension.py (outdated)

anesthetic/tension_pvalue.py (outdated)

Deleted duplicate files.

aec6a25

lukashergt and others added 2 commits April 9, 2024 11:50

Merge branch 'master' into tension

e568bfd

Updated the docstrings in anesthetic/tension.py.

4947c69

williamjameshandley mentioned this pull request Sep 19, 2024

Implement tension statistics for Samples not just NestedSamples #394

Open

williamjameshandley added 6 commits September 19, 2024 09:59

Updated docstrings

7a10785

Merge branch 'master' into tension

970f431

Updated docstring to avoid Samples

69c932a

replaced \\m

7e90ac5

Further string corrections

75d3067

Further debugging docstrings

740eacf

williamjameshandley previously approved these changes Sep 19, 2024

anesthetic/tension.py (outdated)

Logarithmic -> Logarithm

d236f34

williamjameshandley dismissed their stale review via d236f34 September 19, 2024 10:02

williamjameshandley requested a review from AdamOrmondroyd September 19, 2024 10:04

AdamOrmondroyd requested changes Sep 19, 2024

lukashergt and others added 9 commits September 27, 2024 02:05

Merge branch 'master' into tension

79838dd

correct spelling of compatible

73d6180

surround @ by spaces

5675e25

remove occurrences of missing leading 0, e.g. .01

a7b7fb5

use ln rather than log for the latex labels

e3bf650

streamline and speed up tension tests a bit

9dae22f

make tension docstring more readable

8c70f2f

optionally allow for passing a pre-computed stats instance to tension…

88b7ec6

… computation to save computing time for high-nsamples runs

simplify tension tests and add test for direct input of nested sampli…

51c0975

…ng stats to tension stats

lukashergt requested review from AdamOrmondroyd and williamjameshandley September 27, 2024 09:52

lukashergt reviewed Sep 27, 2024

lukashergt added 3 commits September 27, 2024 14:28

Update README.rst

0649d53

Update _version.py

6b07788

Merge branch 'master' into tension

ce82e13
