Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Highest Posterior Density #87

Open
hugobowne opened this issue Feb 15, 2020 · 6 comments
Open

Highest Posterior Density #87

hugobowne opened this issue Feb 15, 2020 · 6 comments

Comments

@hugobowne
Copy link
Collaborator

how do you like to describe HPD @ericmjl @justinbois @betanalpha ?

@justinbois
Copy link

http://bebi103.caltech.edu.s3-website-us-east-1.amazonaws.com/2020b/content/lecture_notes/lecture_04/posterior_summaries.html

From that, "If we’re considering a 95% credible interval, the HPD interval is the shortest interval that contains 95% of the probability mass of the posterior."

@ericmjl
Copy link
Owner

ericmjl commented Feb 17, 2020

I was struggling for the right words, thanks @justinbois!

@betanalpha
Copy link

I don't describe the HPD at all because it's not a well-defined probabilistic object, i.e. it can't be defined as expectation values (or differences of expectation values in the case of quantiles) and hence can't be put into a decision theoretic framework. The subtlety in @justinbois's description is that "shortest" is not well-defined and will in general change when you reparameterize the ambient space. For me the additional computational problem with HPD estimators in the case of nontrivial target distributions makes the concept to confusing to be worth considering at all.

@ericmjl
Copy link
Owner

ericmjl commented Feb 17, 2020

What if we reported just the interval from the 2.5th to the 97.5th percentile instead?

@justinbois
Copy link

Yes, @ericmjl , that's what @betanalpha advocates, and I usually do, too. The percentiles will not change for any change of variables for which the function used in changing the variables is monotonic.

@betanalpha
Copy link

To be honest 2.5% and 97.5% require way too many effective samples to be reasonable defaults. If you need 100 effective samples below the 2.5% quantile to estimate it with any decent precision then you need about 4000 effective samples total, which requires far more work than needed for most other estimation goals. When using intervals I just default to 10%-90%, and then if showing nested intervals continuing on to 20%-80%, 30%-70%, 40%-60%, and then the median. If I just need to summarize a one-dimensional marginal then I'll usually use a histogram, looking at the ECDF if there's any suspicion of odd effects like repeated values.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants