Collect draw-wise projection warnings and check projection convergence #478

fweber144 · 2023-11-22T13:36:15Z

This PR makes projpred catch messages and warnings from the draw-wise divergence minimizers and also check their convergence (as well as possible). Previously, projpred suppressed such messages and warnings and did not check convergence (PRs #259 and #444 started/modified the convergence checker, but it has remained a "hidden"—because unfinished—feature until now).

For deactivating these two features, global options projpred.warn_prj_drawwise and projpred.check_conv have been added (see the NEWS.md entries added here).

In my opinion, especially the convergence checker is a crucial feature, see, e.g., issue #323. The messages and warnings from the draw-wise divergence minimizers are intended as a help for the user to find out what might be going wrong without having to debug.

The convergence checks for additive models are probably still incomplete, even with this PR. I'll open a new issue for this.

Illustration:

# Setup -------------------------------------------------------------------

warn_length_orig <- options(warning.length = 8170)
devtools::load_all()

# glm_ridge(), glm_elnet() as submodel fitters ----------------------------

data("df_binom", package = "projpred")
dat <- data.frame(y = df_binom$y, df_binom$x)
fit_glm <- rstanarm::stan_glm(y ~ X1 + X2 + X3,
                              family = binomial(),
                              data = dat,
                              chains = 1,
                              iter = 500,
                              seed = 1140350788,
                              refresh = 0)

# Warning from glm_ridge():
prj <- project(fit_glm, predictor_terms = c("X1"), nclusters = 1, thresh = 0)

# Warning from glm_ridge() during the refits for performance evaluation:
vs <- varsel(fit_glm, method = "L1", nclusters_pred = 2, qa_updates_max = 2)
# Alternatively (this is a different warning, though):
vs <- varsel(fit_glm, method = "L1", nclusters_pred = 2, thresh_conv = 0)

# Warning from glm_ridge() during the forward search as well as during the
# refits for performance evaluation:
vs <- varsel(fit_glm, nclusters = 1, nclusters_pred = 2, qa_updates_max = 2)
# Alternatively (this is a different warning, though):
vs <- varsel(fit_glm, nclusters = 1, nclusters_pred = 2, thresh_conv = 0)

# Warning from glm_ridge() during the forward search:
vs <- varsel(fit_glm, nclusters = 1, nclusters_pred = 2,
             search_control = list(qa_updates_max = 2))
# Alternatively (this is a different warning, though):
vs <- varsel(fit_glm, nclusters = 1, nclusters_pred = 2,
             search_control = list(thresh_conv = 0))

# Warning from glm_ridge() during the refits for performance evaluation:
vs <- varsel(fit_glm, nclusters = 1, nclusters_pred = 2,
             search_control = list(), qa_updates_max = 2)
# Alternatively (this is a different warning, though):
vs <- varsel(fit_glm, nclusters = 1, nclusters_pred = 2,
             search_control = list(), thresh_conv = 0)

# Warning from glm_elnet() during the L1 search:
vs <- varsel(fit_glm, method = "L1", refit_prj = FALSE,
             search_control = list(thresh = 1e-330, nlambda = 1))

# MASS::polr() as submodel fitter -----------------------------------------

data("inhaler", package = "brms")
inhaler$rating <- as.factor(paste0("rtg", inhaler$rating))

fit_polr <- rstanarm::stan_polr(
  rating ~ period + carry + treat,
  data = inhaler,
  prior = rstanarm::R2(location = 0.5, what = "median"),
  chains = 1,
  iter = 500,
  seed = 1140350788,
  refresh = 0
)

# Non-convergence in MASS::polr():
prj <- project(fit_polr, predictor_terms = c("carry", "treat"), nclusters = 1,
               control = list(maxit = 1))

# Teardown ----------------------------------------------------------------

options(warn_length_orig)

for checking the convergence of a single submodel fit (not of a whole `outdmin` object).

…model.

…warnings.

thrown in case of global option `projpred.warn_submodel_fits` set to `TRUE`.

… fit) to a warning (to avoid that this causes an error; the code should still run through).

…bmodel_fits` and `projpred.check_conv` (these local arguments can be passed to top-level functions like `varsel()`, `cv_varsel()`, and `project()`).

… most complex model.

where tuning parameters may be found (which in turn is achieved by mentioning the class(es) of the submodel fits).

…l posterior draws.

…model to most complex model." Reason for the revert: For example, `class(<gam_fit>)` yields `c("gam", "glm", "lm")`, so it's indeed better to start with the most complex type of model.

…ique `stdout()` output messages as warnings.

(that's why we already needed all those `warn_expected <- "non-integer tests to `warn_prj_drawwise()` and `check_conv()` as well.

`fit_s$mgcv.conv$fully.converged` may be (or perhaps is always) `NULL`.

…nts ignored`.

(to avoid that such a minor issue as a defective convergence checker prevents the code from running through).

…ed.check_conv` (in the general package documentation).

…r the convergence checker.

thrown if the draw-wise divergence minimizer threw only informational messages).

fweber144 added 30 commits November 22, 2023 14:16

Move code from check_conv() out into a stand-alone helper function

55f23c7

for checking the convergence of a single submodel fit (not of a whole `outdmin` object).

Make the check_conv() warning more general.

2cf3db9

Fix a comment in check_conv_s().

3a2c8a3

Enhance check_conv_s() in case of an additive multilevel (GAMM) sub…

b41a4f0

…model.

Enhance check_conv_s() in case of an additive (GAM) submodel.

77b5183

Turn the stdout() output from glm_ridge() and glm_elnet() into …

40b303a

…warnings.

Use capt_mssgs_warns() in divmin().

ffe961b

Throw warnings like "Warning in foo() : some warning starting here:".

3abb890

Don't use try() where not necessary.

6ef0cbe

Check messages and warnings on a draw-by-draw basis.

fdad1fd

Allow re-use of object mssgs_warns_capts and enhance the warning

71bda6d

thrown in case of global option `projpred.warn_submodel_fits` set to `TRUE`.

check_conv_s(): Turn the error (in case of an unrecognized submodel…

c5cf96a

… fit) to a warning (to avoid that this causes an error; the code should still run through).

Use a default of TRUE for global option projpred.check_conv.

f13f177

Move the check_conv() call to divmin().

3efdd68

Move option projpred.check_conv into check_conv().

5e95b72

Return NULL consistently (see warn_pareto()).

7e21049

Create function warn_submodel_fits().

f8b42db

Add local arguments corresponding to global options `projpred.warn_su…

b86982c

…bmodel_fits` and `projpred.check_conv` (these local arguments can be passed to top-level functions like `varsel()`, `cv_varsel()`, and `project()`).

Adapt divmin_augdat() analogously to divmin().

9fd72c7

check_conv_s(): Enhance a comment for subfits.

313ac99

check_conv_s(): Re-order the if cases from least complex model to…

01830f0

… most complex model.

Enhance the warning in check_conv() by giving a more precise hint

8446064

where tuning parameters may be found (which in turn is achieved by mentioning the class(es) of the submodel fits).

Filter out some warnings also in divmin().

efed1dd

Tests: Remove unnecessary braces.

b6e5ef7

Tests: Use the global option to suppress warnings collected across al…

13a1958

…l posterior draws.

Tests: Don't use the convergence checker.

5ad503d

Revert "check_conv_s(): Re-order the if cases from least complex …

d55b66f

…model to most complex model." Reason for the revert: For example, `class(<gam_fit>)` yields `c("gam", "glm", "lm")`, so it's indeed better to start with the most complex type of model.

Extend check_conv_s() to polr fits.

60aa9fb

Extend check_conv_s() to clmm fits.

ca247dd

Extend check_conv_s() to multinom fits.

24477ca

fweber144 added 16 commits November 22, 2023 14:16

Extend check_conv_s() to mmblogit fits.

c9e6650

search_L1_surrogate() and fit_glm_ridge_callback(): Only throw un…

6a7a5e5

…ique `stdout()` output messages as warnings.

L1 search: Enhance the warning message.

d8f5585

warn_submodel_fits(): Fix the warning message.

29ec582

Replace all occurrences of warn_submodel_fits by warn_prj_drawwise.

08302f3

Minor enhancements for warn_prj_drawwise().

94e7a31

Tests: testthat handles stderr() in its own way

e0426c1

(that's why we already needed all those `warn_expected <- "non-integer tests to `warn_prj_drawwise()` and `check_conv()` as well.

The tests revealed that for GAMs with the binomial family,

9643525

`fit_s$mgcv.conv$fully.converged` may be (or perhaps is always) `NULL`.

fixup! Tests: testthat handles stderr() in its own way

cb0cf84

fit_glmer_callback(): Avoid the lme4 warning `unused control argume…

b769378

…nts ignored`.

Tests: Unfortunately, we need to suppress warnings.

b51fcfe

Catch errors when calling check_conv_s() and throw a warning instead

ef20193

(to avoid that such a minor issue as a defective convergence checker prevents the code from running through).

Fix the warning message from warn_prj_drawwise().

49e24e0

Docs: Mention global options projpred.warn_prj_drawwise and `projpr…

6fab7c5

…ed.check_conv` (in the general package documentation).

Add NEWS.md entries for the collection of draw-wise warnings and fo…

c6a3b5e

…r the convergence checker.

Formulate check_conv()'s warning more cautiously (because it is also

ab6544e

thrown if the draw-wise divergence minimizer threw only informational messages).

fweber144 merged commit 97c5bea into stan-dev:master Nov 22, 2023

fweber144 deleted the check_conv_public branch November 22, 2023 14:46

fweber144 mentioned this pull request Nov 22, 2023

Convergence checks for additive submodels #479

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Collect draw-wise projection warnings and check projection convergence #478

Collect draw-wise projection warnings and check projection convergence #478

fweber144 commented Nov 22, 2023

Collect draw-wise projection warnings and check projection convergence #478

Collect draw-wise projection warnings and check projection convergence #478

Conversation

fweber144 commented Nov 22, 2023