Bayesian multiple membership multilevel models with parameterizable weights using 'JAGS'.

The rmm package provides an interface to fit Bayesian multiple membership multilevel models with parameterizable weights using JAGS.

rmm(
  formula,
  family = "Gaussian",
  priors = NULL,
  inits = NULL,
  n.iter = 1000,
  n.burnin = 500,
  n.thin = max(1, floor((n.iter - n.burnin)/1000)),
  chains = 3,
  seed = NULL,
  run = T,
  parallel = F,
  monitor = T,
  transform = F,
  modelfile = F,
  data = NULL
)

Arguments

formula: A symbolic description of the model in form of an R formula. More details below.
family: Character vector. Currently supported are "Gaussian", "Binomial", "Weibull", and "Cox". Not yet implemented: "CondLogit"
priors: A list with parameter names as tags and their prior specification as values. More details below.
inits: A list with parameter names as tags and their initial values as values. The same list is used in all chains. If NULL, JAGS and rmm select appropriate initial values.
n.iter: Total number of iterations.
n.burnin: Number of iterations that will be discarded.
n.thin: Thinning rate.
chains: Number of chains.
seed: A number used as the seed of the random number generator.
run: A logical value (TRUE or FALSE) indicating whether JAGS should estimate the model.
monitor: A logical value (TRUE or FALSE). If TRUE, the weights, random effects, predictions, and JAGS output are returned as well.
transform: Character vector or FALSE. Specify "center" or "std" to center or standardize continuous predictors before estimation. Specifying "std2" divides by two times the standard deviation, so that regression coefficients are comparable to those of binary predictors (Gelman 2008).
modelfile: Character vector, TRUE, or FALSE. If TRUE, the JAGS model is saved in rmm/temp/modelstring.txt; run .libPaths() to see where R packages are stored. If a file path is supplied as a string, rmm only creates the data structure and uses the provided model file.
data: Dataframe object. The dataset must have level 1 as unit of analysis. More details below.
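
The three transform options can be illustrated with base R. This is only a sketch of the arithmetic, not the package's internal code: the vector x is hypothetical, and it is an assumption that "std2" centers the predictor before dividing by two standard deviations (as in Gelman 2008).

```r
# Hypothetical continuous predictor
x <- c(2, 4, 6, 8, 10)

x_center <- x - mean(x)                  # "center": subtract the mean
x_std    <- (x - mean(x)) / sd(x)        # "std": mean 0, standard deviation 1
x_std2   <- (x - mean(x)) / (2 * sd(x))  # "std2": standard deviation 0.5 (centering assumed)

# Standard deviations after each transformation
c(center = sd(x_center), std = sd(x_std), std2 = sd(x_std2))
```

A predictor scaled by two standard deviations has a standard deviation of 0.5, which is the standard deviation of a binary predictor with equal shares of 0 and 1. That is why the coefficients become comparable.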

Value

A list with 7 elements: reg.table, w, re.l1, re.l3, pred, input, jags.out. If monitor=F, only the regression table (reg.table) is returned. If monitor=T, the list also contains the predicted weights (w), the level-1 random effects (re.l1, if specified in the model), the level-3 random effects (re.l3, if specified in the model), the predicted values of the dependent variable (pred), and the internally created variables (input). The last element, jags.out, is the unformatted JAGS output.

Details

The package's core function, rmm(), allows users to specify a multiple membership multilevel model with parameterizable weights. The package generates the JAGS code to fit the model and processes the JAGS output to aid the interpretation of the model results.

In order to fit the models, JAGS must be installed.

The rmm package estimates models with a complex, nonstandard multilevel structure, known as a multiple membership multilevel structure. Unlike other packages and programs for estimating multiple membership multilevel models, such as brms or MLwiN, rmm allows users to parameterize the weights using a weight function specified in formula syntax. This feature enables researchers to investigate how the effects of lower-level units aggregate to a higher level (micro-macro link).

Accessible introductions to multiple membership models can be found in the report by Fielding and Goldstein (2006) and the book chapter by Beretvas (2010). More advanced discussions are provided in Goldstein's multilevel modeling textbook (2011, Chapter 13), the book chapters on multiple membership models by Rasbash and Browne (2001, 2008), the paper by Browne et al. (2001), and the report by Leckie (2013).

General formula structure

Y ~ 1 + mm(id(l1id, l2id), mmc(X.L1), mmw(w ~ 1 / N, constraint=1)) + X.L2 + X.L3 + hm(id=l3id, name=l3name, type=FE, showFE=F)

Dependent variable: Y
Multiple membership object: mm() to analyze how the effects of level-1 predictors from multiple constituting members aggregate to level 2
Level-2 predictors: X.L2, e.g., X1 + ... + XN
Level-3 predictors: X.L3, e.g., X1 + ... + XN
Hierarchical membership object: hm() to recognize that level-2 units are embedded in a third level
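
To make the three levels concrete, the following sketch builds a toy dataset with level 1 as the unit of analysis, that is, one row per level-1 unit per level-2 membership. All column names and values are hypothetical and only mirror the placeholders in the general formula. The layout is an illustration of the description above, not a specification of what rmm requires.

```r
# One row per membership of a level-1 unit in a level-2 entity (hypothetical values)
dat <- data.frame(
  l1id = c(1, 2, 3, 1, 3, 4, 5),   # level-1 units; units 1 and 3 belong to two entities
  l2id = c(1, 1, 1, 2, 2, 3, 3),   # level-2 entities
  l3id = c(1, 1, 1, 1, 1, 2, 2),   # level-3 units containing the level-2 entities
  X.L1 = c(0.2, 0.5, 0.1, 0.3, 0.4, 0.9, 0.6),  # level-1 predictor, varies by row
  X.L2 = c(1, 1, 1, 0, 0, 1, 1)    # level-2 predictor, constant within l2id
)

# N: number of level-1 units per level-2 entity, as used in w ~ 1/N
dat$N <- ave(dat$l1id, dat$l2id, FUN = length)
dat
```

Level-2 and level-3 variables are simply repeated on every row that belongs to the same entity.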

Currently supported dependent variables / link functions

Gaussian: continuous variable Y
Binomial: outcome Y for logistic regression
CondLogit: conditional logistic outcomes (not yet implemented)
Weibull: survival time, Surv(survivaltime, event)
Cox: survival time, Surv(survivaltime, event)

Vector of level-2 predictors

An intercept at level 2, the main level, is added whether or not a 1 is specified at the beginning of the formula. Interaction terms have to be included as separate variables.

Multiple membership object mm()

id() to indicate level-1 and level-2 ids
mmc() to specify level-1 predictors. No intercept allowed. Interaction terms have to be included as separate variables. Can be left empty.
mmw() to specify the weight function (micro-macro link). The function can be nonlinear and contain variables, but it needs to be identifiable. For example, w ~ 1/N specifies mean aggregation, with N being a variable that indicates the number of level-1 units per level-2 entity. If no mmw() is specified, w ~ 1/N is assumed. In Rosche (2025), I propose using w ~ 1/N^exp(-(b1*X.W)) as the general form for weight functions. This form ensures that weights are bounded between 0 and 1. Parameters can be specified as b1, b2, ..., bn.

Two identification restrictions are provided:
- mmw(w ~ ..., constraint=1): restricts the weights to sum to 1 for each level-2 entity (default).
- mmw(w ~ ..., constraint=2): restricts the weights to sum to the total number of level-2 entities over the whole dataset, allowing the weights of some level-2 entities to sum to less or more than 1.

In addition, the ar argument controls the level-1 random effects:
- mmw(w ~ ..., ar=TRUE): allows the random effects of level-1 units to change across memberships in level-2 entities.
- mmw(w ~ ..., ar=FALSE): assumes that each level-1 unit has a single random effect.
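
The weight function and the two sum restrictions can be sketched numerically in base R. The values of X.W and b1 below are hypothetical, and the rescaling steps are only an assumed way to mimic constraint=1 and constraint=2; the JAGS code that rmm generates may impose the restrictions differently.

```r
# Hypothetical level-1 rows: two level-2 entities with 3 and 2 members
l2id <- c(1, 1, 1, 2, 2)
N    <- c(3, 3, 3, 2, 2)           # number of level-1 units per level-2 entity
X.W  <- c(-1, 0, 1, 0.5, -0.5)     # hypothetical weight covariate
b1   <- 0.8                        # hypothetical weight coefficient

w_mean <- 1 / N                    # w ~ 1/N: mean aggregation
w_raw  <- 1 / N^exp(-(b1 * X.W))   # general form; reduces to 1/N when b1 = 0

# Assumption: mimic constraint=1 by rescaling within each level-2 entity
w_c1 <- w_raw / ave(w_raw, l2id, FUN = sum)

# Assumption: mimic constraint=2 by rescaling so that all weights
# sum to the number of level-2 entities
w_c2 <- w_raw * length(unique(l2id)) / sum(w_raw)

tapply(w_c1, l2id, sum)   # sums to 1 within each level-2 entity
tapply(w_c2, l2id, sum)   # entity sums may differ from 1
sum(w_c2)                 # overall sum equals the number of level-2 entities
```

Because N is at least 1 and exp() is always positive, the unscaled weights stay in the interval (0, 1], which is the bounding property described above.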

Hierarchical membership object hm()

id=l3id to indicate level-3 id
name=l3name to specify value labels for level 3 units.
type=RE (default) or type=FE to choose between random-effects and fixed-effects estimation. If RE is chosen, level-3 predictors can be added. If FE is chosen, each level-3 unit has its own intercept and level-3 predictors are removed. If showFE=TRUE, the fixed effects are reported; otherwise they are omitted (default). The first l3id is the base category.

More details on changing priors

Priors of the following parameters may be changed: b.l1, b.l2, b.l3, b.w, tau.l1, tau.l2, tau.l3. The priors are specified as a character vector: priors=c("b.l1~dnorm(0,0.01)"). In this example, the priors of all level-1 regression coefficients are changed to a more informative prior that has a smaller variance than the default (dnorm(0,0.0001)). I refer to the JAGS manual for more details on possible prior specifications.
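
When reading these prior strings, keep in mind that JAGS parameterizes dnorm by mean and precision (1/variance), not by standard deviation as R's dnorm does. The helper below is a hypothetical convenience function, not part of rmm, that converts a precision into the implied prior standard deviation.

```r
# JAGS: dnorm(mean, precision), with precision = 1 / variance
prior_sd <- function(precision) 1 / sqrt(precision)

prior_sd(0.0001)  # default dnorm(0, 0.0001): standard deviation 100
prior_sd(0.01)    # dnorm(0, 0.01): standard deviation 10
prior_sd(0.1)     # dnorm(0, 0.1): standard deviation of about 3.16
```

A larger second argument therefore means a tighter, more informative prior.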

More details on the weight function

...

More details on constructing the data

...

Tips

Error in update.jags(model, n.iter, ...) : Error in node w[1285] Invalid parent values
The weight function must be designed such that the distribution of weights is in line with the priors for all other parameters. This error can occur, for instance, if weights can be negative and negative weights push the distribution of other parameters outside the distribution of their priors. Carefully designing the weight function so that it is properly bounded may therefore help. Specifying transform="std" may help as well.

Including weight regressors demands a lot from your data
It is therefore a good idea to start with slightly more informative priors. I suggest starting with priors = list("b.w"="dnorm(0,0.1)") and then increasing the variance step by step.

...

References

Rosche, B. (2025). A multilevel model for coalition governments: Uncovering dependencies within and across governments due to parties. https://doi.org/10.31235/osf.io/4bafr

Author

Benjamin Rosche <benjamin.rosche@gmail.com>

Examples

data(coalgov)
# family="Weibull" takes two variables on the left-hand side: Surv(survtime, event)
m1 <- rmm(Surv(govdur, earlyterm) ~ 1 + mm(id(pid, gid), mmc(fdep), mmw(w ~ 1/n, constraint=1)) + majority + hm(id=cid, name=cname, type=RE, showFE=F),
          family="Weibull", monitor=T, data=coalgov)
m1$reg.table # the regression output
m1$w         # the estimated weights
m1$re.l1     # the level-1 random effects
m1$re.l3     # the level-3 random effects
m1$pred      # posterior predictions of the dependent variable (linear predictor for family="Gaussian", survival time for family="Weibull")
m1$input     # internal variables
jags.out <- m1$jags.out # JAGS output
summary(m1) # regression output
monetPlot(m1, "b.l1") # monetPlot to inspect the posterior distribution of the model parameters