22–25 Jul 2025
EAM2025
Atlantic/Canary timezone

A Cautionary Note on Simulating Multilevel Data

23 Jul 2025, 11:30
30m
Faculty of Social Sciences and Communication. (The Pyramid)/. - Foyer (Faculty of Social Sciences and Communication. (The Pyramid))

Faculty of Social Sciences and Communication. (The Pyramid)/. - Foyer

Faculty of Social Sciences and Communication. (The Pyramid)

300
Show room on map
Poster Statistical analyses Poster Session 1

Speakers

Diego Iglesias (Universidad Autónoma de Madrid) Miguel A. Sorrel (Universidad Autónoma de Madrid) Ricardo Olmos (Universidad Autónoma de Madrid)

Abstract

Simulation studies are widely used in methodological research to assess the performance of statistical methods in scenarios relevant to the behavioral and social sciences. In some cases, the real-world scenario being simulated can be statistically complex. Specifically, in the case of multilevel data, where observations are nested within clusters, simulating level-2 variables and effects may not be straightforward. For example, when simulating random intercept variation, slopes, and level-2 predictors, a common approach is to sample observations from a distribution with a fixed mean and variance and then repeat the sampled values as many times as there are observations per cluster. However, this repetition alters the variance of the resulting variables, which no longer matches the population values previously specified. In this study, we illustrate the impact of this issue in two contexts: parameter estimation and power analysis in multilevel models. Through illustrative simulations, we show that when multilevel data are generated using this approach, having few clusters leads to simulated datasets in which the variance of intercepts, slopes, and level-2 predictors is 10% lower than the intended value. This discrepancy biases the results obtained when fitting statistical models to these simulated data, leading to an underestimation of the variance of intercepts and slopes, as well as the statistical power associated with level-2 effects. We present how the data generating process can be adjusted to correct this issue and discuss practical situations in which failing to account for it could have a greater impact.

Poster A Cautionary Note on Simulating Multilevel Data
Author Diego Iglesias
Affiliation Universidad Autónoma de Madrid
Keywords Multilevel Models, Simulation, Estimation, Power

Primary author

Diego Iglesias (Universidad Autónoma de Madrid)

Co-authors

Miguel A. Sorrel (Universidad Autónoma de Madrid) Ricardo Olmos (Universidad Autónoma de Madrid)

Presentation materials

There are no materials yet.