- Research
- Open access
- Published:
How robust is the association between school-related stress and internalizing mental health problems? A specification curve analysis
BMC Psychiatry volume 25, Article number: 413 (2025)
Abstract
Background
A recent review of research on the association between school-related stress and internalizing problems found growing scholarly interest in the topic, but also raised questions concerning the quality and reliability of the existing knowledge base. The aim of this study was to investigate how robust the association between school-related stress and internalizing problems is to differences in model specifications.
Methods
Longitudinal survey data from between 2,991 and 4,845 Swedish adolescent students aged 13–16 years were used. A total of 57,322 different models were estimated, varying the choice of sample, measure of internalizing problems, functional form, statistical method, and combinations of included control variables. The results were summarized using specification curve analysis.
Results
Most estimates of the association between school-related stress and internalizing problems were statistically significant at the 5% level. The choice of sample, outcome, functional form, and control variables had a limited influence on the size and significance of the estimates, but the estimates were markedly smaller and mostly non-significant in models investigating lagged effects.
Conclusions
This study showed that school-related stress is a robust predictor of internalizing problems as long as the association is assumed to be contemporaneous, while evidence for lagged effects was weaker. A key conclusion is that the choice of whether to estimate lagged or contemporaneous effects may be the most consequential in studies on school-related stress and internalizing problems or similar topics.
Background
Rates of internalizing mental health problems have increased among adolescents across high-income countries [1, 2]. Internalizing problems encompass symptoms related to anxiety, fear, sadness, depressed mood, feelings of worthlessness, and social withdrawal. Psychosomatic health complaints such as headaches, stomachaches, or feeling tense can also be considered forms of internalizing problems [3,4,5]. Known risk factors for internalizing problems include stressful life events and prolonged exposure to stress [6,7,8]. One of the most commonly reported sources of stress among adolescents is stress or pressure related to school, including stress related to academic demands, achievements, expectations, or school-based social interactions [9,10,11,12,13,14,15,16].
Recently, calls have been made to “take [school-related stress] seriously” as a risk factor for internalizing problems, and as a potential driver of the worrying trends in such problems in adolescents [17, 18]; see also [1, 2, 19]. Accordingly, a recent systematic review of research on the association between school-related stress and internalizing problems found growing scholarly interest in the topic, with 40 of the 52 included studies having been published since 2010, and 20 since 2020 [20]. However, although 48 of the 52 studies reported positive associations between school-related stress and internalizing problems, “most studies were cross-sectional“ and many “insufficiently accounted for [confounders]” [20], prompting calls for more longitudinal research that includes a broader range of confounders.
Responding to this call, the aim of this study was to investigate how robust the association between school-related stress and internalizing problems is to differences in model specifications. To this end, longitudinal survey data on Swedish adolescent students aged 13–16 years were analyzed using specification curve analysis [21]. Specification curve analysis is a statistical technique used to explore the robustness of research results by systematically varying possible model specifications. It allows researchers to assess the extent to which results are dependent on specific, and often arbitrary, choices made during the research process, thus enhancing the transparency of reported findings and reducing the scope for p-hacking or other unsound research practices. In this study, a total of 57,322 different models were estimated, varying the choice of sample, measure of internalizing problems, functional form, statistical method, and control variables.
Data and methods
Data from the Evaluation Through Follow-up (ETF) survey were used. ETF is a repeated longitudinal survey on different cohorts of Swedish students, conducted by Statistics Sweden and the University of Gothenburg [22, 23]. The 2004 cohort, which is the only cohort with data on both school-related stress and internalizing problems from more than one wave, was included in the analysis. The sampling frame consisted of all students attending grade 3 of compulsory school in Sweden in 2014. From this frame, 9,794 students, or around 10% of the total grade 3 student population in Sweden, were sampled through a stratified cluster-based design. Schools were the primary sampling unit and were sampled with probability proportional to their number of students and within strata defined by school ownership. All students in the sampled schools were invited to respond to the survey. Students and/or their guardians had the right to decline participation.
Students were followed longitudinally in grades 6 (year 2017; n = 4,845) and 9 (year 2020; n = 2,991), when most were 13 and 16 years old, respectively. Response rates were 53% in grade 6 and 26% in grade 9 [24]. Weights calibrated based on information from administrative population registers were used to adjust for differential non-response rates. In supplementary analyses, the 1998 cohort was also used. This cohort had a higher grade 9 response rate (48%) but did not include data on internalizing problems in grade 6.
Dependent variable: internalizing problems
Students were asked how often during the last six months they experienced specific internalizing problems. Using the same data, Giota and Gustafsson [25] showed that a two-factor solution fitted the data best, separating what they termed “emotional” and “psychosomatic” problems. Emotional problems included feeling sad, feeling irritated or moody, feeling nervous, feeling low, difficulties concentrating, conflicts with peers, and withdrawal from peers. Psychosomatic problems included headaches, stomachaches, feeling tense, feeling giddy, difficulties sleeping, and poor appetite. Response alternatives were Never (1), Rarely (2), Sometimes (3), Often (4), and Always (5). Students’ average scores on the items were used as measures of emotional and psychosomatic problems, respectively, with higher values representing more problems. Cronbach’s alpha for both scales was 0.83 in grade 6 and 0.84 in grade 9. In supplementary analyses, the scales were transformed into categorical variables, with the 20% of students with the highest scores coded 1 and the remaining 80% coded 0.
Dependent variable: school-related stress
One indicator measuring school-related stress was available in ETF. Students were asked how often they feel stressed in school, with response alternatives ranging from Never (1) to Always (5). The reliability of a single item is difficult to ascertain. However, a similar indicator has been used in the Health Behaviour in School-Aged Children Survey [26]. Moreover, Lindholm et al. [27] reported that a similar global item (“I feel stressed”), though not specifically related to school, was strongly related to more comprehensive and specific measures of stress in Swedish students. In supplementary analyses, the indicator was transformed into a categorical variable, with the response alternative “Always” coded 1 and the remaining alternatives coded 0.
Control variables
The goal of specification curve analysis is to investigate how sensitive estimates are to variations in reasonable analytical choices that researchers can make. One of the most consequential choices in observational research concerns the choice of control variables. It is well known that researchers using observational data ought to control for confounders, but not for mediators or colliders [28]. Some variables, such as basic socio-demographic characteristics that cannot be caused by either school-related stress or internalizing problems, are clearly confounders. All models therefore adjusted for biological sex (0 = boy; 1 = girl), parental education (five categories, from pre-secondary education to at least four years of university) and migration background (1 = born outside Sweden; 0 otherwise).
The status of most other variables was less clear-cut, with theory and previous research not offering unambiguous advice on which variables should be regarded as confounders, mediators, or colliders. For instance, it is plausible that a heavy workload in the form of homework causes both stress and internalizing problems. If so, time spent on homework should be seen as a confounder and thus be controlled for. However, stressed students may also spend more time on homework to cope with high demands, which in turn could result in internalizing problems if they do not get enough time for rest and recovery. In that case, homework is a mediator and should not be controlled for. It is also possible that both stress and internalizing problems independently affect time spent on homework: doing homework may be a way of coping with high demands, and certain internalizing problems related to depression, such as fatigue or difficulties concentrating, may reduce students’ work capacity and force them to spend more time on homework to keep up. If so, homework is a collider and should not be controlled for.
To examine how sensitive the results are to either inappropriate exclusion of confounders, or inappropriate inclusion of mediators or colliders, all unique combinations of the control variables (except for the socio-demographic characteristics) were included in the analysis. However, the possible combinations increase exponentially with the number of control variables. For instance, six variables yield 63 possible combinations, while 16 variables yield 65,535 combinations. To balance the need for a flexible analysis of the importance of different covariate combinations with a manageable number of models, 12 control variables in total were chosen, yielding 4,095 combinations. In some models, only 11 were included since one variable (student’s grade point average) was only available in grade 9. Besides this variable, priority was given to control variables that were measured in both grades 6 and 9 and thus relevant for longitudinal models.
Achievement was measured by students’ final grade point average, representing the average grade in all the 16 or 17 subjects taken in Swedish compulsory school. This was only available for students in grade 9.
Self-reported schoolwork difficulties were measured using students’ average score on three items of the type “I find it difficult to keep up in lessons”, with response alternatives ranging from Never (1) to Always (5). Cronbach’s alpha for the scale was 0.67 (grade 6) and 0.69 (grade 9).
Academic self-concept was measured using students’ average scores on two sets of items. The first set included items related to general self-concept, such as “How good do you think you are in Mathematics?” with response alternatives ranging from Very poor (1) to Very good (5). The second set included items related to specific tasks, such as how well students can “Read and understand a text [in Swedish]”, with response alternatives ranging from Very poorly (1) to Very well (5). Cronbach’s alpha was 0.42 (grade 6) and 0.48 (grade 9) for the general self-concept scale and 0.69 (grade 6) and 0.67 (grade 9) for the specific self-concept scale. The general self-concept scale was kept in the analysis despite low reliability since including each item separately would increase the number of models to be estimated exponentially.
Achievement goal orientation was measured using students’ average scores on two sets of items. The first set included four items related to extrinsic and future-oriented goals, such as “[I] learn so that I can get a well-paid job”. The second set included three items related to relative performance and competitiveness, such as “[I] try to be better than other pupils in the class”. Response alternatives ranged from Never (1) to Always (5). Cronbach’s alpha was 0.88 (grade 6) and 0.79 (grade 9) for the extrinsic goals scale and 0.72 (both grades) for the competitive goals scale.
Relationships with peers and relationships with teachers were measured using single items: “I feel excluded in school” and “My teachers treat me well in school”, respectively, with response alternatives ranging from Never (1) to Always (5).
School contentment or satisfaction was measured using students’ average scores on five items such as “How content are you in your current class?”, with response alternatives ranging from Very content (1) to Not at all content (5). Cronbach’s alpha for the scale was 0.85 (grade 6) and 0.84 (grade 9).
Time spent on homework was measured using a single item: “How much time do you spend on average each week doing home assignments or homework?”, with response alternatives ranging from I don’t do homework (1) to 7 h or more (6) in grade 6 and from No time at all (1) to 4 h or more (5) in grade 9. The two highest response alternatives in the grade 6 item were collapsed into one to correspond with the grade 9 response alternatives.
Two indicators of non-school-related leisure activities were used. Physical activity was measured by a single item: “How much time do you spend on average per week doing sports or exercising?”. Screen time was measured using students’ average score on two items: ”How much time do you spend on average per week watching TV, movies or YouTube?” and ”How much time do you spend on average per week playing video or computer games?”. Response alternatives ranged from 0 h (1) to ≥ 41 h (6). No data on social media usage were available.
All included variables have been shown to be related either to school-related stress or internalizing problems in adolescents: achievement [29], schoolwork difficulties [25, 30], academic self-concept [31], goal orientations [32, 33], school contentment [34], homework [35], physical activity [36], and screen time [37, 38]. Further information on the measurement of the variables, the construction of scales, as well as summary statistics, is reported in Additional file A.
Specification curve analysis
Table 1 displays decisions a researcher analyzing the association between school-related stress and internalizing problems using observational data must make, as well as the choices made in the present study. A first decision concerns which data to use. ETF can be analyzed using either cross-sectional or longitudinal data. In this study, both types were used so as to maximize the variance in possible analytical choices. This also reflects the existing literature on the topic, which includes both cross-sectional and longitudinal designs [20]. The cross-sectional analyses used data from both grades 6 and 9, but separately, while the longitudinal analyses included both. A second decision concerns the measure of the outcome. Internalizing problems may be viewed as either dimensional or categorical. In this study, a dimensional approach was used in the main analysis and a categorical approach in supplementary analyses. A third decision concerns the choice of control variables, as discussed in the previous subsection. All socio-demographic control variables, and all possible combinations of the remaining control variables, were included.
A fourth decision concerns the choice of statistical model, which in turn incorporates the choice of how to model the functional form of the association, how to adjust for control variables, and how to model the associations over time in the longitudinal data. As for the functional form, the analyses presented in the paper assumed linear associations, while non-linear associations were modeled using logistic regression in supplementary analyses. The results from logistic models were presented separately in Additional file B since odds ratios cannot be directly compared with linear regression coefficients. As for how to adjust for control variables, the main alternatives are regression-based approaches– in which the associations between the control variables, exposure and outcome are modeled simultaneously– and matching or balancing approaches that model the associations between control variables and the exposure before estimating the association between the exposure and the outcome (e.g., propensity score matching). Matching or balancing typically assumes that the exposure is binary, which is not the case with school-related stress. Only regression-based adjustment was therefore used in this study. As for how to model associations over time, dozens of alternative models are available, many of which, however, were not compatible with the data at hand. In this study, three common types of longitudinal models were estimated: fixed effects (FE) models, lagged dependent variable (LDV) models, and prospective cohort models.
FE and LDV models constitute two different ways of adjusting for confounding in longitudinal analysis, and are both popular in social science [39]. FE models only use variation within individuals over time to estimate associations. With two waves of data, FE models are equivalent to first difference models: the change in the outcome between the first and second wave is regressed on the corresponding change in the exposure and control variables. This effectively controls for all time-invariant confounding with constant effects over time. FE models, however, tend to perform poorly in the context of lagged effects of the exposure on the outcome, or lagged effects of the outcome on the exposure (i.e., reverse causation) [39,40,41]. LDV models adjust for lagged scores of the outcome when estimating the contemporaneous association between the exposure and the outcome, with the rationale that the lagged outcome can adjust for the most important sources of confounding. Estimates from LDV models are more robust to reverse causality compared to FE models [39]. Angrist and Pischke [42] show that FE and LDV models have a useful bracketing property: when FE models are correct, LDV models will underestimate effects, while when LDV models are correct, FE models will overestimate effects.
The prospective cohort model is more popular in epidemiology and public health research. The focal coefficient of these models is the effect of the exposure measured at baseline on the outcome measured at follow-up, adjusted for the outcome measured at baseline. Like the LDV model, but unlike the FE model, the prospective cohort model uses lagged scores on the outcome to adjust for confounding. Unlike both FE and LDV models, the focus of the prospective cohort model is on the lagged rather than contemporaneous effect of the exposure on the outcome.
In sum, the analyses considered linear cross-sectional regression models, separately for grade 6 and 9 students, as well as linear longitudinal models (FE, LDV and prospective cohort models). All models used both emotional and psychosomatic problems as the outcome and included all socio-demographic control variables and all possible combinations of the 11–12 remaining control variables, yielding a total of 24,566 models in the main analyses. In supplementary analyses, another 32,756 models were estimated, yielding a total of 57,322 models. The specification curves were created using the speccurve program in Stata [43]. Additional technical details on the models are provided in Additional file A.
Results
Figure 1 shows results of the cross-sectional analyses. Both school-related stress and internalizing problems were standardized (mean = 0; standard deviation = 1) to facilitate comparison of effect sizes across models. The estimates ranged from a low of 0.31 to a high of 0.58, with around 90% of the estimates falling between 0.35 and 0.45. All estimates were significant at the 5% level, as can be seen from the 95% confidence intervals. The distribution of estimates was smooth, and there was no indication that a particular choice of specification resulted in a discrete jump in the effect sizes. The large number of estimates makes it impossible to see which models generate which estimates. Figure 2 therefore shows a subset of the estimates: the 10 largest and smallest estimates, and 10 random estimates in between. The smallest estimates consistently include competitive achievement goals, relationships with peers (i.e., feeling excluded), school contentment, and schoolwork difficulties, while the largest estimates consistently exclude these variables. Likewise, the largest estimates mostly include extrinsic (future-oriented) achievement goals, homework, and leisure-based physical activity, while the smallest estimates mostly exclude these variables.
Figures 3 and 4 show results of the longitudinal analyses. Unlike in Fig. 1, with the corresponding cross-sectional results, there is a discrete break in Fig. 3. The break reflects the markedly smaller estimates from the prospective cohort models compared to the estimates from the FE and LDV models. The estimates from the prospective cohort models ranged from 0.04 to 0.09, and most were not significant at the 5% level. Figure 4 also makes clear that the smallest estimates were with psychosomatic problems as the outcome. Estimates from the FE and LDV models were more smoothly distributed and also overlapped with each other to a considerable extent. Overall, estimates from the FE models were smaller, with the smallest being 0.22, and estimates from the LDV models larger, with the largest being 0.45.
Supplementary analyses
Additional file B shows that the results were qualitatively similar when categorical transformations of the exposure and outcome, and logistic regression models, were used. Additional file C shows that the cross-sectional results for grade 9 students were very similar when the 1998 cohort, with higher response rates, was used. Additional file D shows that stress or internalizing problems in grade 6 did not predict non-response in grade 9.
Discussion
Following calls for more longitudinal studies that include a broad range of confounders in research on the association between school-related stress and internalizing problems [18, 20], the aim of this study was to investigate how robust the association between school-related stress and internalizing problems is to differences in model specifications. To this end, specification curve analysis [21] was used to analyze longitudinal survey data on Swedish adolescent students aged 13–16, varying the choice of sample, measure of internalizing problems, functional form, statistical method, and combinations of included control variables. A total of 57,322 different regression models were estimated.
Two main results ought to be highlighted. First, the association between school-related stress and internalizing problems was largely robust, with around 90% of all estimates being significant at the 5% level. This is in line with the review by Steare et al. [20], which found that 48 of 52 included studies reported at least one positive association between school-related stress and internalizing problems. Most of the standardized coefficients in the present study ranged between 0.25 and 0.45, which is similar to estimates reported in, for instance, Kristensen et al. [13]. Since the indicator of school-related stress could only take on five different values, this means that going from the lowest to the highest level of stress was typically associated with between a 1.25 and 2.25 standard deviation increase in internalizing problems.
However, and second, the longitudinal estimates were generally somewhat smaller than the cross-sectional ones. Longitudinal data can more effectively adjust for confounding and are therefore, despite entailing other potential sources of bias (e.g., due to attrition), generally regarded as superior from the perspective of causal inference. The most marked difference was between the prospective cohort models and all other specifications. The prospective cohort model is identical to the LDV model, apart from when the focal exposure (school-related stress) is measured: contemporaneously in the LDV model or at baseline in the prospective cohort model. Thus, the results from the different specifications can be summarized as follows: the choice of outcome, including whether the outcome is operationalized as dimensional or categorical, seems to be least consequential for the results. The choice of which variables to include and whether to use cross-sectional or longitudinal data has some bearing on the results, but not a particularly large one as long as the association between school-related stress and internalizing problems is measured contemporaneously or as change-scores. The by far most consequential choice concerns whether to estimate lagged effects of school-related stress.
The choice between contemporaneous and lagged effects must be made on substantive grounds, using theory, reason, and relevant previous research [44]. On the one hand, causality operates in time and strictly contemporaneous or instantaneous effects, down to the most miniscule nanosecond, are impossible. On the other hand, the gaps between measurement points in a given data set may not match the lags operating in the real world [39, 41, 44]. What are, then, the temporal properties of the underlying causal process through which school-related stress is assumed to affect internalizing problems? Extant theory and research are arguably not sufficiently developed to allow for a precise answer concerning the appropriate lags in the context of school-related stress and internalizing problems. This context is particularly complex since it concerns associations between mental and emotional phenomena that are best understood as dimensional. Thus, there are no clear points that mark when the exposure (school-related stress) first occurred and thus started to exert its influence. However, it seems plausible that both almost immediate and more long-term effects are relevant and that they do not need to be mutually exclusive.
Some internalizing problems included in this study– such as irritation, nervousness, tenseness, giddiness, or sleep problems– may be almost instantaneously affected by stress. They are also conceptualized as manifestations of school-related stress by students themselves [45]. Thus, even though strictly contemporaneous effects are, as stated, impossible, the temporal dynamics may nevertheless be well approximated as such [39, 41, 44]. Long-term lagged effects may occur due to stress sensitization [46], whereby earlier exposure to school-related stress increases vulnerability to later stressors, including stressors not related to school. Thus, there may be a heightened risk of internalizing problems at later time points also in the absence of school-related stress at that particular time point. A special form of long-term effect is the adverse effects of chronic stress [47], with the gradual “wear and tear of the body” resulting from chronic stress making the individual progressively more susceptible to developing internalizing problems over time. Both stress sensitization and chronic stress are processes that may unfold over several years, thus suggesting that extended lag-times may be relevant.
Nonetheless, the finding that the longitudinal associations were smaller than the cross-sectional ones suggests that the cross-sectional associations are biased upwards due to unmeasured confounding. Thus, the general proposition that longitudinal data are preferable from the perspective of causal inference is borne out in the context of school-related stress and internalizing problems. Moreover, the fact that the prospective cohort model yielded small and mostly non-significant estimates suggests that school-related stress does not cause persistent harm once removed. One interpretation of this is that students are adaptive, which, in turn, suggests that interventions to reduce stress can yield fairly swift mental health benefits. This also suggests that information on school-related stress may have limited prognostic value for practitioners in predicting future internalizing problems, at least over and above information on current internalizing problems. However, it should be noted that this conclusion is partly contradicted by previous prospective cohort studies on the topic, all of which found evidence of lagged and long-term effects. In these studies, time from baseline measurements to follow-up varied from 6 to 12 months [48], to one year [49] and five years [50], as compared to three years in the present study.
Limitations
One major limitation of the present study was the low response rates, especially in grade 9. Response rates in ETF have declined in recent decades. The causes of this decline– which is part of a broader trend– are not known, but may include lower trust in institutions and authorities, stricter privacy concerns, or time limitations. In the case of ETF, the lower response rate in grade 9 compared to grade 6 may be because the grade 6 survey was conducted in school whereas the grade 9 survey was sent to the students’ homes. Another reason may be that participation in the grade 9 survey required written informed consent from the students. Of the 7,048 students that did not provide data in grade 9, 964 (14%) completed the survey but did not fill out the consent form [24]. Moreover, the response rate in grade 9, surveyed in the spring of 2020, may have been influenced by the COVID-19 pandemic.
Low response rates can undermine the external validity as well as the reliability of results due to possible non-response bias. It is difficult to ascertain the direction of any non-response bias in this case. In general, the magnitude of the associations will be biased upward if students with high levels of both school-related stress and internalizing problems were more likely to participate than students with high levels of only one of these measures. The finding that the results were very similar in samples with higher response rates– grade 6 and grade 9 in the 1998 cohort– suggests that at least the cross-sectional results may be robust to potential non-response bias. Moreover, supplementary analyses showed that stress and internalizing problems in grade 6 did not, either independently or in combination, predict non-response in grade 9 (Additional file D). The possibility that non-participation in both surveys was related to stress or internalizing problems cannot, however, be ruled out. It should also be noted that the use of calibration weights corrected for higher non-response among boys, low-achieving students, and students with low socioeconomic backgrounds [24].
A second limitation is that ETF only contains a single indicator of school-related stress. This item may have limited reliability and validity compared to more comprehensive scales. The use of a single item also means that specific subsets of school-related stress identified in previous research– such as stress related to achievements, school-leisure conflicts, or social interactions in school [9,10,11, 45]– could not be identified. A third limitation is that only two waves of data were used in the longitudinal analyses. More, and more proximally spaced, measurement points would enable more advanced and credible analyses of reciprocal and lagged effects. A fourth limitation is that both school-related stress and internalizing problems were self-reported, making the results sensitive to common methods bias [51].
Moreover, the specification choices considered in this study do not encompass all possible choices that can be made using ETF data, let al.one other sources of data. For instance, it is possible that confounding factors interact in complex, non-linear ways that are not captured by the linear and additive effects assumed in linear regression models. Estimating all second- or higher-order interactions of all possible control variable combinations would, however, be unwieldy. Likewise, the data were not compatible with some state-of-the-art methods for longitudinal analysis. For instance, event study and synthetic control designs, popular in the social sciences [52], are most suitable in contexts where the exposure is a discrete event. Likewise, random-intercept cross-lagged panel models designed to simultaneously handle unobserved time-invariant confounding and lagged effects [53, 54], and that are popular in psychology [13], require more than two waves of data.
Conclusion
Based on a specification curve analysis of 57,322 regression models, this study showed that school-related stress is a robust predictor of internalizing problems, regardless of the choice of data and sample, measurement of the outcome, assumed functional form, included control variables, or statistical model. The one exception was the prospective cohort models that modeled lagged effects adjusted for baseline values of the outcome, which generally found these effects to be small and non-significant. A key takeaway is therefore that the choice of whether to estimate lagged or contemporaneous effects may be the most consequential one in studies on school-related stress and internalizing problems or similar topics. Research using multi-wave data or high-frequency longitudinal designs can make further headway in understanding the temporal dynamics underlying the association between school-related stress and internalizing problems. Such research would, moreover, gain from incorporating the flexibility and transparency inherent in specification curve analysis.
Data availability
Data are not available for public use. Please see the ETF website for more information: https://www.gu.se/utvardering-genom-uppfoljning.
Abbreviations
- ETF:
-
Evaluation through follow-up
- FE:
-
Fixed effects
- LDV:
-
Lagged dependent variable
References
Keyes KM, Platt JM. Annual research review: sex, gender, and internalizing conditions among adolescents in the 21st century– trends, causes, consequences. J Child Psychol Psychiatry. 2024;65384–407. https://doiorg.publicaciones.saludcastillayleon.es/10.1111/jcpp.13864
Armitage JM, Collishaw S, Sellers R. Explaining long-term trends in adolescent emotional problems: what we know from population-based studies. Discov Soc Sci Health. 2025;4(14). https://doiorg.publicaciones.saludcastillayleon.es/10.1007/s44155-024-00076-2
Merrell KW. Helping students overcome depression and anxiety: A practical guide. 2nd ed. New York, NY, US: Guilford Press; 2008.
Hagquist C. Psychometric properties of the psychosomatic problems scale: A Rasch analysis on adolescent data. Soc Indic Res. 2008;86(3):511–23. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/s11205-007-9186-3
Ando S, Yamasaki S, Shimodera S, Sasaki T, Oshima N, Furukawa TA, et al. A greater number of somatic pain sites is associated with poor mental health in adolescents: a cross-sectional study. BMC Psychiatry. 2013;13(1):30. https://doiorg.publicaciones.saludcastillayleon.es/10.1186/1471-244X-13-30
Pearlin LI. The sociological study of stress. J Health Soc Behav. 1989;30(3):241–56. https://doiorg.publicaciones.saludcastillayleon.es/10.2307/2136956
Holmes TH, Rahe RH. The social readjustment rating scale. J Psychosom Res. 1967;11(2):213–8. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/0022-3999(67)90010-4
March-Llanes J, Marqués-Feixa L, Mezquita L, Fañanás L, Moya-Higueras J. Stressful life events during adolescence and risk for externalizing and internalizing psychopathology: a meta-analysis. Eur Child Adolesc Psychiatry. 2017;26(12):1409–22. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/s00787-017-0996-9
Anniko MK, Boersma K, van Wijk NPL, Byrne D, Tillfors M. Development of a shortened version of the adolescent stress questionnaire (ASQ-S): construct validity and sex invariance in a large sample of Swedish adolescents. Scand J Child Adoles. 2018;6(1):4–15. https://doiorg.publicaciones.saludcastillayleon.es/10.21307/sjcapp-2018-001
Moksnes UK, Byrne DG, Mazanov J, Espnes GA. Adolescent stress: evaluation of the factor structure of the adolescent stress questionnaire (ASQ-N). Scand J Psychol. 2010;51(3):203–9. https://doiorg.publicaciones.saludcastillayleon.es/10.1111/j.1467-9450.2009.00803.x
Byrne DG, Davenport SC, Mazanov J. Profiles of adolescent stress: the development of the adolescent stress questionnaire (ASQ). J Adolesc. 2007;30(3):393–416. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.adolescence.2006.04.004
García-Moya I, Corell M, Paniagua C, Vaez M, Löfstedt P. School stress and health complaints: testing the adolescent stress questionnaire in Spain and Sweden. Eur J Public Health. 2021;31(Supplement3). https://doiorg.publicaciones.saludcastillayleon.es/10.1093/eurpub/ckab164.310
Kristensen SM, Larsen TMB, Urke HB, Danielsen AG, Academic, Stress. Academic Self-efficacy, and psychological distress: A moderated mediation of Within-person effects. J Youth Stud. 2023;18(5):598–616. https://doiorg.publicaciones.saludcastillayleon.es/10.1080/13676261.2014.992317
Kappner N, Lang J, Berthold A, Gaum PM. The moderating effect of social resources on the link between study-related stressors and depressive symptoms among medical students in North Rhine-Westphalia, Germany– a cross-sectional study. BMC Psychiatry. 2022;22(1):524. https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12888-022-04170-0
Chen B, Wang W, Yang S. The relationship between academic stress and depression among college students during the COVID-19 pandemic: a cross-sectional study from China. BMC Psychiatry. 2024;24(1):46. https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12888-024-05506-8
Liu F, Yang D, Liu Y, Zhang Q, Chen S, Li W, et al. Use of latent profile analysis and k-means clustering to identify student anxiety profiles. BMC Psychiatry. 2022;22(1):12. https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12888-021-03648-7
Thornton S. Academic pressure: taking the issue seriously. Br J Child Health. 2024;5(1):46–8. https://doiorg.publicaciones.saludcastillayleon.es/10.12968/chhe.2024.5.1.46
The Lancet Child & Adolescent Health. Time to take academic pressure seriously. Lancet Child Adolesc Health. 2023;7(10):671. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/S2352-4642(23)00231-6
Thompson D. (2023) We’re Missing a Key Driver of Teen Anxiety. The Atlantic. https://www.theatlantic.com/newsletters/archive/2023/03/teen-anxiety-elite-schools-sat-act-paradox-wealthy-nations/673307/. Accessed 12 April 2024.
Steare T, Gutiérrez Muñoz C, Sullivan A, Lewis G. The association between academic pressure and adolescent mental health problems: A systematic review. J Affect Disord. 2023;339:302–17. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.jad.2023.07.028
Simonsohn U, Simmons JP, Nelson LD. Specification curve analysis. Nat Hum Behav. 2020;4(11):1208–14. https://doiorg.publicaciones.saludcastillayleon.es/10.1038/s41562-020-0912-z
Härnqvist K. Evaluation through follow-up. A longitudinal program for studying education and career development. In: Janson C-G, editor Seven Swedish longitudinal studies in behavioral science. Forskningsrådsnämnden, Stockholm, 2000;76–114.
Svensson A. Utvärdering Genom Uppföljning. Longitudinell individforskning under Ett Halvsekel. University of Gothenburg. Faculty of Education; 2011.
Statistics Sweden. (2021) Enkätundersökning till elever i årskurs 9 för UGU-projektet Teknisk rapport - En beskrivning av genomförande och metoder. Statistics Sweden. https://www4.gu.se/compeat/FUR/UGU/Rapporter/UGU2004_Teknisk rapport med bilagor_ny_ak9.pdf. Accessed 22 April 2024.
Giota J, Gustafsson J-E. Perceived demands of schooling, stress and mental health: changes from grade 6 to grade 9 as a function of gender and cognitive ability. Stress Health. 2017;33(3):253–66. https://doiorg.publicaciones.saludcastillayleon.es/10.1002/smi.2693
Inchley JC, Stevens GWJM, Samdal O, Currie DB. Enhancing Understanding of adolescent health and Well-Being: the health behaviour in School-aged children study. J Adolesc Health. 2020;66(6):S3–5. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.jadohealth.2020.03.014
Lindblad F, Backman L, Åkerstedt T. Immigrant girls perceive less stress. Acta Paediatr. 2008;97(7):889–93. https://doiorg.publicaciones.saludcastillayleon.es/10.1111/j.1651-2227.2008.00857.x
Cinelli C, Forney A, Pearl J. A crash course in good and bad controls. Sociol Method Res. 2024;53(3):1071–104. https://doiorg.publicaciones.saludcastillayleon.es/10.1177/00491241221099552
Bortes C, Nilsson K, Strandh M. Associations between children’s diagnosed mental disorders and educational achievements in Sweden. Scand J Public Health. 2022;50(8):1140–7. https://doiorg.publicaciones.saludcastillayleon.es/10.1177/14034948221089056
Deighton J, Humphrey N, Belsky J, Boehnke J, Vostanis P, Patalay P. Longitudinal pathways between mental health difficulties and academic performance during middle childhood and early adolescence. Br J Dev Psychol. 2018;36(1):110–26. https://doiorg.publicaciones.saludcastillayleon.es/10.1111/bjdp.12218
Morales A, Rodríguez-Menchón M, Espada JP, Orgilés M. Examining academic self-concept as a mediator of the relationship between anxiety and depression: A longitudinal study. Child Adolesc Ment Health. 2023;28(3):354–62. https://doiorg.publicaciones.saludcastillayleon.es/10.1111/camh.12577
Bergh D, Giota J. Student achievement goals and psychosomatic health complaints among Swedish adolescents: the role of sex. J Public Health. 2022;301011–22. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/s10389-020-01374-0
Tuominen-Soini H, Salmela-Aro K, Niemivirta M. Achievement goal orientations and subjective well-being: A person-centred analysis. Learn Instr. 2008;18(3):251–66. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.learninstruc.2007.05.003
Löfstedt P, García-Moya I, Corell M, Paniagua C, Samdal O, Välimaa R, et al. School satisfaction and school pressure in the WHO European region and North America: an analysis of time trends (2002–2018) and patterns of Co-occurrence in 32 countries. J Adolesc Health. 2020;66(6, Supplement):S59–69. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.jadohealth.2020.03.007
Galloway M, Conner J, Pope D. Nonacademic effects of homework in privileged, high-Performing high schools. J Exp Educ. 2013;81(4):490–510. https://doiorg.publicaciones.saludcastillayleon.es/10.1080/00220973.2012.745469
Pascoe MC, Hetrick SE, Parker AG. The impact of stress on students in secondary school and higher education. Int J Adolesc Youth. 2020;25(1):104–12. https://doiorg.publicaciones.saludcastillayleon.es/10.1080/02673843.2019.1596823
Stiglic N, Viner R. Effects of screentime on the health and well-being of children and adolescents: a systematic review of reviews. BMJ Open. 2019;9(1):e023191. https://doiorg.publicaciones.saludcastillayleon.es/10.1136/bmjopen-2018-023191
Orben A. Teenagers, screens and social media: a narrative review of reviews and key studies. Soc Psychiatry Psychiatr Epidemiol. 2020;55(4):407–14. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/s00127-019-01825-4
Vaisey S, Miles A. Sociol Method Res. 2017;46(1):44–67. https://doiorg.publicaciones.saludcastillayleon.es/10.1177/0049124114547769. What You Can—and Can’t—Do With Three-Wave Panel Data.
Imai K, Kim IS. When should we use unit fixed effects regression models for causal inference with longitudinal data?? Am J Political Sci. 2019;63(2):467–90. https://doiorg.publicaciones.saludcastillayleon.es/10.1111/ajps.12417
Leszczensky L, Wolbring T. How to deal with reverse causality using panel data?? Recommendations for researchers based on a simulation study. Soc Method Res. 2022;51(2):837–65. https://doiorg.publicaciones.saludcastillayleon.es/10.1177/0049124119882473
Angrist JD, Pischke JS. Mostly harmless econometrics: an empiricist’s companion. Princeton: Princeton University Press; 2009.
Andresen M. (2024) speccurve - Stata package to plot specification curves. https://github.com/martin-andresen/speccurve Accessed 15 April 2024.
Bellemare MF, Masaki T, Pepinsky TB. Lagged explanatory variables and the Estimation of causal effect. J Politics. 2017;79(3):949–63. https://doiorg.publicaciones.saludcastillayleon.es/10.1086/690946
Östberg V, Almquist YB, Folkesson L, Brolin Låftman S, Modin B, Lindford P. The complexity of stress in Mid-Adolescent girls and boys. Child Ind Res. 2015;8(2):403–23. https://doiorg.publicaciones.saludcastillayleon.es/10.1007/s12187-014-9245-7
Monroe SM, Harkness KL. Life stress, the kindling hypothesis, and the recurrence of depression: considerations from a life stress perspective. Psychol Rev. 2005;112(2):417–45. https://doiorg.publicaciones.saludcastillayleon.es/10.1037/0033-295X.112.2.417
McEwen BS. Stress, adaptation, and disease: allostasis and allostatic load. Ann N Y Acad Sci. 1998;840:33–44. https://doiorg.publicaciones.saludcastillayleon.es/10.1111/j.1749-6632.1998.tb09546.x
Torsheim T, Aaroe LE, Wold B. School-related stress, social support, and distress: prospective analysis of reciprocal and multilevel relationships. Scand J Psychol. 2003;44(2):153–9. https://doiorg.publicaciones.saludcastillayleon.es/10.1111/1467-9450.00333
Fu Y, Ren W, Liang Z. Perceived academic stress and depressive symptoms among Chinese adolescents: A moderated mediation analysis of overweight status. J Affect Disord. 2022;296:224–32. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.jad.2021.09.060
Kaman A, Otto C, Klasen F, Westenhöfer J, Reiss F, Hölling H, et al. Risk and resource factors for depressive symptoms during adolescence and emerging adulthood– A 5-year follow-up using population-based data of the BELLA study. J Affect Disord. 2021;280:258–66. https://doiorg.publicaciones.saludcastillayleon.es/10.1016/j.jad.2020.11.036
Podsakoff PM, Organ DW. Self-Reports in organizational research: problems and prospects. J Manag. 1986;12(4):531–44. https://doiorg.publicaciones.saludcastillayleon.es/10.1177/014920638601200408
Athey S, Imbens GW. The state of applied econometrics: causality and policy evaluation. J Econ Perspect. 2017;31(2):3–32. https://doiorg.publicaciones.saludcastillayleon.es/10.1257/jep.31.2.3
Hamaker EL, Kuiper RM, Grasman RP. A critique of the cross-lagged panel model. Psychol Methods. 2015;20(1):102–16. https://doiorg.publicaciones.saludcastillayleon.es/10.1037/a0038889
Allison PD, Williams R, Moral-Benito E. Maximum likelihood for Cross-lagged panel models with fixed effects. Socius. 2017;3. https://doiorg.publicaciones.saludcastillayleon.es/10.1177/2378023117710578
Acknowledgements
I thank the members of the ETF-team at Gothenburg University for sharing the data. Special thanks to Lucas Fischer Madsen and Nelika Karimi for their assistance with the data application and in answering queries regarding the data.
Funding
Open access funding provided by Umea University.
This study was funded by the Swedish Research Council for Health, Working Life and Welfare (grant no. 2022 − 01062). The study is based on data from the research infrastructure ETF (UGU) at the University of Gothenburg. ETF is funded by the Swedish Research Council (grant no. 2017 − 00628) and the University of Gothenburg.
Author information
Authors and Affiliations
Contributions
As the sole author, BH conceived of the idea, conducted the data analyses, and wrote and edited the manuscript.
Corresponding author
Ethics declarations
Ethics approval and consent to participate
The use of data for the present study was approved by the Swedish Ethical Review Authority (Dnr 2023-03999-01). Active and written informed consent was obtained from students in grade 9 through a specific consent form on the first page of the questionnaire. In grade 6, parents and students were given written information about the study. Parents of grade 6 students had the option to actively decline their child’s participation. Grade 6 students were informed that participation was voluntary, but no written informed consent was obtained. The study was not a clinical trial and did not imply any collection of new data from participants.
Consent for publication
Not applicable.
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Högberg, B. How robust is the association between school-related stress and internalizing mental health problems? A specification curve analysis. BMC Psychiatry 25, 413 (2025). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12888-025-06829-w
Received:
Accepted:
Published:
DOI: https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12888-025-06829-w