In a new article published online ahead of print in the Journal of Affective Disorders, psychologist Eiko Fried examines the wide range of symptoms that appear on depression rating scales. Fried, from the University of Amsterdam (UvA), analyses the degree to which symptoms overlap on seven major depression scales and his results call into question the reliability of the research on depression.
“These findings imply that the routine practice of using scales as interchangeable measurements of depression severity is problematic and may pose a major threat to the generalizability and replicability of depression research,” he writes,
“Given the high prevalence rates and burden caused by [major depressive disorder] MDD, and the size of the research field – a non-exhaustive search of a few databases and a small number of journals identified around 50.000 depression articles published between 1990 and 1999 alone – the severity of this situation can hardly be overstated.”

Depression is a highly researched topic. Researchers from the social sciences to neuroscience and genetics use depression severity scales in their studies. For example, three different papers, each establishing a measurement tool for depression severity, find their way into the 100 most cited papers in all of science.
It is common for researchers to pick one such measurement tool for their study (it is estimated that there are over 280 different instruments to choose from). Often, studies then draw conclusions about depression in general, rather than making narrower statements about the type of depression measured by a particular scale. This wouldn’t pose a problem, necessarily, if it could be assumed that all of these different tools were similarly quantifying the same underlying experience of depression.
“If this assumption does not hold,” Fried writes, then “results of depression studies may be idiosyncratic to the particular scale used, posing a major challenge to the replicability and generalizability of depression research.”
Fried points to several reasons why this assumption may be misguided.
- Studies that use multiple depression severity scales often find that individuals perform significantly different on different scales
- Psychometric analyses suggest that these scales are multidimensional, meaning they are simultaneously measuring several constructs and not just ‘depression’
- Research shows that depression can present very differently in different people and that different symptoms appear to respond differently to various treatment approaches
To assess the degree to which the symptoms being measured overlap with one another, he examined seven common rating scales: The Beck Depression Inventory (BDI-II), Hamilton Rating Scale of Depression HRSD, CES-D, Inventory of Depression Symptoms (IDS), Quick Inventory of Depressive Symptoms (QIDS), Montgomery-Asberg Depression Rating Scale (MADRS), and the Zung Self-Rating Depression Scale (SDS).
The results show a total of 52 specific disparate depression symptoms measured across the scales. For instance, the HRSD tends to look at more physical symptoms like weight loss and delayed movements, while the BDI-II focuses on cognitive symptoms, like feeling guilty or worthless.
Across all of the scales, the overlap was low between the symptoms being measured on each scale. The results reveal that 40% of all symptoms appeared only in a single scale and that only 12% of symptoms were consistent across all instruments.
“Since different instruments capture different aspects of the heterogeneous depressive syndrome, there is the risk that the selection of a particular scale for a study may severely bias results,” Fried concluded. “Considering the persistent lack of progress in core research areas such as antidepressant efficacy and biomarkers robustly associated with depression diagnosis, this topic deserves more attention in contemporary research.”
****
Eiko I. Fried, The 52 symptoms of major depression: lack of content overlap among seven common depression scales, Journal of Af ective Disorders, http://dx.doi.org/10.1016/j.jad.2016.10.019 (Abstract)