From discussion section of the paper:
“Psychological distress and self-harm are known to increase across early adolescence. Normative data show rising YSR total problems scores with age from age 11 to 16 years in non-clinical samples from a range of countries.[29] Self-harm rates in the general population in the UK and elsewhere increase markedly with age from early to mid-adolescence, being very low in 10 year olds and peaking around age 16-17 years.[53-56] Our finding that psychological function and self-harm did not change significantly during the study is consistent with two main alternative explanations. The first is that there was no change, and that GnRHa treatment brought no measurable benefit nor harm to psychological function in these young people with GD. This is consonant with the action of GnRHa, which only stops further pubertal development and does not change the body to be more congruent with a young person’s gender identity. The second possibility is that the lack of change in an outcome that normally worsens in early adolescence may reflect a beneficial change in trajectory for that outcome, i.e. that GnRHa treatment reduced this normative worsening of problems. In the absence of a control group, we cannot distinguish between these possibilities. We aimed to use normative reference data to examine this issue. However age- and gender-standardised t-scores for ASEBA and other outcomes cannot answer this question as they cover a very broad age range (e.g. 12-18 years). We had anticipated that z-scores on the YSR available by calendar year for two comparable countries (Netherlands; Australia) might be informative however confidence intervals were too wide to draw reliable inferences.“
My translation for the lay person ( I welcome any corrections or comment):
They acknowledge that because they didn’t run a controlled study, they don’t know whether the drug had any impact at all on psychological distress and self harm.
They planned to compare the data to known data for the general population however they were unable to.
I would like to suggest that their chosen method of measuring this data (using median scores and combining two separate questionnaire answers) was always going to lead to an issue comparing to the published data for the general population.
WHY did they test an experimental treatment on young children, if not to reduce those children’s’ psychological distress??? They failed.
The one thing in clinical trials which is worse than a bad outcome, is an outcome which you can’t interpret or explain.
They failed.