Scolaris Content Display Scolaris Content Display

Benzodiazepines for catatonia in people with schizophrenia or other serious mental illnesses

Collapse all Expand all

Background

Catatonia is a debilitating disorder of movement and volition associated with schizophrenia and some other mental illnesses. People with catatonia are more likely to require hospitalisation and highly supervised care than those without the disorder. They also have an increased risk of secondary complications such as pneumonia, malnutrition and dehydration. The mainstay of treatment has been drug therapies and electroconvulsive therapy.

Objectives

To compare the effects of benzodiazepines with other drugs, placebo or electroconvulsive therapy for catatonia in people with schizophrenia or other similar serious mental illnesses (SMIs).

Search methods

We updated our previous search (28 February 2007) by searching the Cochrane Schizophrenia Group's Study‐Based Register of Trials (9 November 2016; 6 February 2019). This register is compiled by systematic searches of major resources (including CENTRAL, MEDLINE, Embase, AMED, BIOSIS, CINAHL, PsycINFO, PubMed, and registries of clinical trials) and their monthly updates, handsearches, grey literature, and conference proceedings, with no language, date, document type, or publication status limitations for inclusion of records into the register. We also manually searched reference lists from studies selected by the search.

Selection criteria

All controlled clinical trials that randomised people who have schizophrenia or other similar SMI and experiencing catatonia to receive benzodiazepines or another relevant treatment. We included studies that met our inclusion criteria and reported usable data. We excluded those not meeting our inclusion criteria or those not reporting usable data. We contacted authors when we required further information; and if we received no response, we put those studies aside as 'awaiting assessment'.

Data collection and analysis

Review authors extracted data independently. For dichotomous data we calculated relative risks (RR) and their 95% confidence intervals (CI) on an intention‐to‐treat basis using a fixed‐effect model. We completed a 'Risk of bias' assessment for the included study and generated a 'Summary of findings' table using GRADE.

Main results

The searches found 130 citations, from which we could identify 22 possibly relevant studies. From these, we could only include one study. This study had a relatively small sample size of 17 participants who received lorazepam or oxazepam and were drug free for one week before the trial started. The only usable data reported by this study were clinically important change in symptoms of catatonia measured as 50% improvement on the Visual Analogue Scale (VAS). There was no difference in the numbers of participants showing a clinically important change in their catatonic symptoms (RR 0.95, 95% CI 0.42 to 2.16; participants = 17; studies = 1; very low quality evidence).

No data were reported for other important outcomes of hospital stay, clinically important change in satisfaction with care, global state, adverse effects or general functioning

We did find a few studies meeting our inclusion criteria but they reported no usable data. We had to exclude these. Although poorly reported, these studies do illustrate that relevant studies have been undertaken — they are not impossible to design and conduct.

Authors' conclusions

Analysis of the results from this review, which was a head‐to‐head comparison of two benzodiazepine monotherapies, does not show a clear difference in effect. No data were available for benzodiazepines compared to placebo or standard care. The lack of usable data and very low quality of data available makes it impossible to draw firm conclusions and further studies with a high‐quality methodology and reporting are required in order to determine more definitively the outcomes associated with benzodiazepine use in the clinical management of catatonia in persons with schizophrenia and other SMI.

PICOs

Population
Intervention
Comparison
Outcome

The PICO model is widely used and taught in evidence-based health care as a strategy for formulating questions and search strategies and for characterizing clinical studies or meta-analyses. PICO stands for four different potential components of a clinical question: Patient, Population or Problem; Intervention; Comparison; Outcome.

See more on using PICO in the Cochrane Handbook.

Benzodiazepines for catatonia in people with schizophrenia or similar serious mental illness

Review question
Are the group of drugs called benzodiazepines an effective and tolerable treatment for catatonia in people with schizophrenia or other serious mental illnesses?

Background
Catatonia is a debilitating condition that is characterised by diminished, excessive or peculiar movement and activity as well as diminished engagement with the social and physical environment. It can occur when a person has a number of different psychiatric conditions, including schizophrenia (an enduring mental illness whose hallmark is an altered perception of reality); and less frequently with medical conditions. Some of the other serious mental disorders that are associated with catatonia include bipolar disorder (an illness in which there are extremes of disturbed mood) and depression (another mood disorder characterized by low mood). Benzodiazepines are widely used in the treatment of catatonia, but there is no good quality evidence from randomised controlled trials concerning their effectiveness.

Searching for evidence
The Information Specialist from Cochrane Schizophrenia ran electronic searches of the group's specialised register (the most recent in February 2019) for trials that randomised people with catatonia occurring in conjunction with schizophrenia or other similar serious mental illnesses to receive either benzodiazepines or any of the following: other drugs, placebo, or electroconvulsive therapy. One hundred and thirty records were found and checked by the review authors.

Evidence found
One trial was found in the search which met the review requirements and provided limited, very low quality usable data for one outcome only. This trial compared two benzodiazepines (lorazepam vs oxazepam) and found no clear difference between these two treatments for improvement in the symptoms of catatonia for people who have catatonia and schizophrenia or similar serious mental illness.

Conclusions
There is insufficient high quality evidence available to answer the review question. More high quality research is needed.

Authors' conclusions

Implications for practice

1. For people with catatonia and carers

We are unable to give any clear statements on the benefits or risks associated with the use of benzodiazepines in the treatment of people with catatonia. As far as we can see, treatment of this distressing and serious condition will continue to be based on anecdotal or case report evidence other than that derived from high‐grade trials. Perhaps carers and recipients of care could highlight the need for good studies and their willingness to take part.

2. For clinicians

We are unable to give any clear statements on the benefits or risks associated with the use of benzodiazepines in the treatment of persons with catatonia. Clinicians will have to continue to make judgements on how to treat people that are probably more based on consensus than evidence from clinical trials. Again, should clinicians show a willingness to take part in clinically relevant evaluative studies, this would make this type of work much more possible.

3. For managers/policy makers

Policy will have to be based on non‐trial, low‐quality evidence and consensus until better data are available.

Implications for research

1. General

Trialists undertaking research in this area should ensure that both methods and data reporting are of the highest quality. Some specific issues include the need for descriptions of both randomisation and the process and testing of blinding. Reporting outcomes using only graphs, summary statistics or P values should be avoided. Instead, authors should present means, standard deviations, confidence intervals, measures of association between intervention and outcome, e.g. relative risks and odds ratios, as well as the raw numbers, if possible. Finally, trialists should report on all findings related to the method they describe. Where cross‐over designs are used, findings from each cross‐over phase should be reported separately.

2. Specific

The use of specific treatments for people with catatonia is not supported by any evidence from randomised trials. The excluded studies and those awaiting assessment demonstrate that such studies are not impossible (Fischer 1974;Kunigiri 2002;Merlis 1962; Patra 1998; Ungvari 1997;Ungvari 1999; Wetzel 1997). We suggest that there is an urgent need to test the effects of benzodiazepines for this condition as, anecdotally, these drugs would appear to be a promising pharmacological approach (Rosebush 1990; Schmider 1999). We suggest a design for such a study in Table 1.

Open in table viewer
Table 1. Suggested design of study

Methods

Participants

Interventions

Outcomes

Notes

Allocation: randomised, block, well described.
Blinding: double, well described and tested.
Duration: 12 to 24 weeks.

Diagnosis: catatonia.
N = 300.*
Age: adults.
Sex: both.
History: clearly reported.

1. Lorazepam 4 mg/day. N = 150.
2. Placebo. N = 150.

General: relapse, general impression of clinician (CGI), carer/other (CGI), compliance with treatment, healthy days, time in hospital, satisfaction with care.
Mental state: CGI.
Catatonic symptoms: BFCRS

Quality of life: CGI.
Family burden: CGI.
Social functioning: return to everyday living for 80% of time.*
Adverse events: any adverse event recorded.
Economic outcomes.

* powered to be able to identify a difference of ˜20% between groups for primary outcome with adequate degree of certainty.

CGI: Clinical Global Impression scale; BFCRS: Bush‐Francis Catatonia Rating Scale

Summary of findings

Open in table viewer
Summary of findings for the main comparison. Lorazepam compared to Oxazepam for catatonia in people with schizophrenia or other SMI

Lorazepam compared to oxazepam for catatonia in people with schizophrenia and/or SMI

Patient or population: catatonia in people with schizophrenia or SMI
Setting: hospital
Intervention: lorazepam
Comparison: oxazepam

Outcomes

Anticipated absolute effects* (95% CI)

Relative effect
(95% CI)

№ of participants
(studies)

Certainty of the evidence
(GRADE)

Comments

Risk with oxazepam

Risk with lorazepam

Catatonia: clinically important change in catatonia symptoms (improved at least 50% on the VAS)

Study population

RR 0.95
(0.42 to 2.16)

17
(1 RCT)

⊕⊝⊝⊝
VERY LOW 1 2 3

600 per 1000

570 per 1000
(252 to 1000)

Service use: duration of stay in hospital

not reported

Satisfaction with care: clinically important change in satisfaction by informal care‐givers

not reported

Satisfaction with care: clinically important change in satisfaction by recipients of care

not reported

Global state: clinically important change in global state (as defined by individual studies)

not reported

Adverse effects: incidence of clinically important adverse effect

not reported

General functioning: clinically important change in specific aspects of functioning, such as life skills ‒ as defined by each of the studies

not reported

*The risk in the intervention group (and its 95% CI) is based on the assumed risk in the comparison group and the relative effect of the intervention (and its 95% CI).

CI: Confidence interval; RR: Risk ratio; OR: Odds ratio

GRADE Working Group grades of evidence
High certainty: We are very confident that the true effect lies close to that of the estimate of the effect
Moderate certainty: We are moderately confident in the effect estimate: the true effect is likely to be close to the estimate of the effect, but there is a possibility that it is substantially different
Low certainty: Our confidence in the effect estimate is limited: the true effect may be substantially different from the estimate of the effect
Very low certainty: We have very little confidence in the effect estimate: the true effect is likely to be substantially different from the estimate of effect

1 Serious: downgraded by 1 level due to unclear study design whether any of the following factors were considered; randomisation, allocation concealment, blinding of participants and personnel and blinding of outcome assessment. The study did imply this was a "double‐blind crossover study" but no further description or details provided discussing study design. Study did not report adverse events. Data from 4 participants were excluded in the final analysis.

2 Serious: downgraded by 1 level. Intervention was only delivered once and outcomes measured. Reporting outcomes were measured for one day only after intervention. No long‐term points for outcome measurements. Fixed doses of both interventions used in participants.

3 Serious: downgraded by 1 level due to effect size from 1 small study with a sample size of 17 participants. Wide ranging CI interval around effect estimate.

Background

Description of the condition

Catatonia is a syndrome characterised by motor abnormalities such as purposeless activity, immobility and posturing, together with disturbances of consciousness and volition. Because of its debilitating nature and high risk for complications, it is usually treated in hospital. In 1874 observers described seventeen typical signs, although many more have since been added (Kahlbaum 1973). Catatonia has been described in the context of schizophrenia (a chronic mental illness presenting with features such as hallucinations, delusions, disorganized speech and disorganized behaviour), organic illnesses and mood disorders (APA 2000; WHO 1992). Regardless of the cause, when catatonia is not treated patients are at an increased risk for a number of negative outcomes such as extreme negativism, mutism and echolalia. People with catatonia may develop pneumonia and thromboembolic complications (Regestein 1977). Malnutrition and dehydration are also common (Penland 2006).

Description of the intervention

A number of different treatments have been advocated for catatonia. Attention to underlying causes is important although, in some instances, antipsychotics (used in the treatment of schizophrenia) may worsen catatonia (Penland 2006). Benzodiazepines are medications that act on the central nervous system; they are also referred to as anxiolytics and have sedative properties. Benzodiazepines, especially lorazepam, have been thought to be of some benefit to persons with catatonia (Rosebush 1990). Together with electroconvulsive therapy, benzodiazepines are perhaps the most widely reported treatment for the condition.

How the intervention might work

Neuronal pathways mediated by the neurotransmitter gamma‐aminobutyric acid (GABA) are believed to play a central role in the integration of emotional and cognitive functions. Catatonic symptoms are postulated to arise from dysregulation in these pathways (Ellul 2015). Benzodiazepines potentiate GABAergic activity and are believed to counteract the GABAergic dysregulation underlying catatonic symptoms.

Why it is important to do this review

Benzodiazepines are a common treatment strategy for catatonia; and much of the evidence base for their use in this capacity consists of case reports and small studies. This systematic review attempted to identify and summarise any relevant evidence from trials and thus provide clinicians with a stronger basis for making decisions about their use in the management of catatonia.

Objectives

To compare the effects of benzodiazepines with other drugs, placebo or electroconvulsive therapy for catatonia in people with schizophrenia or other similar serious mental illnesses.

Methods

Criteria for considering studies for this review

Types of studies

We included studies that were described as randomised controlled trials or as 'double blind', in which randomisation is implied, in a sensitivity analysis (see Sensitivity analysis). We excluded quasi‐randomised studies, such as those that allocate intervention by alternate days of the week. Where people were given additional treatments as well as a benzodiazepine we only included studies and analysed the data if the adjunct treatment was evenly distributed between groups and it was only the benzodiazepine that was randomised.

Types of participants

Adults, however defined, with catatonia and schizophrenia or other serious mental illnesses, including schizophreniform disorder, schizoaffective disorder,and bipolar disorder, by any means of diagnosis.

Types of interventions

1. Benzodiazepines of any type, dose and means of administration; compared with
2. Any other class of pharmacological agent at any dose;
3. Placebo;
4. Electroconvulsive therapy.

Types of outcome measures

We would have divided all outcomes into short term (up to 12 weeks), medium term (13 to 26 weeks) and long term (over 26 weeks). Short‐term studies would also be distinguished according to whether they were single injection, one day or longer studies.

Primary outcomes
1. Catatonia

1.1 Clinically important change in catatonia symptoms (as defined by each study)

2. Hospital and service outcomes

2.1 Duration of stay in hospital

3. Satisfaction with care

3.1 Clinically important change in satisfaction (recipient or informal care‐giver)

4. Adverse effect

4.1 Adverse effects ‒ incidence of clinically important adverse effect

Secondary outcomes
1. Catatonia

1.1 Any change in symptoms ‒ as defined by individual studies (binary or continuous measures)

2. Global state

2.1 Clinically important change in global state (e.g. clinical response as defined by the individual studies ‒ e.g. global impression rated as 'much improved', or more than 50% improvement on a rating scale)
2.2 Relapse ‒ as defined by each study
2.3 Any change in global state
2.4 Average endpoint or change score global state scale
2.5 Use of other medications

3. Satisfaction with care

3.1 Clinically important change in satisfaction ‒ professional carers.

4. Adverse effects

4.1 General adverse effects

4.1.1 At least one adverse effect
4.1.2 Clinically important adverse effects
4.1.3 Average endpoint/change scores adverse‐effect scales

4.2 Specific adverse effects ‒ clinically important ‒ as defined by each of the studies

4.2.1 Respiratory depression
4.2.2 Hypotension
4.2.3 Syncope
4.2.4 Ataxia
4.2.5 Hepatitic (e.g. abnormal transaminase, abnormal liver function)
4.2.6 Haematology (e.g. haemogram, leukopenia, agranulocytosis/neutropenia)
4.2.7 Various other
4.2.8 Average endpoint or change score on specific adverse effect scale
4.2.9 Death

5. Leaving the study early

5.1 Due to any reason
5.2 Due to specific reason

6. General functioning

6.1 Overall

6.1.1 Clinically important change in general functioning ‒ as defined by each of the studies, including working ability
6.1.2 Any change in general functioning ‒ as defined by each of the studies, including working ability
6.1.3 Average endpoint or change score on general functioning scale

6.2 Specific

6.2.1 Clinically important change in specific aspects of functioning, such as life skills ‒ as defined by each of the studies
6.2.2 Any change in specific aspects of functioning, such as life skills ‒ as defined by each of the studies
6.2.3 Average endpoint or change score on specific aspects of functioning scale, such as life skills ‒ as defined by each of the studies

'Summary of findings' table

We used the GRADE approach to interpret findings (Schünemann 2008); and used GRADE profiler to import data from Review Manager 5 (RevMan 5) to create a 'Summary of findings' tables (GRADEpro; Review Manager). This table provides outcome‐specific information concerning the overall quality of evidence from the included study, the magnitude of effect of the interventions examined, and the sum of available data on all outcomes we rated as important to patient care and decision making. We selected the following main outcomes for inclusion in the 'Summary of findings' table:

  1. Catatonia: clinically important change in symptoms

  2. Service use: duration of stay in hospital

  3. Satisfaction with care: clinically important change in satisfaction by informal care‐givers

  4. Satisfaction with care: clinically important change in satisfaction by recipients of care

  5. Global state: clinically important change in global state (as defined by individual studies)

  6. Adverse effects: incidence of clinically important adverse effect

  7. General functioning: clinically important change in specific aspects of functioning, such as life skills ‒ as defined by each of the studies

Search methods for identification of studies

Electronic searches

On 9 November 2016 and 6 February 2019, the information specialist searched Cochrane Schizophrenia's study‐based register of trials using the following search strategy:

(*Catatonia* in Health Care Condition) AND (*Benzodiazepine* in Intervention) of STUDY

In such study‐based registers, searching the major concept retrieves all the synonyms and relevant studies because all the studies have already been organised based on their interventions and linked to the relevant topics (Shokraneh 2017).

This register is compiled by systematic searches of major resources (CENTRAL, MEDLINE, Embase, AMED, BIOSIS, CINAHL, ClinicalTrials.Gov, ISRCTN, PsycINFO, PubMed, WHO ICTRP) and their monthly updates, ProQuest Dissertations and Theses A&I and its quarterly update, Chinese databases (CBM, CNKI, and Wanfang) and their annual updates, handsearches, grey literature, and conference proceedings (see Group's website). There are no language, date, document type, or publication status limitations for inclusion of records in the register.

For previous searches, please see Appendix 1.

Searching other resources

1. Reference searching

We would have inspected references of all included studies for further relevant studies.

2. Personal contact

We would have contacted the first author of each included study for information regarding unpublished trials. We would have noted the outcome of this contact in the 'Characteristics of included studies' or 'Characteristics of studies awaiting classification' tables.

Data collection and analysis

Selection of studies

Review authors RG, GW and HZ independently inspected citations from the searches and identified relevant abstracts; we independently re‐inspected all abstracts. If disputes arose, we acquired the full report for more detailed scrutiny. We then independently inspected full reports of the abstracts or reports meeting the review criteria. Where it was not possible to resolve disagreement by discussion, we attempted to contact the authors of the study concerned for clarification.

Data extraction and management

1. Extraction

Review author HZ and JO‐O (see Acknowledgements) independently extracted data from the included study. We presented data from graphs and figures only if there was consistency in extraction between HZ and JO‐O. If studies had been multi‐centre, then where possible we would have extracted data relevant to each. We would have discussed any disagreement and documented our decisions. If necessary, we would have attempted to contact authors through an open‐ended request in order to obtain missing information or for clarification.

2. Management
2.1 Forms

We extracted data onto a standard, pre‐designed, simple form.

2.2 Scale‐derived data

We included continuous data from rating scales only if:

a) the psychometric properties of the measuring instrument had been described in a peer‐reviewed journal (Marshall 2000); and
b) the measuring instrument had not been written or modified by one of the trialists for that particular trial.
c) the instrument was a global assessment of an area of functioning and not sub‐scores which are not, in themselves, validated or shown to be reliable.

Ideally the measuring instrument should either be i. a self‐report or ii. completed by an independent rater or relative (not the therapist). We realise that this is not often reported clearly; in 'Description of studies' we would have noted if this was the case or not.

2.3 Endpoint versus change data

There are advantages of both endpoint and change data: change data can remove a component of between‐person variability from the analysis; however, calculation of change needs two assessments (baseline and endpoint) that can be difficult to obtain in unstable and difficult‐to‐measure conditions such as schizophrenia. We decided primarily to use endpoint data, and only use change data if the former were not available. If necessary, we would have combined endpoint and change data in the analysis, as we aimed to use mean differences (MDs) rather than standardised mean differences (SMDs) throughout (Higgins 2011a).

2.4 Skewed data

Continuous data on clinical and social outcomes are often not normally distributed. To avoid the pitfall of applying parametric tests to non‐parametric data, we applied the following standards to relevant continuous data before inclusion.

For endpoint data from studies including fewer than 200 participants:

a) when a scale starts from the finite number zero, we subtracted the lowest possible value from the mean, and divided this by the standard deviation. If this value is lower than one, it strongly suggests that the data are skewed and we excluded these data. If this ratio is higher than one but less than two, there is suggestion that the data are skewed: we entered these data and, where possible, tested whether their inclusion or exclusion would change the results substantially. Finally, if the ratio is larger than two we would have included these data, because it is less likely that they are skewed (Altman 1996; Higgins 2011a).

b) if a scale starts from a positive value (such as the Positive and Negative Syndrome Scale (PANSS), which can have values from 30 to 210 (Kay 1986)), we would have modified the calculation described above to take the scale starting point into account. In these cases skewed data are present if 2 SD > (S − S min), where S is the mean score and 'S min' is the minimum score.

Please note: we would have entered all relevant data from studies of more than 200 participants in the analysis irrespective of the above rules, because skewed data pose less of a problem in large studies. We also would have entered all relevant change data, as when continuous data are presented on a scale that includes a possibility of negative values (such as change data), it is difficult to tell whether or not data are skewed.

2.5 Common measurement

To facilitate comparison between trials, where relevant we would have converted variables that can be reported in different metrics, such as days in hospital (mean days per year, per week or per month) to a common metric (e.g. mean days per month).

2.6 Conversion of continuous to binary

Where possible, we would have made efforts to convert outcome measures to dichotomous data. This can be done by identifying cut‐off points on rating scales and dividing participants accordingly into 'clinically improved' or 'not clinically improved'. It is generally assumed that if there is a 50% reduction in a scale‐derived score such as the Brief Psychiatric Rating Scale (BPRS) (Overall 1962), or the PANSS (Kay 1986), this could be considered as a clinically significant response (Leucht 2005a, Leucht 2005b). If data based on these thresholds were not available, we would have used the primary cut‐off presented by the original authors.

2.7 Direction of graphs

Where possible, we would have entered data in such a way that the area to the left of the line of no effect indicates a favourable outcome for benzodiazepines. Where keeping to this made it impossible to avoid outcome titles with clumsy double‐negatives (e.g. 'not un‐improved') we would have reported data where the left of the line indicates an unfavourable outcome and noted this in the relevant graphs.

Assessment of risk of bias in included studies

Review authors RG and GW would have worked independently to assess risk of bias by using criteria described in the Cochrane Handbook for Systematic Reviews of Interventions to assess trial quality (Higgins 2011b). This set of criteria is based on evidence of associations between potential overestimation of effect and the level of risk of bias of the article that may be due to aspects of sequence generation, allocation concealment, blinding, incomplete outcome data and selective reporting, or the way in which these 'domains' are reported.

If the raters disagreed, we would have made the final rating by consensus. Where inadequate details of randomisation and other characteristics of trials were provided, we would have attempted to contact authors of the studies in order to obtain further information. We would have reported non‐concurrence in quality assessment, but if disputes arose regarding the category to which a trial was to be allocated, we would have resolved this by discussion.

We would have noted the level of risk of bias in both the text of the review, Figure 1, Figure 2, and the 'Summary of findings' table/s.


Study flow diagram for searches up to February 2019

Study flow diagram for searches up to February 2019


Risk of bias graph: review authors' judgements about each risk of bias item presented as percentages across all included studies.

Risk of bias graph: review authors' judgements about each risk of bias item presented as percentages across all included studies.

Measures of treatment effect

1. Binary data

For binary outcomes we would have calculated a standard estimation of the risk ratio (RR) and its 95% confidence interval (CI), as it has been shown that RR is more intuitive than odds ratios (Boissel 1999); and that odds ratios tend to be interpreted as RR by clinicians (Deeks 2000). Although the number needed to treat for an additional beneficial outcome (NNTB) and the number needed to treat for an additional harmful outcome (NNTH), with their CIs, are intuitively attractive to clinicians, they are problematic to calculate and interpret in meta‐analyses (Hutton 2009). For binary data presented in the 'Summary of findings' table/s we, where possible, would have calculated illustrative comparative risks.

2. Continuous data

For continuous outcomes we would have estimated MD between groups. We preferred not to calculate effect size measures (SMD). However if scales of very considerable similarity were used, we would have presumed there was a small difference in measurement, and we would have calculated effect size and transformed the effect back to the units of one or more of the specific instruments.

Unit of analysis issues

1. Cluster trials

Studies increasingly employ 'cluster randomisation' (such as randomisation by clinician or practice), but analysis and pooling of clustered data poses problems. Firstly, authors often fail to account for intra‐class correlation in clustered studies, leading to a unit‐of‐analysis error whereby P values are spuriously low, CIs unduly narrow and statistical significance overestimated (Divine 1992). This causes type I errors (Bland 1997; Gulliford 1999).

Should clustering have been incorporated into the analysis of primary studies, we would have presented these data as if from a non‐cluster randomised study, but adjusted for the clustering effect.

Should clustering not have been accounted for in primary studies, we would have presented data in a table, with a (*) symbol to indicate the presence of a probable unit of analysis error. We would have contacted first authors of studies to obtain intra‐class correlation coefficients for their clustered data and adjusted for this by using accepted methods (Gulliford 1999).

We sought statistical advice and have been advised that the binary data from cluster trials presented in a report should be divided by a 'design effect'. This is calculated using the mean number of participants per cluster (m) and the intra‐class correlation coefficient (ICC): thus design effect = 1 + (m − 1) * ICC (Donner 2002). If the ICC had not been reported we would have assumed it to be 0.1 (Ukoumunne 1999).

If cluster studies have been appropriately analysed and intra‐class correlation coefficients and relevant data documented in the report taken into account, synthesis with other studies would have been possible using the generic inverse variance technique.

2. Cross‐over trials

A major concern of cross‐over trials is the carry‐over effect. This occurs if an effect (e.g. pharmacological, physiological or psychological) of the treatment in the first phase is carried over to the second phase. As a consequence, participants can differ significantly from their initial state at entry to the second phase, despite a wash‐out phase. For the same reason cross‐over trials are not appropriate if the condition of interest is unstable (Elbourne 2002). As both carry‐over and unstable conditions are very likely in severe mental illness, we would have only used data from the first phase of cross‐over studies.

3. Studies with multiple treatment groups

Where a study involved more than two treatment arms, if relevant we would have presented the additional treatment arms in comparisons. If data were binary we simply would have added these and combined within the two‐by‐two table. If data were continuous we would have combined data following the formula for combining data in the Cochrane Handbook for Systematic Reviews of Interventions (Higgins 2011a). Where additional treatment arms were not relevant, we would not have reproduced these data.

Dealing with missing data

1. Overall loss of credibility

At some degree of loss of follow‐up, data must lose credibility (Xia 2009). We chose that, for any particular outcome, should more than 50% of data be unaccounted for we would not reproduce these data or use them within analyses. If, however, more than 50% of those in one arm of a study had been lost, but the total loss had been less than 50%, we would have addressed this within the 'Summary of findings' table/s by down‐rating quality. Finally, we would have also downgraded quality within the 'Summary of findings' table/s should the loss have been 25% to 50% in total.

2. Binary

In the case where attrition for a binary outcome is between 0% and 50% and where these data are not clearly described, we would have presented data on a 'once‐randomised‐always‐analyse' basis (an intention‐to‐treat analysis (ITT)). Those leaving the study early would have all been assumed to have the same rates of negative outcome as those who completed, with the exception of the outcome of death and adverse effects. For these outcomes the rate of those who stayed in the study — in that particular arm of the trial — would have been used for those who did not. We would have undertaken a sensitivity analysis testing how prone the primary outcomes would have been to change when data only from people who completed the study to that point were when compared to the intention‐to‐treat analysis using the above assumptions.

3. Continuous
3.1 Attrition

We would have used data where attrition for a continuous outcome was between 0% and 50%, and data only from people who completed the study to that point were reported.

3.2 Standard deviations

If standard deviations (SDs) had not been reported, we would have tried to obtain the missing values from the authors. If these were not available, where there were missing measures of variance for continuous data, but an exact standard error (SE) and CIs were available for group means, and either P value or t value were available for differences in mean, we would have been able to calculate SDs according to the rules described in the Cochrane Handbook for Systematic Reviews of Interventions (Deeks 2011). When only the SE is reported, SDs are calculated by the formula SD = SE * √(n). The Cochrane Handbook for Systematic Reviews of Interventions presents detailed formulae for estimating SDs from P, t or F values, CIs, ranges or other statistics (Deeks 2011). If these formulae did not apply, we would have calculated the SDs according to a validated imputation method which is based on the SDs of the other included studies (Furukawa 2006). Although some of these imputation strategies can introduce error, the alternative would be to exclude a given study’s outcome and thus to lose information. Nevertheless, we would have examined the validity of the imputations in a sensitivity analysis that excludes imputed values.

3.3 Assumptions about participants who left the trials early or were lost to follow‐up

Various methods are available to account for participants who left the trials early or were lost to follow‐up. Some trials just present the results of study completers; others use the method of last observation carried forward (LOCF); while more recently, methods such as multiple imputation or mixed‐effects models for repeated measurements (MMRM) have become more of a standard. While the latter methods seem to be somewhat better than LOCF (Leon 2006), we feel that the high percentage of participants leaving the studies early and differences between groups in their reasons for doing so is often the core problem in randomised schizophrenia trials. We therefore did not exclude studies based on the statistical approach used. However, by preference we would have used the more sophisticated approaches, i.e. we would have preferred to use MMRM or multiple‐imputation to LOCF, and we would have only presented completer analyses if some kind of ITT data were not available at all. Moreover, we would have addressed this issue in the item 'Incomplete outcome data' of the 'Risk of bias' tool.

Assessment of heterogeneity

1. Clinical heterogeneity

We would have considered all included studies initially, without seeing comparison data, to judge clinical heterogeneity. We would have simply inspected all studies for participants who are clearly outliers or situations that we had not predicted would arise and, where found, discussed such situations or participant groups.

2. Methodological heterogeneity

We would have considered all included studies initially, without seeing comparison data, to judge methodological heterogeneity. We would have simply inspected all studies for clearly outlying methods which we had not predicted would arise and discussed any such methodological outliers.

3. Statistical heterogeneity
3.1 Visual inspection

We would have inspected graphs visually to investigate the possibility of statistical heterogeneity.

3.2 Employing the I² statistic

We would have investigated heterogeneity between studies by considering the I² statistic alongside the Chi² P value. The I² statistic provides an estimate of the percentage of inconsistency thought to be due to chance (Higgins 2003). The importance of the observed value of I² depends on the magnitude and direction of effects as well as the strength of evidence for heterogeneity (e.g. P value from Chi² test, or a confidence interval for I²). We would have interpreted an I² estimate greater than or equal to 50% and accompanied by a statistically significant Chi² statistic as evidence of substantial heterogeneity (Deeks 2011). When substantial levels of heterogeneity had been found in the primary outcome, we would have explored reasons for heterogeneity (Subgroup analysis and investigation of heterogeneity).

Assessment of reporting biases

Reporting biases arise when the dissemination of research findings is influenced by the nature and direction of results (Egger 1997). These are described in Chapter 10 of the Cochrane Handbook for Systematic Reviews of Interventions (Sterne 2011). We are aware that funnel plots may be useful in investigating reporting biases, but are of limited power to detect small‐study effects. We would not have used funnel plots for outcomes where there were 10 or fewer studies, or where all studies were of similar size. In other cases, where funnel plots were possible, we would have sought statistical advice in their interpretation.

Data synthesis

We understand that there is no closed argument for preference for use of fixed‐effect or random‐effects models. The random‐effects method incorporates an assumption that the different studies are estimating different, yet related, intervention effects. This often seems to be true to us and the random‐effects model takes into account differences between studies, even if there is no statistically significant heterogeneity. There is, however, a disadvantage to the random‐effects model: it puts added weight onto small studies, which often are the most biased ones. Depending on the direction of effect, these studies can either inflate or deflate the effect size. We would have chosen a fixed‐effect model for all analyses.

Subgroup analysis and investigation of heterogeneity

1. Subgroup analyses

We did not anticipate any subgroup analyses.

2. Investigation of heterogeneity

We would have reported if inconsistency was high. Firstly, we would have investigated whether data had been entered correctly. Secondly, if data were correct, we would have inspected the graph visually and removed outlying studies successively to see if homogeneity could be restored. For this review we decided that should this occur with data contributing to the summary finding of no more than 10% of the total weighting, we would have presented data. If not, we would not have pooled these data and discussed any issues. We know of no supporting research for this 10% cut‐off but are investigating use of prediction intervals as an alternative to this unsatisfactory state.

In cases where unanticipated clinical or methodological heterogeneity was obvious, we simply would have stated hypotheses regarding these for future reviews or versions of this review. We did not anticipate undertaking analyses relating to these.

Sensitivity analysis

1. Implication of randomisation

We would have included trials in a sensitivity analysis if they had been described in some way that implied randomisation. For primary outcomes, if the inclusion of these trials did not result in a substantive difference, they would have remained in the analyses. If their inclusion did result in statistically significant differences, we would not have added the data from these lower‐quality studies to the results of the higher‐quality trials, but presented these data within a subcategory.

2. Assumptions for lost binary data

Where assumptions would have to be made regarding people lost to follow‐up (see Dealing with missing data), we would have compared the findings of the primary outcomes when we used our assumption compared with completer data only. If there had been a substantial difference, we would have reported results and discussed them, but continued to employ our assumption.

Where assumptions would have to be made regarding missing SD data (see Dealing with missing data), we would have compared the findings of primary outcomes when we used our assumption compared with complete data only. We would have undertaken a sensitivity analysis to test how prone results were to change when completer data only were compared to the imputed data using the above assumption. If there had been a substantial difference, we would have reported results and discussed them, but continued to employ our assumption.

3. Risk of bias

We would have analysed the effects of excluding trials that were judged to be at high risk of bias across one or more of the 'Risk of bias' domains (implied as randomised with no further details available, allocation concealment, blinding and outcome reporting) for the meta‐analysis of the primary outcome. If the exclusion of trials at high risk of bias had not altered the direction of effect or the precision of the effect estimates substantially, then we would have included relevant data from these trials.

4. Imputed values

We would have undertaken a sensitivity analysis to assess the effects of including data from trials where we used imputed values for ICC in calculating the design effect in cluster‐randomised trials.

If we had noted substantial differences in the direction or precision of effect estimates in any of the sensitivity analyses listed above, we would not have pooled data from the excluded trials with the other trials contributing to the outcome, but would have presented them separately.

5. Fixed‐effect and random‐effects

We synthesised data using a fixed‐effect model; however, we also would have synthesised data for the primary outcome using a random‐effects model to evaluate whether this would have altered the significance of the results.

Results

Description of studies

For further descriptions of the included and excluded studies please see Characteristics of included studies and Characteristics of excluded studies.

Results of the search

1. 2007 searching

We found 130 citations using the search strategies. Of the 130 citations, we obtained 30 full‐text articles related to 22 studies for detailed inspection. We excluded 17 of these studies and we need further information from five studies.

2. 2016 and 2019 searching

We found no new records in these searches; however we were able extract data from one of the studies awaiting assessment and have now included it in the review (Schmider 1999). We have now excluded the other four studies (Merlis 1962; Patra 1998; Ungvari 1999; Wetzel 1997). One previously excluded study, Smith 1961, was another citation of Smith 1960 and we have added it as a reference to Smith 1960. This review now has one included study and 20 are excluded. Please also see Figure 1.

Included studies

We could only include one study (Schmider 1999 ).

1. Length of study

The study duration was three days. Baseline observations were conducted on the first day of the trial; on day two participants received the intervention; and on day three, patients were crossed over.

2. Participants

Twenty‐one participants participated in this trial. They presented with significant psychomotor retardation, mutism and on the Bech‐Rafaelsen Melancholia Scale (BPRS) were rated 3 in each of the criteria for psychomotor retardation. Participants had a range of diagnoses based on the DSM‐III‐R criteria: severe major depressive episode; major depressive episode with psychotic features; bipolar disorder and concurrently depressed; schizoaffective disorder and concurrently depressed; schizophrenic disorder; and schizophreniform disorder. The mean age of the participants was 50.8 years (range 21 to 77 years).

3. Setting

The trial took place in a hospital setting.

4. Study size

Twenty‐one participants were originally enlisted in the trial, four participants were excluded from the final analysis. Two required medication during baseline evaluation and two were later diagnosed with Parkinson’s disease. The final analysis included 17 participants: 13 women and four men.

5. Interventions

The trial was a direct comparison between two benzodiazepines (lorazepam and oxazepam).

5.1 Lorazepam group

Seven participants received lorazepam in the first arm of the trial. They were given 2 mg lorazepam sublingually as a single dose (on day 2) then crossed over to receive 60 mg oxazepam sublingually as a single dose (on day 3).

5.2 Oxazepam group

Ten patients received oxazepam in the first arm of the trial. They were given 60 mg oxazepam sublingually as a single dose (on day 2) then crossed over to receive 2 mg lorazepam sublingually as a single dose (on day 3).

6. Outcomes

The study only measured one outcome which was to evaluate the degree of catatonic‐like symptoms with the Visual Analogue Scale at baseline day 1, day 2 and day 3 of the trial after intervention exposure. Due to the cross‐over trial design we could only extract data from the graph for day 2 where patients either received lorazepam or oxazepam before the cross‐over took place.

6.1 Rating scales used by the included study

The study used a Visual Analogue Scale (VAS) to measure the degree of catatonic‐like symptoms. The VAS has been used in other studies to measure symptom control, for example dyspnoea, asthma, pain and total hip arthroplasty.

The VAS, which measures 100 mm in length, was used through the 3‐day trial and was administered by resident psychiatrists trained in the use of this scale. After the administration of the intervention, VAS was used every two hours on seven occasions between 08:00 and 20:00 hours during baseline (day 1), days 2 and 3. The interpretation was that the higher the value (closer to 100 mm) that the patient scored on the VAS scale, the more the patient was experiencing increased levels of catatonic symptoms. Lower VAS scores on the scale, i.e. closer to the 10‐mm end, indicated the patient was experiencing fewer catatonic‐like symptoms.

Excluded studies

We excluded 20 studies. Six were not randomised; eight included participants with schizophrenia or other severe mental illnesses but these participants did not also have, or it was not clear that they had, catatonia; two trials did not use benzodiazepines as a treatment intervention; one trial did not collect pre‐cross‐over data for the intervention of interest; in two of the trials there was no usable data for our outcomes (see Types of outcome measures); and in one trial no independent data were presented for the participants who had catatonia.

Only 10 of these were randomised. Many of both the randomised and the non‐randomised excluded studies did not focus specifically on treatment outcomes in people with catatonia. Of the 10 randomised studies, only three included more than one person with catatonia (Fischer 1974; Kunigiri 2002; Ungvari 1997); and of these, none employed benzodiazepines as a therapeutic intervention.

1. Studies awaiting assessment

There are no studies awaiting assessment.

2. Ongoing studies

We did not identify any ongoing studies.

Risk of bias in included studies

Please also see Characteristics of included studies and Figure 2; Figure 3


Risk of bias summary: review authors' judgements about each risk of bias item for each included study.

Risk of bias summary: review authors' judgements about each risk of bias item for each included study.

Allocation

There is no discussion within the Methods section of this study detailing whether participants were randomised to intervention arms but it is stated that the study was of a "double blind" design, implying randomisation. Methods for allocation concealment were not reported. We rated this study at high risk for selection bias.

Blinding

In the Methods section it states that this is a "double‐blind crossover study"; however no further information is provided as to how the participants were blinded to treatment, and there are no details concerning blinding of the personnel administering the VAS, i.e. if they were blinded to which intervention each participant received. We rated this unclear risk for performance and detection bias.

Incomplete outcome data

We rated this study at unclear risk for attrition bias. Data from four participants (19% of the total number of participants) were not used but the study clearly reported the reasons for this.

Selective reporting

There was only one measured outcome for this study: the degree of catatonic symptoms after each intervention was received, reported before and after cross‐over. However, it is to be noted that any adverse reactions to interventions were not reported during the trial. We rated the study to be at unclear risk for reporting bias.

Other potential sources of bias

The period of time between cross‐over from one drug to another could lead to a carry‐over effect from the previous drug which could contribute towards improved additive effect. The authors do mention the cross‐over time interval was "relatively short" due to ethical reasons preventing a longer unmediated period between the different medications. However, we only used data from the first arm of the trial and as no other potential sources of bias are apparent, we rated risk of bias to to be low for this domain.

Effects of interventions

See: Summary of findings for the main comparison Lorazepam compared to Oxazepam for catatonia in people with schizophrenia or other SMI

1. Lorazepam versus oxzepam

1.1 Catatonia: clinically important change in symptoms (improved at least 50% on VAS)

There was no clear difference between treatment groups (RR 0.95, 95% CI 0.42 to 2.16; participants = 17; studies = 1; very low quality evidence, Analysis 1.1).

2. Catatonia: any change in symptoms ‒ average total score (VAS, endpoint, high = poor)

There was no clear difference between treatment groups (MD 1.18, 95% CI −1.99 to 4.35; participants = 17; studies = 1, Analysis 1.2).

3. Missing outcomes

No data were available for any of our other outcomes listed in our protocol (see Types of outcome measures).

Discussion

Summary of main results

We found some small relevant trials, but we were only able to extract data for analysis from one study. This study was a direct comparison between two benzodiazepines and results showed no clear difference between the two treatments. The quality of the evidence is very low due to the low number of participants involved and risk of bias within this trial. Currently, to our knowledge, there is no high‐quality evidence available regarding the effects of benzodiazepines for catatonia in people with schizophrenia or similar SMI.

Overall completeness and applicability of evidence

The completeness of evidence available in this review is poor, only one small study was available. The applicability of the study is also poor. The dose of lorazepam (2 mg) used in the study was low compared to doses reported in literature for the management of catatonic symptoms (where doses used vary between 8 mg and 24 mg daily) (Dhossche 2016). The dose of oxazepam used in this study was based on the manufacturer's advice and in clinical practice is used rarely, lorazepam being preferred treatment of choice (Bush 1997). It is also important to note here that only a single dose of benzodiazepine was used in this trial. In clinical practice it is generally accepted that a person presenting with catatonic symptoms would continue to receive benzodiazepines until successful remission of symptoms (Gover 2011).

Quality of the evidence

The quality of evidence available for use in this review is 'very low' due to small sample size and high risks of bias in the methods. The need for well designed methodologically and clinically relevant RCTs with comprehensive reporting of short‐ and long‐term outcomes in relation to catatonic symptoms is required.

Potential biases in the review process

1. The search

It is feasible that we have failed to identify relevant studies by using a search that was biased. Every effort is made by Cochrane Schizophrenia to ensure that searches are as comprehensive as possible, but there is evidence that studies from low‐ and middle‐income countries are not well represented in some of the databases we employ, and it is from these countries that relevant trials may be more likely to come. We, however, do not think that we would have missed any large study as we think such a trial would have been widely reported.

Agreements and disagreements with other studies or reviews

Due to one study with a small data set being included in this review, we are unable to fully comment on agreement or disagreement with other studies or reviews. There still is insufficient trial‐derived evidence about the usefulness of benzodiazepines in the treatment of people with catatonia. At the time of writing this review, clinicians, patients and carers can base decisions about using benzodiazepines to treat people with catatonia only on anecdotal cases or small studies with some deficiencies in methodology or reporting of findings, or both. Such cases and studies have tended to support the use of benzodiazepines in the treatment of people with catatonia (Rosebush 1990).

Catatonia is a syndrome with a variety of disparate illnesses and range of symptoms responsive to benzodiazepines, although this response is poorly understood; it is dramatic and evident to see in patients. Due to the heterogeneous aetiology of catatonia it is difficult to predict how the anti‐catatonic effects of benzodiazepines are exerted; whether it is through the dopaminergic, GABA or cholinergic systems, it is unlikely that a single mode of action or mechanism will be found. Benzodiazepines remain the first choice of treatment and are regarded as safe, easy and effective to use with remission rates reported as high as 70% to 80% (Rosebush 2010). The most important indicator for efficacy in treatment is the dose used (Fink 2006). In the study included in this review the dose of lorazepam used was 2 mg as a single dose compared to more commonly used doses varying between 8 mg and 24 mg per day, which have been tolerated without any major adverse effects such as sedation (Dhossche 2016). Despite the low dose of lorazepam used, this was still associated with improvements in catatonic symptoms (4 out of 7 patients improved 50% on average for catatonic symptoms on day 2). Similarly, in the oxazepam intervention arm 6 out of 10 patients improved by 50% in catatonic symptoms but no statistical difference could be found (RR 0.95, 95% CI 0.42 to 2.16). It is also important to note that in clinical practice single dose administration of catatonic symptoms is very rarely used. Most authors suggest titrating doses according to response which can be seen usually within three to seven days of treatment (Daniels 2009). However, in the study included the total duration of trial was three days which could have been an inadequate time to see a clinically noticeable effect. Although lorazepam has generally been accepted as first line treatment for catatonia, successful use of diazepam (Hung 2006) and clonazepam (Lee 2000) has been reported in the literature. Analysis of results from this single study does not clearly demonstrate the effectiveness of benzodiazepines for treating catatonia (MD 1.18, 95% CI −1.99 to 4.35) which could be due to the small sample size and low doses used. From the data generated from this one study it is very difficult to determine and report whether one benzodiazapine is superior to another.

Study flow diagram for searches up to February 2019
Figures and Tables -
Figure 1

Study flow diagram for searches up to February 2019

Risk of bias graph: review authors' judgements about each risk of bias item presented as percentages across all included studies.
Figures and Tables -
Figure 2

Risk of bias graph: review authors' judgements about each risk of bias item presented as percentages across all included studies.

Risk of bias summary: review authors' judgements about each risk of bias item for each included study.
Figures and Tables -
Figure 3

Risk of bias summary: review authors' judgements about each risk of bias item for each included study.

Comparison 1 LORAZEPAM versus OXAZEPAM, Outcome 1 Catatonia: Clinically important change in symptoms (improved at least 50% on the VAS).
Figures and Tables -
Analysis 1.1

Comparison 1 LORAZEPAM versus OXAZEPAM, Outcome 1 Catatonia: Clinically important change in symptoms (improved at least 50% on the VAS).

Comparison 1 LORAZEPAM versus OXAZEPAM, Outcome 2 Catatonia: Any change in symptoms ‐ average total score (VAS, endpoint, high = poor).
Figures and Tables -
Analysis 1.2

Comparison 1 LORAZEPAM versus OXAZEPAM, Outcome 2 Catatonia: Any change in symptoms ‐ average total score (VAS, endpoint, high = poor).

Table 1. Suggested design of study

Methods

Participants

Interventions

Outcomes

Notes

Allocation: randomised, block, well described.
Blinding: double, well described and tested.
Duration: 12 to 24 weeks.

Diagnosis: catatonia.
N = 300.*
Age: adults.
Sex: both.
History: clearly reported.

1. Lorazepam 4 mg/day. N = 150.
2. Placebo. N = 150.

General: relapse, general impression of clinician (CGI), carer/other (CGI), compliance with treatment, healthy days, time in hospital, satisfaction with care.
Mental state: CGI.
Catatonic symptoms: BFCRS

Quality of life: CGI.
Family burden: CGI.
Social functioning: return to everyday living for 80% of time.*
Adverse events: any adverse event recorded.
Economic outcomes.

* powered to be able to identify a difference of ˜20% between groups for primary outcome with adequate degree of certainty.

CGI: Clinical Global Impression scale; BFCRS: Bush‐Francis Catatonia Rating Scale

Figures and Tables -
Table 1. Suggested design of study
Summary of findings for the main comparison. Lorazepam compared to Oxazepam for catatonia in people with schizophrenia or other SMI

Lorazepam compared to oxazepam for catatonia in people with schizophrenia and/or SMI

Patient or population: catatonia in people with schizophrenia or SMI
Setting: hospital
Intervention: lorazepam
Comparison: oxazepam

Outcomes

Anticipated absolute effects* (95% CI)

Relative effect
(95% CI)

№ of participants
(studies)

Certainty of the evidence
(GRADE)

Comments

Risk with oxazepam

Risk with lorazepam

Catatonia: clinically important change in catatonia symptoms (improved at least 50% on the VAS)

Study population

RR 0.95
(0.42 to 2.16)

17
(1 RCT)

⊕⊝⊝⊝
VERY LOW 1 2 3

600 per 1000

570 per 1000
(252 to 1000)

Service use: duration of stay in hospital

not reported

Satisfaction with care: clinically important change in satisfaction by informal care‐givers

not reported

Satisfaction with care: clinically important change in satisfaction by recipients of care

not reported

Global state: clinically important change in global state (as defined by individual studies)

not reported

Adverse effects: incidence of clinically important adverse effect

not reported

General functioning: clinically important change in specific aspects of functioning, such as life skills ‒ as defined by each of the studies

not reported

*The risk in the intervention group (and its 95% CI) is based on the assumed risk in the comparison group and the relative effect of the intervention (and its 95% CI).

CI: Confidence interval; RR: Risk ratio; OR: Odds ratio

GRADE Working Group grades of evidence
High certainty: We are very confident that the true effect lies close to that of the estimate of the effect
Moderate certainty: We are moderately confident in the effect estimate: the true effect is likely to be close to the estimate of the effect, but there is a possibility that it is substantially different
Low certainty: Our confidence in the effect estimate is limited: the true effect may be substantially different from the estimate of the effect
Very low certainty: We have very little confidence in the effect estimate: the true effect is likely to be substantially different from the estimate of effect

1 Serious: downgraded by 1 level due to unclear study design whether any of the following factors were considered; randomisation, allocation concealment, blinding of participants and personnel and blinding of outcome assessment. The study did imply this was a "double‐blind crossover study" but no further description or details provided discussing study design. Study did not report adverse events. Data from 4 participants were excluded in the final analysis.

2 Serious: downgraded by 1 level. Intervention was only delivered once and outcomes measured. Reporting outcomes were measured for one day only after intervention. No long‐term points for outcome measurements. Fixed doses of both interventions used in participants.

3 Serious: downgraded by 1 level due to effect size from 1 small study with a sample size of 17 participants. Wide ranging CI interval around effect estimate.

Figures and Tables -
Summary of findings for the main comparison. Lorazepam compared to Oxazepam for catatonia in people with schizophrenia or other SMI
Comparison 1. LORAZEPAM versus OXAZEPAM

Outcome or subgroup title

No. of studies

No. of participants

Statistical method

Effect size

1 Catatonia: Clinically important change in symptoms (improved at least 50% on the VAS) Show forest plot

1

17

Risk Ratio (M‐H, Fixed, 95% CI)

0.95 [0.42, 2.16]

2 Catatonia: Any change in symptoms ‐ average total score (VAS, endpoint, high = poor) Show forest plot

1

17

Mean Difference (IV, Fixed, 95% CI)

1.18 [‐1.99, 4.35]

Figures and Tables -
Comparison 1. LORAZEPAM versus OXAZEPAM