Introduction

In the UK, around 30,000 women per year undergo breast conserving treatment (BCT) for breast cancer [1]. The combination of surgery and radiotherapy has achieved good local control rates [2, 3] and is equivalent to mastectomy. However, the technical challenges of completely excising tumour whilst also re-shaping breast tissue to provide an aesthetically acceptable outcome yield variable cosmetic results and variable patient satisfaction. Therefore, although the long-term success of BCT is measured primarily by local control, increased survivorship (87% at 5 years) [4], demands that the physical and psychological effects of treatment, especially long-term effects, are addressed. Dissatisfaction with the appearance after treatment acts as a constant reminder of the disease and affects a woman’s psychological wellbeing [5,6,7].

Assessment of aesthetic outcome has, to date, remained a challenge for breast and plastic surgeons alike. There is no gold standard or consensus on which factors should be assessed and who should undertake the analysis [8]. The mainstays of assessment are subjective, using either patient-reported outcome measures (PROMs) [9,10,11,12] or panel assessment of appearance [13, 14]. More recently, objective assessments have been developed in the form of the Breast Analysing Tool (BAT©) [15] and the Breast Cancer Conservative Treatment cosmetic result software (BCCT.core) [16, 17] whereby a single photograph (anterior view) of the patient is analysed to give an aesthetic outcome score between 1 and 4.

Three-dimensional surface imaging (3D-SI) has been used as a marketing tool in aesthetic breast surgery. Three-dimensional (3D) images are taken and manipulated to illustrate to potential patients how they may appear after augmentation or mammoplasty/mastopexy. A survey of 1067 plastic surgeons in America revealed that 15% of surgeons are using 3D-SI technology in their practice [18]. We have reported on the use of 3D-SI as a research and clinical tool in aesthetic, oncoplastic and reconstructive breast surgery [19] using linear distances and 3D measurements such as volume and symmetry. Several other studies have validated its use in volume measurement [20,21,22,23,24,25,26,27,28,29] as a surgical planning tool. The aim of this study was to establish whether symmetry of volume and shape on 3D-SI following BCT can be used as surrogate markers of aesthetic outcome (as judged by a panel assessment or by PROMs).

Methods

Patient recruitment

Research Ethical Committee (REC) approval was obtained (ClinicalTrials.gov Identifier: NCT02304614). Women aged ≥18 years who had undergone BCT (wide local excision and adjuvant whole breast radiotherapy) for an invasive or in situ carcinoma in our unit (four permanent surgeons) between one and six years before the start of the study met the inclusion criteria. The following patients were excluded: those who developed recurrent (local or distant) disease since BCT, those who had previously or subsequently undergone surgery to the index or contralateral breast and those who were unable to undergo 3D-SI (e.g. unable to stand for 5 min).

Patients were invited to participate in the study by letter, followed up by a telephone call. Those who agreed to participate were offered an appointment for 3D-SI before or after their mammogram or at another mutually convenient time. The 3D-SI images were taken in the medical photography department.

3D-SI breast imaging

Participants were imaged using the Vectra-XT 3D-SI system with their hands on hips. Volume symmetry was calculated by dividing the volume of the smaller breast by the volume of the larger breast and converting to a percentage value. The closer the result to 100, the more symmetrical the breasts were in terms of volume. Shape symmetry was calculated by bisecting the 3D image vertically along the midline and reflecting the image of the left breast onto the right. The root mean squared (RMS) of the distances between the two superimposed breast surfaces was calculated in millimetres (Fig. 1). The lower the score the more symmetrical the breasts are in terms of shape.

Fig. 1
figure 1

Example of calculation of shape symmetry using the root mean squared (RMS). Reproduced with the permission of Canfield Scientific. The left breast image is reflected onto the right. In the first image a geometric pattern is applied to one breast image. The perpendicular distances from all the interception points of the grid to the other breast is calculated (second and third image)

Panel assessment

The panel consisted of four members, two breast surgeons, one clinical oncologist and one breast care nurse. Panellists were asked to rate the participants’ images according to the 4-point Harvard cosmesis scale (Table 1). Initially, the scoring system was to assess the effect of radiotherapy [13] but has since then been adapted to a cosmetic outcome [30]. The 3D images were rotated so that they could be reviewed from either side, cranially looking downwards towards the cleavage and caudally upwards from the feet towards the inferior mammary fold (IMF). The panel scored independently and then the opinion of the majority was assigned as a consensus score. If there was disparity in scoring the images, a discussion ensued to reach a consensus; the method used in other studies with a cosmetic outcome endpoint [31,32,33]. To evaluate the test–retest variation, 10% of all the images were randomly shown twice and the two scores compared.

Table 1 4-Point Harvard cosmesis scale

Patient satisfaction

The BREAST-Q is a questionnaire devised by the Memorial Sloan-Kettering Cancer Center to elicit and quantify patient perception of outcomes after breast surgery [12, 34]. It has been developed using extensive patient input and Rasch psychometric methods [35, 36] to measure patient satisfaction and quality of life. Modules have been developed for patients undergoing mastectomy, breast reconstruction and most recently Breast Conserving Therapy (BCT). This module contains eight domains. However, only the ‘Satisfaction with breasts’ domain and some of its sub-questions were used in this study. Each domain contains sub-questions and the ‘raw’ score is transformed to a score ranging from 0 to 100, where a higher score indicates greater satisfaction. We have previously reported results of all of the domains of the BREAST-Q BCT module [37].

Statistical analysis

Demographics were presented as descriptive statistics using mean and standard deviation or median and IQR range, as appropriate, after testing for normality. Categorical data were presented as proportions and frequencies with 95% confidence intervals (CIs) where appropriate.

The Kruskal–Wallis test was used to assess the relationship between categorical (e.g. panel assessment, sub-questions from ‘Satisfaction with breasts’) and continuous variables (e.g. volume symmetry, shape symmetry, ‘Satisfaction with breasts’), after testing for normality. The Dunn–Sidak test (a post hoc adjusted pairwise comparison) was used to identify between which pairs of categorical results the significant differences lay. Any pairwise comparisons with a P < 0.05 were considered statistically significant.

Spearman’s rho correlations were used to assess the relationship between two continuous variables after testing for normality. A correlation coefficient result between 0.00 and 0.019 indicates very weak correlation, 0.20 and 0.39 is weak, 0.40 and 0.59 is moderate, 0.60 and 0.79 is strong and 0.80 and 1.0 is very strong.

A kappa statistic was used for comparison of two categorical datasets. A value of 0 indicates no agreement, 0–0.20 is slight, 0.21–0.40 is fair, 0.41–0.60 is moderate, 0.61–0.80 is substantial, and 0.81–1 is almost perfect. Analysis was undertaken using STATA (STATA, Inc., Texas).

Results

Between 01/04/2015 and 31/10/2015, 649 women were scheduled to have a surveillance mammogram. Three hundred and forty two (53.7%) women were eligible and had a mammogram booked at a time when the investigator (ROC) was available. All were invited but 109 were not contactable by phone to confirm participation. Of the 233 women who were contactable, 206 (88.4%) agreed to participate and 27 (11.6%) declined. In total, 200 (85.8%) women participated. The clinicopathological characteristics of the study population are summarised in Table 2.

Table 2 Summary of study participants’ clinicopathological characteristics

Median panel score was 3 (IQR 2–4). Eight (4%) participants were assigned a panel score of poor, 62 (31%) scored fair, 78 (39%) scored good and 52 (26%) scored excellent. For the 10% test–retest validation of the panel assessment, the weighted agreement was 98.3%, Kappa 0.96 (almost perfect), (P < 0.001). Median score for ‘Satisfaction with breasts’ was 69.5 (IQR 31–100).

The median treated breast volume was 456 cm3 (IQR 323–680 cm3), median untreated breast volume was 493 cm3 (IQR 340–740 cm3). The median volume symmetry was 87% (IQR 78–93%) (Fig. 2). The median shape symmetry, RMS, was 5.9 mm (IQR 4.2–8.0 mm) (Fig. 3).

Fig. 2
figure 2

Frequency distribution of volume symmetry

Fig. 3
figure 3

Frequency distribution of shape symmetry, root mean squared (RMS)

Relationship of objective 3D-SI measurements of volume and shape symmetry with panel assessment

Volume symmetry

The median volume symmetry measurements differed between panel assessment groups (Kruskal–Wallis test, P = 0.028) (Table 3). Post hoc pairwise comparison using Dunn–Sidak test further demonstrated that these differences existed between panel scores ‘fair’ and ‘good’, as well as between ‘fair’ and ‘excellent’ (Fig. 4). Participants deemed ‘poor’ by the panel did not have a significantly different volume symmetry from those deemed ‘good’ (P = 0.645) or ‘excellent’ (P = 0.528). This indicates that a ‘poor’ panel assessment did not necessarily relate to volume symmetry but may be biased by the very small numbers of patients in this category.

Table 3 Volume and shape symmetry according to 3D-SI panel scores
Fig. 4
figure 4

Box and whisker plot demonstrating volume symmetry (%) according to consensus panel assessment of aesthetic outcome. The horizontal lines within each box represent median scores, the outer horizontal lines of each box represent upper and lower quartiles, and the ends of the vertical lines represent minimum and maximum scores. On post hoc pair wise comparisons there was a significant difference in volume symmetry when comparing ‘fair’ with ‘good’ and ‘fair’ with ‘excellent’ panel scores. The other comparisons were not significant

Shape symmetry

The median shape symmetry measurements differed between panel assessment groups (Kruskal–Wallis test, P < 0.001) (Table 3). Post hoc pairwise comparison using the Dunn–Sidak test further demonstrated that these differences arose between panel scores ‘poor’ and ‘good’, ‘poor’ and ‘excellent’, ‘fair’ and ‘good’ as well as ‘fair’ and ‘excellent’ (Fig. 5). This indicates that shape symmetry was always significantly different between those assigned a poor or fair score and those considered to be good or excellent.

Fig. 5
figure 5

Box and whisker plot demonstrating shape symmetry RMS, mm) according to consensus panel assessment of aesthetic outcome. The horizontal lines within each box represent median scores, the outer horizontal lines of each box represent upper and lower quartiles, and the ends of the vertical lines represent minimum and maximum scores. On post hoc pair wise comparison there was a significant difference in shape symmetry when comparing poor with good, poor with excellent, fair with good and fair with excellent. The other comparisons were not significant

Relationship of objective 3D-SI measurements of volume and shape symmetry with patient satisfaction

Volume symmetry

There was a significant positive correlation between volume symmetry measurements and ‘Satisfaction with breasts’ scores; however, the correlation was very weak (Spearman’s correlation coefficient = 0.187, P = 0.008, r 2 = 0.033).

Shape symmetry

There was a significant negative correlation between shape symmetry measurements and ‘Satisfaction with breasts’ scores; however, the correlation coefficient was also weak (Spearman’s correlation coefficient = −0.229, P = 0.001, r 2 = 0.079).

Objective volume symmetry and ‘Satisfaction with breasts’ sub-question ‘How equal in size your breasts are to each other?’

Due to the nature of ‘Satisfaction with breasts’ being a global outcome score, the 3D volume and shape symmetry were tested against participants’ perceptions using the sub-questions relating to volume and symmetry.

The median volume symmetry measurements differed between groups according to patient-reported score for ‘How equal in size your breasts are to each other’ (Kruska-Wallis test, P < 0.001). Post hoc pairwise comparison using Dunn–Sidak test further demonstrated that there were significant differences between participants’ scores for equality of size ‘very dissatisfied’ and ‘very satisfied’ (P = 0.046), ‘somewhat dissatisfied’ and ‘somewhat satisfied’ (P = 0.008), ‘somewhat dissatisfied’ and ‘very satisfied’ (P = 0.006) (Table 4). This indicates that participants who were ‘very satisfied’ with how equal in size their breasts are had significantly better volume symmetry than those who answered ‘very dissatisfied’ or ‘somewhat dissatisfied’.

Objective surface symmetry and ‘Satisfaction with breasts’ sub-question ‘How much your breasts look the same?’

The median shape symmetry measurements were significantly different according to patient-reported score for ‘How much your breasts look the same’ (Kruskal–Wallis test, P < 0.001). Post hoc pairwise comparison using Dunn–Sidak test further demonstrated that these differences arose between participants’ scores ‘very dissatisfied’ and ‘somewhat satisfied’ (P = 0.017), ‘very dissatisfied’ and ‘very satisfied’ (P < 0.001), ‘somewhat dissatisfied’ and ‘very satisfied’ (P = 0.002) and ‘somewhat satisfied’ and ‘very satisfied’ (P = 0.002) (Table 4). This indicates that participants who were ‘very satisfied’ that their breasts look the same had a greater objective symmetry compared to participants who rated it as ‘very dissatisfied’, ‘somewhat dissatisfied’ and ‘somewhat satisfied’.

Table 4 Volume symmetry according to ‘How equal in size your breasts are to each other?’ and shape symmetry according to ‘How much your breasts look the same?’

Discussion

This study shows that 3D measures of volume and shape symmetry agree with panel assessment and to a lesser extent with PROMs, indicating that 3D surface imaging could contribute to an objective measure of aesthetic outcome. Currently, the mainstays of analysis of aesthetic outcome are panel assessment of 2D photographs and objective scoring software using 2D photographs. Panel assessments have limitations in the time and manpower required. Furthermore, they are difficult to standardise and there is no assurance that, even if the same scale is used, two panels would score with identical strictness or leniency. Objective analysis offers a potential solution but 2D photographs may not provide the ‘whole story’ due to the very nature of analysing a three-dimensional object in two dimensions. For example, 2D photographs do not demonstrate the projection and cleavage well, whereas in 3D-SI, the image of the patient can be rotated to view the cleavage and allow the reviewer to see the patient as she sees herself.

Patients also provide insight into their own assessment of the aesthetic outcome with PROMs, and this is a key outcome measure. However, there may be many confounding factors in the patients’ responses due to their pre-morbid state and the psychological impact of diagnosis and treatment of breast cancer such that PROMs alone cannot be used in evaluation of new surgical or radiotherapy techniques. We have shown that 3D-SI can contribute to these evaluations.

In this study, there were only two significant pairwise comparisons of volume symmetry out of a possible seven when assessing its relationship with panel assessment. It is possible that surgery and radiotherapy lift the breast on the chest wall so that the panel assessment of symmetry is poor but the objective volume symmetry is relatively good. Similarly, a patient may have a focal deformity, drawing the reviewer’s attention resulting in a low panel score but the overall objective volume is similar to the unaffected breast. Finally, the low association found between panel and volume symmetry may also have been due to the panel method used as the Harvard classification focuses on symmetry and deformity of the breasts rather than volume.

There was a better association between objective shape symmetry and panel score with four significant pairwise comparisons. Shape symmetry encompasses volume as part of the whole, just as a panel will evaluate appearance more globally and the Harvard score focuses on symmetry so it is unsurprising that there was a better association with shape than with volume symmetry. Our findings are in keeping with those of Henseler et al. [38] who found a significant relationship between the mean subjective panel assessment score and objective symmetry (correlation coefficient −0.62) when assessing forty-four patients who had undergone unilateral extended latissimus dorsi flap reconstruction.

Volume symmetry and shape symmetry were both correlated with patient ‘Satisfaction with breasts’. However, these correlations were weak such that these objective measurements are unlikely to replace a patient’s perception of aesthetic outcome. The patient satisfaction domain of the BREAST-Q encompasses many aspects of how the patient feels about the aesthetic outcome and, as mentioned previously, many biases can confound a patients’ satisfaction other than pure aesthetic outcome. These results may also reflect the way in which the BREAST-Q module was developed and it could be that volume and symmetry had low weighting in the overall scoring compared to other factors, for example, how the patient feels about her appearance unclothed. To assess this further, we investigated the relationship between the sub-question ‘How equal in size your breasts are to each other’ and volume symmetry. There were three significant pairwise comparisons indicating that patients’ subjective assessment on a 4-point scale correlates well with objective assessment of volume symmetry. Sub-question ‘How much your breasts look the same’ demonstrated even better association with shape symmetry, with four pairwise comparisons. A similar study was undertaken by Yip et al. [39] where 119 women who had undergone immediate or delayed breast reconstruction underwent 3D-SI and answered the BREAST-Q post-operative reconstruction module. Unlike this study, they found no correlation between volume symmetry and ‘Satisfaction with breasts’ but they did find that patients were able to perceive volume difference on answering the sub-questions. The authors did not assess shape symmetry. The heterogeneity of their patient population may account for the difference in results between their study and ours.

We believe that 3D-SI will have an important role in the evaluation of breast surgery and radiotherapy in the future. As the technology evolves to become more portable and lower cost, [40,41,42,43,44] it will be available to more units treating patients with breast cancer, and could be used as a robust outcome tool in multicentre surgical and radiotherapy studies as well as in internal audits and quality assurance studies.

Conclusion

Breast volume and shape symmetry measured using 3D-SI are both associated with panel assessment of breast appearance and patient satisfaction with the cosmetic outcome. Shape symmetry, in particular, showed greatest association with panel assessment, such that it may be possible to replace this with an objective outcome score encompassing shape symmetry and other parameters measured using 3D-SI.