Main

Breast cancer is a heterogeneous disease comprising tumours with remarkably distinct clinical behaviour (Simpson et al, 2005). In the last 20 years, a multitude of prognostic and predictive markers have been tested, however only oestrogen receptor, progesterone receptor and HER2 are currently used to tailor the therapy of breast cancer patients (Goldhirsch et al, 2005). The use of expression arrays to derive molecular profiles that are predictive of clinical outcome has received great attention in the last 5 years (van’t Veer et al, 2005; Reis-Filho et al, 2006b), especially in the neoadjuvant setting (Davidson and Morrow, 2005). Predictive signatures for different neoadjuvant chemotherapy regimens have been reported by some investigators but with less success by others (Chang et al, 2003; Ayers et al, 2004; Cleator and Ashworth, 2004; Chang et al, 2005a; Hannemann et al, 2005; Iwao-Koizumi et al, 2005; Modlich et al, 2005; Rouzier et al, 2005; Dressman et al, 2006; Reis-Filho et al, 2006b). Although some of these results are promising, issues related to the instability of mRNA, experimental design and data analysis have led many to call into question the validity of current approaches (Ransohoff, 2004; Brenton et al, 2005; Ioannidis, 2005; Reis-Filho et al, 2006b).

Chromosomal comparative genomic hybridisation (cCGH) has been widely used to analyse the pattern of unbalanced genomic aberrations in breast cancer (Roylance et al, 1999; Buerger et al, 1999a, 1999b; Albertson, 2003; O’Connell, 2003; Reis-Filho et al, 2005a). More recent studies have employed higher resolution array-based CGH (aCGH) showing the enormous complexity of breast cancer genomes. Nonetheless, these studies have consistently reported the same regions of gain (1q, 8q, 11q, 17q, 20q) and loss (8p, 9p, 13q, 16q) (Rennstam et al, 2003; Loo et al, 2004; Naylor et al, 2005; Nessling et al, 2005). Unbalanced chromosomal aberrations and changes in gene copy numbers have been reported as a mechanism for acquired drug resistance to chemotherapy in cell line studies (Leyland-Jones et al, 1999; Yasui et al, 2004). However, no genome-wide analysis addressing response prediction by aCGH in breast cancer patients has been published.

The primary aim of this study was to determine whether aCGH could be used to identify unbalanced genomic changes predictive of response to preoperative doxorubicin and cyclophosphamide (AC) combination. We also sought to investigate the hypotheses that resistance to chemotherapy could be driven either by selection of chemotherapy resistant populations of neoplastic cells or by the induction of specific genomic aberrations.

Materials and methods

Patients and samples

Patients were recruited and treated at the Royal Marsden Hospital (Sutton and London, UK). Eligible patients were those undergoing neoadjuvant adriamycin and AC chemotherapy treatment at doses of 60 and 600 mg m−2, respectively, three weekly, for a clinically measurable breast cancer. Approval by the Royal Marsden Hospital Clinical Research and Ethics committees was granted and written consent was obtained in all cases. Patients were offered neoadjuvant treatment for one of several standard indications including locally advanced or inflammatory breast cancer, high tumour to breast size ratio and tumours located close to the nipple. The majority of the patients were from a previously published study on RNA expression profiling (Cleator et al, 2006).

Diagnosis was confirmed on core-cut biopsies subjected to routine histological analysis. Patients who demonstrated at least a partial clinical response received six cycles of treatment before local treatment. Patients with no or only marginal response after three or four cycles, proceeded directly to local treatment or were commenced on alternative systemic treatment (docetaxel).

Clinical size of tumour was recorded as the largest diameter and a diameter perpendicular to this. This was recorded before commencement and at completion of treatment. Clinical response was categorised as follows: no palpable abnormality after treatment, complete clinical response (cCR); >50% reduction in the product of the bidimensional measurements, partial response (PR); <50% reduction in the product of bidimensional measurements was recorded as no change (NC); residual ill-defined thickening after a good response, minimal residual disease (MRD) (Cleator et al, 2006). No patients in the study demonstrated progressive disease, an increase of more than 25% in the product of bidimensional measurements. Those cases in which there was no residual invasive carcinoma at surgery were classified as a complete pathological response (pCR). Good responders were defined as pCR, cCR, MRD; poor responders were defined as PR or NC. Eight patients undergoing a complete clinical and radiological (on ultrasound) response received radiation only as local treatment. Therefore some of the cCRs may represent undocumented pCRs.

Research 14-gauge core biopsies were collected before commencing treatment and again at 21 days after first treatment. Samples were snap frozen in liquid nitrogen. All samples were thereafter coded using a study number as an identifier. Frozen cores were embedded in optimum cutting temperature embedding compound and sectioned at −20°C in a cryostat. Sections (5 μm thickness) were taken for hematoxylin and eosin staining to assess histological character superficially from the core as soon as ‘full-face’ was reached. The percentage of cells comprising invasive malignant disease was recorded. For patients in whom multiple biopsies were available, that with the highest invasive content was used for microarray analysis. Samples with less than 20% invasive cancer content were excluded from the study. Eight micrometer thick, nuclear fast red-stained representative frozen sections from 44 patients with primary breast cancer obtained before a neoadjuvant combination of AC were microdissected. In 22 patients consent for a second research core needle biopsy was granted and the procedure was technically feasible; in 17 of these, samples obtained 3 weeks after first course of chemotherapy (before cycle two) contained proportions of neoplastic cells suitable for microdissection (i.e. >20% of neoplastic cells). Twenty-one surgical formalin-fixed paraffin-embedded resection samples taken after 4–6 cycles of AC and with sufficient amounts of residual neoplastic cells were retrieved from the pathology files of the Royal Marsden Hospital.

Fluorescent in situ hybridisation

Fluorescent in situ hybridisation (FISH) analysis was performed on representative 4-μm sections using PathVysion HER-2 DNA probe mixture containing a HER-2 DNA probe (190 kb Spectrum Orange directly labelled DNA probe) and the CEP 17 DNA probe (5.4 kb Spectrum Green directly labelled fluorescent DNA probe specific for the chromosome 17 α-satellite DNA sequence) as described previously (Dowsett et al, 2003). A total of 60 cells were scored for red and green signals for each section and results were expressed as a ratio of red to green signals, with a ratio 2.0 being considered amplified (Dowsett et al, 2003).

Microdissection and DNA extraction

Frozen and formalin-fixed samples were subjected to aCGH analysis after microdissection of neoplastic cells with a sterile needle under a stereomicroscope (Olympus SZ61) from one to three consecutive 8 μm nuclear fast red-stained sections (median=1). Estimated purity of tumour cells ranged from 75 to 100% (median=90%) (Supplementary Figure 1). DNA was extracted according to standard methods (Reis-Filho et al, 2005b). The DNA yield and purity was assessed by spectrophotometry and the DNA quality (DNA fragment size range) by agarose gel electrophoresis as described previously (Reis-Filho et al, 2005b).

Microarray-based comparative genomic hybridisation

Microarray-based CGH was performed with in-house produced microarrays containing 5.8K bacterial artificial chromosome (BAC) clones, spaced out throughout the genome in 0.9 Mb intervals. The Breakthrough Breast Cancer Research Centre CGH microarray is composed of the same BAC clones as described by Reis-Filho et al (2005b).

Mapping of the BAC clones was retrieved from public sources and positioned according to the May 2004 build of the human genome sequence (hg17). When genomic positioning was dubious or conflicting, BAC end pair sequencing and FISH mapping was performed. Clones that either (i) showed poor quality end sequences or (ii) hybridised to multiple chromosomal locations or to a cytogenetic location inconsistent with their position in the sequence assembly were excluded from analysis. Bacterial artificial chromosome clones were spotted in triplicate onto Corning GAPSII-coated glass slides (Corning, NY, USA). Labelling of 250 ng of non-amplified DNA obtained from microdissected frozen sections or 1000 ng of DNA retrieved from microdissected formalin-fixed paraffin-embedded tissue sections, hybridisation and washes were carried out essentially as described previously (Reis-Filho et al, 2005b, 2006a; Natrajan et al, 2006).

Arrays were scanned with a GenePix 4000A scanner (Axon Instruments Inc., Union City, CA, USA); fluorescence data were processed with GenePix 4.1 image analysis software (Axon Instruments Inc.) as described previously (Reis-Filho et al, 2005b, 2006a; Natrajan et al, 2006).

Data analysis

The log2 ratios were normalised for spatial and intensity-dependent biases using a two-dimensional Loess local regression. Experiment replicates (‘dye-swaps’) were collated, BAC clone replicate spots averaged, and clones excluded with poor reproducibility between replicates (s.d. >0.2). Samples with >70% missing/poor values were excluded, as were clones with missing/poor values in >70% samples. Missing data were imputed by k-nearest neighbours (Troyanskaya et al, 2001) and clones with no mapping information (May 2004 build of the human genome sequence, hg17) were excluded. A total of 5001 different BAC clones were used in the final analysis.

Statistical analysis

All data transformation and statistical analysis were carried out in R 2.0.1 (http://www.r-project.org/), BioConductor 1.5 (http://www.bioconductor.org/), making extensive use of modified versions of the package aCGH in particular (Paris et al, 2004; Natrajan et al, 2006) and S-Plus (version 6.2.1). The log2 ratio of each BAC clone in each tumour sample was centred by the median of each case. Thresholds for determining copy number gains and losses were determined as exceeding ±3.0 × s.d. of the mean of these control experiments (log2 ratio of ±1). For identification of DNA copy number alterations, data were smoothed using a local polynomial adaptive weights smoothing procedure for regression problems with additive errors (Hupe et al, 2004; Natrajan et al, 2006).

Associations between genomic loci were assessed by calculating Pearson's correlations between thresholded values for each clone, assigned as 1, 0 or −1 for gain, NC, or loss in copy number. Thresholded data for each clone were also used for categorical analysis using a Fisher's exact test adjusted for multiple-testing with a correction for multiple testing using the step-down permutation procedure maxT, providing strong control of the family-wise type I error rate.

Results

Patient, tumour and biopsy characteristics

A total of 44 patients were included in the study. Patient and tumour characteristics are summarised in Table 1 according to clinical response. Median tumour diameter was 4 cm (range 2–10). Two patients had inflammatory breast cancer (T4d). Median follow-up was 24 months (range, 9–38 months).

Table 1 Patient and tumour characteristics

Of the 44 patients, 24 (55%) demonstrated a ‘good’ and 20 (45%) a ‘poor’ clinical response. Four patients (9%) were documented as undergoing a pathological Complete Response (pCR), 17 (37%) underwent a cCR and three (7%) had ill-defined thickening (MRD) at end of treatment; eight (19%) had a PR and 12 (28%) showed NC.

Validation of aCGH method

DNA was extracted from 61 frozen samples. The median DNA yield obtained per sample was 660 ng (range 152–3188 ng). Owing to constraints in amount of tumour material available from frozen core biopsies after manual dissection, we assessed the reproducibility and quality of profiles obtained with 250, 500 and 1000 ng of tumour DNA. All DNA concentrations showed optimal results, with the Pearson's coefficient of correlation ranging from 0.93 to 0.96 (data not shown). Biological replicates of aCGH experiments also demonstrated high reproducibility: two independent research core biopsies were obtained before neoadjuvant chemotherapy for five patients. DNA extraction and CGH arrays were performed separately for these paired samples. For each of the matched pairs, the Pearson's correlation coefficient ranged from 0.64 to 0.86 (mean 0.78, data not shown). In six cases, HER2 amplification was defined by aCGH. Fluorescent in situ hybridisation analysis confirmed the results in all cases, providing further evidence to support the validity of the aCGH analysis methods employed in this study (data not shown).

For six cases, high-resolution cCGH was performed and the genetic profiles compared with those obtained with aCGH. Correlation was good-to-excellent, with correlations for low-level gains and deletions >20 Mb and any amplification ranging from 60 to 87.5% (median=79.5%, mean=77.8%, data not shown). However, aCGH was more sensitive in detecting small losses and low-level gains than HR-CGH.

Genomic alterations in 44 pre-chemotherapy breast cancer samples

To identify genomic regions harbouring recurrent unbalanced genomic changes, we plotted the frequency of tumours showing gain or loss for each BAC across the genome (Figure 1A). The most frequent (>30%) genomic changes (Table 2) comprised gains of 1q (66%), 5p (32%), 8q (70%), 16p (36%) and 20q (41%) with the smallest regions of genomic gain on 1q31.1–1q31.2 and 1q22–1q25.3, 5p15.31–5p15.33, 8q23.1–8q25.1, 16p11.2–16p12.2 and 20q13.13–20q13.33, respectively. Losses were observed on 4q (39%), 8p (50%), 9p (36%), 11q (32%), 13q (36%), 16q (52%) 17p (50%) and 18q (39%) with the smallest regions of deletion on 4q32.3–4q33, 8p21.1–p23.3, 9p22.2–9p24.3, 11q23.1–11q25, 13q14.11–13q14.3, 16q23.1–q24.3, 17p12–17p 13.3 and 18q22.1–q23. In addition to the large regional alterations, the resolution of the BAC array allowed us to map smaller regions of gain or loss. Bacterial artificial chromosome clones gained or deleted in >30% of the tumours are described in Supplementary Tables 1 and 2, respectively.

Figure 1
figure 1

Frequency of copy number changes in 44 invasive breast carcinomas. (A) Overall frequency of DNA copy number alterations found in 44 invasive breast carcinomas as defined by aCGH. The proportion of tumours in which each clone is gained (green bars) or lost (red bars) is plotted (y axis) for each BAC clone according to genomic location (x axis). Vertical dotted lines represent chromosome centromeres. (B) – Pearson's correlation matrix of all thresholded aCGH data for 29 ER-positive, invasive breast carcinomas. Strong positive correlations are plotted as dark red, strong negative correlations as dark blue. Note the associations between chromosomes 1p/8p, 1p/12q, 4q/8p, 7p/22, 8q/12q and 11q/12q. Bacterial artificial chromosome clones are plotted in genome order. (C) Pearson's correlation matrix of all thresholded aCGH data for 15 ER-negative, invasive breast carcinomas. Strong direct plotted as dark red, strong negative correlations as dark blue. Note the associations between chromosomes 1p and 7q, 16p and 17q, and strong inverse correlations between 1q and 14q, 2q and 9q, 4q and 15q, 6q and 11q, 7p and 11q, 7p and 12q, 9p and 16p, and 13q and 15q. Bacterial artificial chromosome clones are plotted in genome order.

Table 2 Recurrent gains and losses of genomic material in >30% of the samples

Comparison of genomic alterations in different phenotypes of breast cancer

On the basis of three distinguishing phenotypic characteristics (ER, HER2 and histological type), we characterised genetic alterations that might be associated with subtypes of breast cancer on the 44 pretreatment biopsies. Oestrogen receptor (ER)-negative tumours (N=15) were characterised by significantly more chromosomal changes than ER-positive tumours (Table 3 and Supplementary Figure 2). Subgroup analysis showed that the following chromosomal changes were more frequently associated with ER negativity: gains on 1p, 2p, 2q, 6q, 7q, 9p, 11q and 19q with large regions on 1p31–1p34.2, 7q32–7q36 and losses on 4p, 4q, 5p, 5q, 12q and 17q as well as large regions such as the whole long arm of chromosome 5 and whole chromosome 4.

Table 3 Copy number changes significantly more prevalent in ER-negative compared to ER-positive tumours

Pearson's correlation analysis of the subset of ER-positive tumours (n=29) revealed chromosomal alterations that coexist in this group of tumours. The heat map in Figure 1B shows regions of positive (change in same direction: red) and negative (change in opposite direction: blue) correlations. Relatively large regions of correlation were seen between changes in copy number of BACs on chromosome 1p and 8p (P<0.01), 3p and 4p (P<0.001), 4q and 8q (P<0.01) and 8q and 12q (P<0.01) were observed, as were strong inverse correlations between 1p and 12q (P<0.0005), 7p and 22 (P<0.001) and 11q and 12q (P<0.001) (Figure 1B). Pearson's correlation matrix of ER-negative tumours revealed strong direct associations between 1p and 7q (P<0.001), 16p and 17q (P<0.001), and strong inverse correlations between 1q and 14q (P<0.005), 2q and 9q (P<0.001), 4q and 15q (P<0.001), 6q and 11q (P<0.001), 7p and 11q (P<0.001), 7p and 12q (P<0.01), 9p and 16p (P<0.001) and 13q and 15q (P<0.005) (Figure 1C).

Comparing invasive lobular carcinomas (n=7) with non-lobular cases (37 cases), 16q loss (from 16q12.1 to 16q24.3) was significantly associated with lobular phenotype (unadjusted Fisher's exact test <0.01) (Supplementary Table 3 and Supplementary Figure 3). This region contains a number of cadherin genes (CDH1, CDH3, CDH5, CDH 8, CDH 11). Gain of 8q21.11–q24.23 was significantly associated with the ductal phenotype (Supplementary Table 3 and Supplementary Figure 3).

Comparison between the six cases with HER2 amplification on 17q11.2 with the 38 HER2-negative cases revealed more frequent gains of 6p25.2–p24.2, 17q12 and Xp22.33–p11.36, and loss of 18p11.31 (Supplementary Table 3 and Supplementary Figure 4). On the other hand, loss of 16q23.3–q24.1 was more frequently observed in HER2 – tumours. This would be expected given that all HER2+cases were of histological grade III and loss of 16q is significantly less frequently found in this group of tumours (Reis-Filho et al, 2005a; Simpson et al, 2005).

Prediction of tumour response

Comparison of the molecular genetic profiles of objective clinical responders (complete and partial clinical responders, N=24) with those of non-responders revealed loss of a 13.3 Mb region on 13q31.1–13q32.2 to be the only significant difference (Figure 2). This region includes the genes SLITRK6, SLITRK5, GPC5, GPC6, DCT, TGDS, SOX21, ABCC4, CLDN10, DZIP1, DNAJC3, UGCGL2, HS6ST3, HSP90AB6P, OXGR1, MBNL2 and RAP2A. In addition, this region also encompasses the micro RNA cluster miR-17-92, which is reported to induce tumour cell growth and to be overexpressed and sometimes amplified in aggressive forms of lung cancer (Hayashita et al, 2005).

Figure 2
figure 2

Categorical analysis of copy number gains and losses between non-responders (N=20) and responders (N=24). Fisher's exact tests are carried out on the segmented values for each clone, and those with a P-value of less than 0.01 are plotted (inverse log10, y axis) according to genomic location (x axis).

Comparison of pre-chemotherapy samples to D21 samples, before cycle two of AC

Unsupervised hierarchical clustering analysis based upon genetic alterations on all chromosomes showed that all pre- and D21 post-biopsies clustered together (N=17 cases) (Figure 3A). There were no significant differences in the profiles between poor (10 cases) and good responders (seven cases) (P=0.95, Kolmogorov–Smirnov test). The comparison of the pattern and frequency of unbalanced genomic changes detected by aCGH analysis in matched pretreatment and D21 biopsies revealed no significant differences (multi-Fisher's exact test, data not shown).

Figure 3
figure 3

Hierarchical clustering analysis (Ward's method/Euclidean distance) of matched pre- and post-chemotherapy samples (17 patients) (A) and pre-chemotherapy, post-chemotherapy and surgical samples (12 patients) (B). (b – before chemotherapy; a – after chemotherapy (day 21); s – surgical biopsy).

Comparison of pre-chemotherapy samples to surgical samples, after 4–6 cycles of AC

Twenty-one surgical samples out of 44 cases were available (48%). All but three segregated with the respective pre-chemotherapy sample by unsupervised hierarchical clustering (Figure 3B). For 12 cases, pre-, D21 and at surgery samples were available. Unsupervised hierarchical clustering analysis based upon genetic alterations on all chromosomes was performed. All 12 ‘triplicates’ clustered together.

Comparison between the genomic profiles obtained with DNA extracted from matched frozen and formalin-fixed pretreatment core biopsies were performed (n=2). The profiles showed a good concordance (r2>0.8). Subsequently, a comparison of the aCGH profiles of 21-matched pretreatment core biopsies and the respective excision specimens obtained after 4 months of neoadjuvant chemotherapy revealed the presence of gains of 4q13.1 (9/21, 43%), 11p15.2–p15.5 (14/21, 66%), 12q13.3 (14/21, 66%), 18p11.21 (12/21, 52%) and 19q13.2 (12/21 52%) in the excision specimens (Figure 4 and Supplementary Table 4). After FDR adjustment, only the gain on 11p15.2–p15.5 retained its statistical significance. Although we cannot completely rule out that this change could result from the comparison between DNA extracted from frozen and formalin-fixed samples, this 14.3 Mb telomeric region encompasses interesting oncogene candidates, such as H-RAS, a bona fide oncogene, and IGF2, which has been reported to show loss of imprinting in up to 30–60% of breast cancers (McCann et al, 1996; Wu et al, 1997). In the pre-chemotherapy samples, gain on 11p15.2–p15.5 was observed in five out of 10 good responders and in two out of 11 poor responders (not significant, Fisher's exact test).

Figure 4
figure 4

Categorical analysis of copy number gains and losses between matched pre-chemotherapy and surgical tumour samples (21 patients). Fisher's exact tests are carried out on the segmented values for each clone, and those with a P-value of less than 0.01 are plotted (inverse log10, y axis) according to genomic location (x axis).

Discussion

In the present study, the frequency and pattern of unbalanced genomic aberrations were similar to those described in previous studies where genome-wide changes were analysed by means of chromosomal CGH (Tirkkonen et al, 1998; Buerger et al, 1999a, 1999b; Roylance et al, 1999; Seute et al, 2001; Rennstam et al, 2003) and aCGH (Callagy et al, 2005; Naylor et al, 2005; Nessling et al, 2005). Recurrent gains on chromosome 1q, 8q, 11q, 17q and 20q and losses on 6q, 8p, 9p, 13q and 16q were the most prevalent changes. In addition, we confirmed the association between ER positivity and gain of 1q coupled with loss of 16q (Farabegoli et al, 2004; Reis-Filho et al, 2005a; Simpson et al, 2005) and the more prevalent deletions of 4p16 and 4p15, 5q and 17p11.2 in ER-negative tumours (Loo et al, 2004). In contrast to previous studies (Loo et al, 2004), gains of 8q24.1 (MYC) and 17q12 (HER2) were not significantly more frequent in ER-positive tumours. This is expected, given that only breast carcinomas of histological grades 2 and 3 were present in the population and gains of 8q are rather frequent in grade 2 and 3 breast cancers (Buerger et al, 1999a, 1999b; Roylance et al, 1999).

The comparison between the genomic profiles obtained for ductal and lobular carcinomas were also in agreement with previous studies (Buerger et al, 1999b; Shelley Hwang et al, 2004; Reis-Filho et al, 2005a; Simpson et al, 2005; Stange et al, 2006): gain of 1q and deletions of 16q were the most prevalent changes in lobular carcinomas, whereas gain of 8q was significantly more frequent in grade 2 and 3 ductal carcinomas. However, we could define the smallest region of overlap of the deletions of 16q, which mapped to 16q21–q22.1 and encompassed the region of the cadherin gene cluster, and the gain of 8q, which encompassed two regions 8q13.2–q21.13 and 8q21.3–qtel (Supplementary Table 3). The most significant gain comparing ductal carcinoma with lobular carcinomas mapped to 8q24.11 (117.8–118.0 Mb), which encompasses RAD21 and eucaryotic translation initiation factor 3, subunit 3 gamma (EIF3S3). RAD21 is believed to function in sister chromatid alignment as part of the cohesin complex and also in double-strand break repair and influences cellular proliferation (Atienza et al, 2005), whereas EIF3S3 is reported to be amplified and overexpressed in up to 20% of breast carcinomas (Nupponen et al, 1999). Our observations are consistent with previous studies on breast cancer, confirming the robustness of our aCGH protocol, the validity of our analysis method and the likelihood that we have a representative set of tumours.

Several attempts have been made to predict clinical or pathological response to neoadjuvant chemotherapy in breast cancer using gene expression arrays. Gene ‘signatures’ or predictors have been devised for several chemotherapy regimens, including paclitaxel followed by fluorouracil, AC, AC/doxorubicin-docetaxel and taxane only chemotherapy (Chang et al, 2003, 2005b; Ayers et al, 2004; Hannemann et al, 2005; Cleator et al, 2006). Although these results are promising, the exceedingly small sample size and limitations with the current technology and analysis methods have so far precluded definitive conclusions (Brenton et al, 2005; Reis-Filho et al, 2006b).

We have recently reported a similar expression profiling study (Cleator et al, 2006) that included the 44 patients studied here plus a small number of others for whom pretreatment tissue for DNA analysis was not available. Neither unsupervised nor supervised methods could separate the responders from non-responders. In the current study, using the same cohort of patients, which included 24 good clinical responders, aCGH analysis revealed a deletion of a large region from 13q31.1 to 13q34 as the only significant copy number change associated with response to chemotherapy. Although this statistical association lost its significance after correction for multiple comparisons, the correction method, which we adopted may be too conservative, given that unbalanced chromosomal aberrations usually encompass more than one BAC clone. One of the genes deleted in this region is a subtype of heat-shock protein 90, HSP90AB6. HSP 90 is a molecular chaperone whose association is required for stability and function of multiple signalling proteins that promote cancer cell growth and/or survival (Chen et al, 2005). Further studies with a larger sample size focusing on this particular region are warranted. We cannot exclude the possibility that analysis similar to ours of a larger series of samples might identify other DNA aberrations of importance for response to AC chemotherapy. It is however likely that if these were present in a large proportion of the population they would have been uncovered by the present analysis.

Sequential sampling of tumour during neoadjuvant treatment can be used to detect gene expression modification induced by therapy as we have demonstrated with endocrine therapy (Mackay et al, 2005). This was demonstrated for chemotherapy by Hannemann et al (2005), who observed that tumours that responded to neoadjuvant chemotherapy showed dramatic changes in their expression profiles when compared to the changes observed in non-responders (Hannemann et al, 2005). On the other hand, a comparison between the transcriptomic profiles of tumours subjected to taxane-based neoadjuvant chemotherapy before and 3 months after treatment revealed strikingly different patterns, independent of initial sensitivity or resistance (Chang et al, 2005b).

Cell line studies have demonstrated that changes in gene copy numbers may lead to acquired resistance to chemotherapy (Leyland-Jones et al, 1999; Shimizu et al, 2002; Yasui et al, 2004). Specific gains of genetic material mapping to multi-drug resistance (MDR) gene MDR1 locus have been reported in drug resistant cell lines (Shimizu et al, 2002; Kuwano et al, 2003). Recurring amplicon 7q11.2–q21 identified by CGH in doxorubicin-resistant hepatocellular carcinoma cell lines coincided with the localisation of MDR1 (Pang et al, 2005). Induction of DNA damage response genes such as p21 were obtained by doxorubicin in breast cancer cell lines (Troester et al, 2004). Both chromosome 7 alterations and several cytogenetic changes involving the 7q21 locus are associated with the development of MDR in sarcoma cells (Chen et al, 2002). Analysis of genomic amplifications and deletions revealed specific genetic alterations common to both intrinsic and acquired doxorubicin resistance including ABCB1, PGY3 (ABCB4) and BAK (Turton et al, 2001).

Most patients show some tumour shrinkage with neoadjuvant chemotherapy and for many there may be profound loss of malignant cells with some showing a pCR. Characterisation of the residual cells may be expected to a better understanding of the causes of resistance and allow the identification of the means to overcome the resistance. Given the molecular heterogeneity between breast carcinomas and the composition of most breast cancers being of multiple non-modal clones, we hypothesised that the cell loss might lead to selection of breast cancer cell lineages that were resistant to chemotherapy by virtue of specific DNA alterations. Few studies have previously addressed this issue. A very small study, involving CGH of just four tumours from breast cancer patients after neoadjuvant chemotherapy (Fazeny-Dorner et al, 2003) showed typical DNA imbalances for ductal breast cancer. Three patients showed involvement of several regions bearing genes of drug resistance (MDR1, BCRP, MRP1, RFC1); the fourth patient displayed an amplification in the region of MYC. In our study, the hierarchical clustering of sequential samples at baseline, 3 weeks and at surgery indicated that any changes induced in DNA profiles by therapy were modest compared with the differences that were present between the patients. This interpretation was supported by the observation that statistically significant chemotherapy-driven genomic changes were not detected within 3 weeks (i.e. after a single course of chemotherapy). Comparison of the pretreatment and the tumour specimens excised after 4 months of neoadjuvant chemotherapy revealed the appearance of genomic gains on 11p15.2–p15.5. Given the experimental design of our study, we cannot define whether these amplifications were chemotherapy induced or were present in a non-modal population of the primary tumour and selected by chemotherapy, as the majority of surgical samples by definition were obtained from poor responders (66%). Interestingly, pathological relaxation of the imprinting pattern in this region is reported to be found in 30–60% of breast cancers (McCann et al, 1996; Wu et al, 1997). However we did not find a correlation between amplification on 11p15.5 and clinical response (Han et al, 2006). Furthermore, these changes on surgical samples, need to be cautiously interpreted as surgical samples were subjected to different fixation conditions (Devries et al, 2005). Thus in contrast to observations with cancer cell lines, we did not find large or frequent chemotherapy-induced acquired genomic changes. Given that aCGH provides an average of the pattern of genomic gains and losses in the cell population studied, chemotherapy-driven losses or low-level gains occurring in non-modal clones are unlikely to be detected.

In conclusion, array CGH is a powerful method for the genome-wide detection of chromosomal imbalances and allowed us to detect molecular genetic aberrations associated with specific breast cancer subgroups (ductal vs lobular, ER-negative vs ER-positive tumours). A molecular genetic profile specific of good responders to neoadjuvant chemotherapy was not detectable in our series. Chemotherapy-driven genomic changes were not detected following 3 weeks of treatment and only a single change after completion of treatment. The hypothesis of resistance to neoadjuvant chemotherapy by the selection of non-modal cell lineages, which differ by gene amplifications or losses is not supported by our results.