Introduction

To date, the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has infected over 46 million people in the United States, resulting in over 747,000 deaths.1 Although COVID-19, the disease caused by SARS-CoV-2, is typically mild in children compared with adults, severe disease and death have been reported in newborns, infants, and young children.2,3 Multisystem inflammatory syndrome in children can occur even after the resolution of infection and has disproportionately affected ethnic minority children.4 Therefore, protecting the vulnerable infant and toddler population from SARS-CoV-2 is critical. One potential mechanism for protection is through passive immunity via breastfeeding from a mother previously infected with SARS-CoV-2.

Breast milk contains antibodies in response to infections that provide passive immunity, along with other bioactive factors such as lactoferrin.5,6,7,8,9 Exclusive breastfeeding substantially reduces respiratory illness in infancy and beyond.10,11,12,13 Continuous consumption of human milk containing pathogen-specific IgA coats the mouth, throat, and gut, preventing the establishment of infection, and is a likely component in the mechanism of protection.14 Approximately 90% of the antibodies found in breast milk are IgA and 8% are IgM, predominately in secretory form (sIgA/sIgM), which helps protect the antibodies from the harsh environments of the infant mouth and gut. The remaining 2% of antibodies are IgG, which are derived from serum.15

Early in the pandemic, the presence of both IgA and IgG antibodies to the SARS-CoV-2 spike (S) protein, the receptor-binding domain (RBD) of S and the nucleocapsid (N) protein was confirmed in breast milk from two previously infected women.16,17 Similarly, milk from 12 of 15 infected women contained RBD-specific IgA.18 The largest study to date examined 37 breast milk samples from 18 infected women and found that 76% of the milk samples contained SARS-CoV-2-specific IgA and 80% had IgG.19 In addition, the concentrations of SARS-CoV-2 IgA were consistently higher than IgG, which confirms the earlier report of Favara et al.17,19 Gao et al. were the first to report the presence of SARS-CoV-2 IgM in milk samples from 3 of 4 infected women.20 Antibody profiles in women with varying severity of COVID-19 symptoms have not been reported. The primary objectives of this study were to establish the presence of SARS-CoV-2-specific IgA and IgG and to characterize the antigenic regions of SARS-CoV-2 proteins that react with breast milk antibodies from women with confirmed SARS-CoV-2 infection.

Methods

Participants and breast milk sample collection

Breast milk samples and clinical information were obtained from women participating in the Mommy’s Milk Human Milk Biorepository at the University of California, San Diego. Women residing in the US who had a confirmed SARS-CoV-2 infection by RT-PCR were enrolled. Demographics, health history, illness and exposure dates, symptoms, and SARS-CoV-2 test results were collected by telephone interview. Participants self-collected breast milk samples using a provided collection kit including instructions for expressing and storing samples. Participants who had recovered from their illness at the time of the study interview were asked to ship any frozen samples previously collected at the peak of their symptoms in addition to a fresh milk sample. Fresh samples were shipped on ice within 24 h of collection to the Biorepository and stored at −80 °C prior to shipment on dry ice to Antigen Discovery, Inc.

Protein microarray analysis of breast milk samples

A multi-coronavirus protein microarray, produced by Antigen Discovery, Inc. (ADI, Irvine, CA, USA), included 935 full-length proteins, overlapping protein fragments and overlapping 13–20 aa peptides from SARS-CoV-2 (WA-1), SARS-CoV, Middle East respiratory syndrome coronavirus (MERS-CoV), human coronavirus (HCoV)-NL63 and HCoV-OC43. Proteins were expressed using an Escherichia coli in vitro transcription and translation (IVTT) system (Rapid Translation System, Biotechrabbit, Berlin, Germany). Included on the array were four structural proteins and five accessory proteins of SARS-CoV-2: spike (S, divided into S1 and S2 regions), envelope (E), membrane (M), nucleoprotein (N), open reading frames (ORFs) 3a, 6, 7a, 8 and 10. Fragments of these nine proteins were made through IVTT in 50% overlapping segments of 30 aa, 50 aa, and 100 aa. There were also structural proteins produced by IVTT for MERS-CoV, HCoV-NL63, and HCoV-OC43. Full-length SARS-CoV-2 S protein and the RBD were included as purified proteins, plus overlapping 13–20 aa peptides of the SARS-CoV (2002 SARS epidemic) structural proteins and the S proteins of MERS-CoV, HCoV-NL63 and HCoV-OC43 (Supplementary Table S1). Purified proteins and peptides were obtained from BEI Resources. SARS-CoV-2 and SARS-CoV S proteins were made in Sf9 insect cells, and SARS-CoV-2 RBD was made in HEK-293 cells. IVTT proteins, purified proteins, and peptides were printed onto microarray slides and probed with whole breast milk samples at a 1:15 dilution for detection of IgA binding, as previously described and detailed in the Supplementary materials.21 The most IgA-reactive breast milk sample per patient was also probed at a 1:5 dilution for detection of IgG binding.
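The 50% overlapping fragment design described above can be illustrated with a short sketch. This is not the code used to design the array; the function name `tile_fragments`, the example protein length, and the handling of the final (possibly shorter) C-terminal fragment are assumptions made for illustration only.

```python
def tile_fragments(protein_length, fragment_length):
    """Return 1-based inclusive (start, end) coordinates of fragments
    that overlap each neighbour by 50% of the fragment length.

    Assumption: the last fragment is truncated at the C-terminus rather
    than shifted or dropped; the source does not say how this was handled.
    """
    if protein_length <= 0:
        raise ValueError("protein_length must be positive")
    if fragment_length <= 0 or fragment_length % 2:
        raise ValueError("fragment_length must be a positive even number")

    step = fragment_length // 2  # a 50% overlap means stepping half a fragment
    fragments = []
    start = 1
    while start <= protein_length:
        end = min(start + fragment_length - 1, protein_length)
        fragments.append((start, end))
        if end == protein_length:
            break
        start += step
    return fragments


# Hypothetical 300 aa protein tiled with 100 aa fragments.
print(tile_fragments(300, 100))
```

Under these assumptions, 100 aa fragments start at aa 1, 51, 101, and so on, which matches the fragment coordinates quoted in the Results (for example aa 1–100 and aa 51–150).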

ECLIA analysis of SARS-CoV-2-specific IgA and IgG in breast milk

V-PLEX COVID-19 Coronavirus Panel 2 multiplex electrochemiluminescence immune assay (ECLIA) kits were purchased from Meso Scale Discovery (Rockville, MD) to measure IgA and IgG antibodies against four SARS-CoV-2 antigens in whole breast milk samples. The antigens included were S, RBD, the N-terminal domain (NTD) of S, and N. Whole breast milk samples were diluted 1:10 and 1:100 for the IgG and IgA assays, respectively. Details of the assay are included in the Supplementary materials.

Statistics

Maternal and infant characteristics were presented as means and standard deviations. Categorical variables were expressed as counts and percentages. Missing values were excluded. R version 4.1.0 was used for the description of maternal and child characteristics.

For protein array and ECLIA results, “reactive antigens” were defined post hoc, after observing the heterogeneity in mothers’ responses and modeling negative and positive signal distributions using mixture models (Supplementary Fig. S1).22 IgA reactivity for IVTT cell-free expressed proteins was defined as a normalized signal intensity greater than 1.0, equivalent to two times background, in any sample from at least one study participant. IVTT IgG reactivity was defined as a normalized signal intensity greater than 2.0, or four times background. Reactivity cutoffs for array purified proteins, peptides, and ECLIA proteins for IgA and array peptides for IgG were established with the mixture models, whereas arbitrary cutoffs were set for array purified protein and ECLIA IgG signals (Supplementary Fig. S1D, H, respectively).
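The relationship between the IVTT cutoffs and fold-over-background can be made explicit with a minimal sketch. It assumes that the normalized signal intensity is a log2 ratio of raw signal to background, which is what the stated equivalences (1.0 for two times background, 2.0 for four times background) imply; the exact normalization is given in the Supplementary materials and may differ in detail. All names and numeric inputs are hypothetical.

```python
import math

# Cutoffs for IVTT proteins as stated in the text (normalized units).
IGA_IVTT_CUTOFF = 1.0  # two times background
IGG_IVTT_CUTOFF = 2.0  # four times background


def normalized_log2(raw_signal, background):
    """Assumed normalization: log2 of the signal-to-background ratio."""
    if raw_signal <= 0 or background <= 0:
        raise ValueError("signals must be positive")
    return math.log2(raw_signal / background)


def is_reactive(normalized_si, cutoff):
    """An antigen is called reactive when the signal is strictly above the cutoff."""
    return normalized_si > cutoff


# Hypothetical raw intensities, for illustration only.
si = normalized_log2(raw_signal=4500.0, background=1000.0)
print(round(si, 2), is_reactive(si, IGA_IVTT_CUTOFF), is_reactive(si, IGG_IVTT_CUTOFF))
```

Because the definition uses "greater than", a signal at exactly two times background (normalized value 1.0) is not counted as IgA-reactive in this sketch.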

Associations between clinical variables and SARS-CoV-2-specific IgA antibodies were assessed using multivariable linear mixed effects regression (LMER), modeling antibody responses to each SARS-CoV-2 protein or fragment with random intercepts at the subject level to adjust for repeated measures. LMER models were fit with clinical factors at the time of sample collection, including days since symptom onset, presence of COVID-19 symptoms, number of symptoms, number of days symptomatic, maternal age (years), and baby’s age (months), as fixed effects. All coefficients were returned from models fit using restricted maximum likelihood (REML). To generate P values for LMER models, the models were refit using maximum likelihood (ML) and compared by ANOVA against null models with the coefficient removed. For IgG responses, ordinary least squares models were fit with clinical factors, since a single sample for each study participant was assayed. P values were adjusted for the false discovery rate.23 Data visualization was performed using the circlize, ComplexHeatmap, and ggplot2 packages in R.24,25,26
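The P-value procedure above can be sketched in two steps. The analysis itself was run in R and its code is not shown, so the following Python is an illustration under stated assumptions: (1) the ANOVA comparison of two nested ML fits is treated as a likelihood-ratio test with one degree of freedom, because one coefficient is removed; (2) the false discovery rate adjustment is assumed to be the Benjamini-Hochberg step-up procedure, which the text does not name explicitly. The log-likelihoods and P values in the example are hypothetical.

```python
import math


def lrt_p_value(loglik_full, loglik_null):
    """Likelihood-ratio test for nested ML fits differing by one coefficient.

    For 1 degree of freedom the chi-square survival function is
    erfc(sqrt(x / 2)), so no external library is needed.
    """
    statistic = max(0.0, 2.0 * (loglik_full - loglik_null))
    return math.erfc(math.sqrt(statistic / 2.0))


def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted P values in the original order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, p_values[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


# Hypothetical log-likelihoods for one antigen and hypothetical raw P values.
print(lrt_p_value(loglik_full=-120.3, loglik_null=-123.1))
print(benjamini_hochberg([0.01, 0.04, 0.03, 0.005]))
```

In this scheme each antigen contributes one raw P value per clinical factor, and the adjustment is applied across antigens; how the authors grouped tests for adjustment is not stated in the text.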

Results

Maternal characteristics

Between March 14, 2020, and September 1, 2020, 21 women who had a confirmed SARS-CoV-2 infection by RT-PCR were enrolled. The majority of women were white (85.7%) and all were non-Hispanic (Tables 1, 2). Maternal age at enrollment averaged 34.49 years (SD 3.67) and child age at enrollment averaged 10.17 months (SD 5.45). Five of the women (23.8%) had a body mass index ≥30, and 9 women (42.9%) had underlying health conditions, including asthma, hypertension, diabetes, heart conditions, kidney conditions, hypothyroidism, hyperthyroidism, or irritable bowel disease. All women were symptomatic for COVID-19, and two were hospitalized. Women reported an average of 9.57 symptoms (SD 4.02), which lasted an average of 25.14 days (SD 15.82; range 10–91). Milk samples were collected at the time of onset of symptoms, and an additional one to twelve samples were taken at a range of 1–231 days post-onset of symptoms (mean number of samples collected per woman: 4.76 [SD 2.95; range 2–13]). Almost one-quarter of the breastfed infants (23.8%) had respiratory symptoms, including runny nose, congestion, and fever, but were not tested for SARS-CoV-2. These were the infants of mothers #22, #42, #48, #60, and #63; all recovered at home without medical treatment. Three (18.8%) asymptomatic infants were tested for SARS-CoV-2 and one child, the infant of mother #14, was positive.

Table 1 Characteristics of women who tested positive for SARS-CoV-2 and their breastfed children, N = 21 women and 22 children.
Table 2 Specialized profiles for women #14, 26, 52, 56 with unique antibody response to SARS-CoV-2.

Breastfeeding mothers vary in the production of milk IgA against SARS-CoV-2 proteins and recognition of specific antigenic regions

Breast milk samples tested on the multi-coronavirus protein microarray (Supplementary Table S1) and by ECLIA contained IgA reactive with a variety of SARS-CoV-2 antigens, as well as antigens from other HCoVs (Figs. 1 and 2). A total of 24 IVTT-expressed SARS-CoV-2 full-length or fragmented proteins had IgA reactivity above the reactivity threshold in at least one study patient. The most IgA-reactive SARS-CoV-2 proteins were N (9/21 responded to at least one N fragment) and S proteins (5/21 for S1 or S2), and one patient exhibited strong IgA reactivity with M protein. Seropositivity rates to purified recombinant proteins on the arrays and by ECLIA were higher than for IVTT proteins, likely due to their higher concentrations: 19/21 for N by ECLIA, 18/21 for S by both protein array and ECLIA, 2/21 for the RBD by array and 16/21 by ECLIA, and 12/21 for the NTD by ECLIA. One patient responded to ORF3a, and another patient responded to ORF7a (Fig. 2). IgG seropositivity rates for IVTT proteins and array purified proteins were similar to IgA, but with more N responders (12/21), whereas N responders by ECLIA were fewer than for IgA (12/21) (Supplementary Fig. S2). These profiles are similar to our recent observations of SARS-CoV-2-specific IgG in symptomatic COVID-19 patients, where most patients responded to S and N proteins with the exception of a minor subset of “serosilent” or “serodelayed” patients.21,27 Moreover, the levels of reactivity seen in this study are in agreement with recent studies of maternal serum and cord blood IgG in SARS-CoV-2 infected pregnant women, as well as the reactivity of breast milk IgA from SARS-CoV-2 infected mothers with SARS-CoV-2 N, S and RBD.19,28
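Responder counts such as "9/21 responded to at least one N fragment" follow from a simple rule: a mother counts as a responder if any of her samples exceeds the cutoff for any antigen in the group. A minimal sketch of that tally is given below. The data structure, subject labels, antigen names, and signal values are hypothetical and are not taken from the study data.

```python
def count_responders(samples, antigens, cutoff):
    """Count subjects with at least one sample above the cutoff for
    at least one of the listed antigens.

    samples: list of dicts with keys 'subject' and 'signals', where
             'signals' maps antigen name -> normalized signal intensity.
    Returns (number of responders, number of subjects).
    """
    subjects = set()
    responders = set()
    for sample in samples:
        subjects.add(sample["subject"])
        signals = sample["signals"]
        if any(signals.get(a, float("-inf")) > cutoff for a in antigens):
            responders.add(sample["subject"])
    return len(responders), len(subjects)


# Hypothetical longitudinal samples from three subjects.
toy_samples = [
    {"subject": "A", "signals": {"N_FL": 0.4, "N_251_350": 0.2}},
    {"subject": "A", "signals": {"N_FL": 1.6, "N_251_350": 0.3}},
    {"subject": "B", "signals": {"N_FL": 0.1, "N_251_350": 1.2}},
    {"subject": "C", "signals": {"N_FL": 0.9, "N_251_350": 0.5}},
]
print(count_responders(toy_samples, ["N_FL", "N_251_350"], cutoff=1.0))
```

With longitudinal sampling, this rule is equivalent to taking each subject's maximum signal across samples before applying the cutoff.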

Fig. 1: Reactivity of individual COVID-19 patient breast milk IgA to SARS-CoV-2 proteins.

a The circular graphic maps the amino acid (aa) position of SARS-CoV-2 fragments, showing a heatmap of antibody levels for each individual mother for overlapping regions of different aa length. Proteins are indicated outside the circle plot above an axis that shows aa positions from the N-terminus to C-terminus of each protein. The following line graph shows the sequence homology of other HCoVs with SARS-CoV-2 for each gene. The inner circular heatmap shows proteins and protein fragments produced in cell-free E. coli in vitro transcription and translation reactions by bars that represent length and position of each fragment in each protein. Full-length, 100 aa and 50 aa fragments are shown. Fragments of 30 aa size were mostly non-reactive and are not shown but are included in the full data sets (see Supplementary Data). Each fragment is drawn 21 times, once for each COVID-19 maternal patient ordered by subject ID number, and the colored bars show the normalized signal intensity (SI) of antibody binding to each fragment. Only one data point per subject, per fragment is shown, representing the maximum SI measured among all of their respective breast milk samples taken after onset of symptoms. IgA signal intensity is shown by color gradient (gray to red). Seroreactive regions of the proteins are highlighted by magenta outlines. The inner circle bands represent the responses to full-length purified recombinant S protein (shown crossing both S1 and S2 regions) and the receptor-binding domain (RBD) of S protein from the array. This is followed by responses acquired in the electrochemiluminescence assay (ECLIA) to the full-length S and N proteins, the N-terminal domain (NTD) of S protein and the RBD of S protein. b The zoomed cutout of the circle graphic includes additional labeling for clarity, including fragment length labels on the left of the heatmap and subject ID labels to the right.

Fig. 2: Heatmap depicting relative IgA antibody responses to SARS-CoV-2 as compared to other HCoVs and clinical data.

The heatmap presents the signals of antibody binding to individual proteins and protein fragments within the antigenic regions of SARS-CoV-2, as well as the full-length structural proteins of MERS-CoV, HCoV-NL63 and HCoV-OC43, for individual samples. Columns represent breast milk samples, and rows represent proteins or protein fragments: 26 SARS-CoV-2 proteins or fragments filtered for having a maximal normalized log2 signal intensity of at least 0.5 in one or more mother’s samples, and five proteins each of MERS-CoV, HCoV-OC43 and HCoV-NL63. Antibody signal intensity is shown on a color scale from gray to red. Log2 signal intensities from recombinant purified proteins on the array and log2 signal intensity from proteins assayed on the ECLIA platform are overlaid above the array cell-free expressed proteins and shown with independent gray-to-red color scales. Sample clinical information is overlaid above the heatmaps and includes categories at time of sampling for COVID-19 symptoms, number of symptoms, number of days symptomatic, baby’s symptoms (presence or absence of respiratory symptoms, or asymptomatic if testing positive for SARS-CoV-2 “+/Asymptomatic”), maternal age and baby age. Protein/fragment information is annotated to the left of the heatmaps and includes the virus, full-length protein name and the amino acid length of the protein fragments (“AA Length”, as full length, 100, 50, or 30 aa).

Responses to reactive regions of SARS-CoV-2 structural proteins were heterogeneous (Fig. 1). The C-terminal region of S1 spanning aa 551–650 was recognized by only mother #14 (Fig. 2). Mothers #47 and #52 had milk IgA that recognized the full-length S2 IVTT protein, but mother #47 reacted with a fragment spanning aa 1–100, whereas mother #52 reacted with a fragment spanning aa 51–150. Both mothers responded to what are likely unique epitopes within the first 150 aa of S2. Mother #14, however, responded to neither of the first 100 aa fragments of S2 nor the full-length IVTT protein, but responded specifically in the regions of aa 201–300 and aa 451–550. Mother #60 had unique IgA reactivity to the aa 401–500 fragment. For N protein, mother #14 responded to multiple fragments, whereas most N-seropositive mothers responded only to the full-length protein. Mother #26 had a seropositive IgA response only to the aa 251–350 fragment of N protein, which was non-reactive in all other mothers. The IgG levels to SARS-CoV-2 proteins were similarly heterogeneous (Supplementary Fig. S2). In another recent study, variation in the SARS-CoV-2 IgG and IgA responses in COVID-19 mRNA-vaccinated or infected pregnant and lactating women was higher in the milk than in the serum.29 Although antibodies were only tested against the RBD, the study by Collier et al., along with our recent profiling studies in COVID-19 patient sera, indicates that the mucosal response to SARS-CoV-2 in breast milk may be more variable than the systemic response measured in serum.27

Breastfeeding mothers vary in the production of milk IgA against proteins of other human coronaviruses

All 21 of the subjects we studied had IgA antibodies that were reactive with the N and/or S proteins of one or both endemic HCoVs, HCoV-NL63 and HCoV-OC43, but the reactivity was less than two-fold over background in twelve of the mothers (Fig. 2). Ten of the patients also showed IgA reactivity with MERS-CoV N and/or S proteins. Sixteen of the patients demonstrated a likely cross-reaction of antibodies directed at SARS-CoV-2 proteins with orthologous proteins of one or more of the other three coronaviruses on the array. This is indicated by stronger reactivity with the SARS-CoV-2 protein, particularly for patients 14, 56, 59, and 6, and, for patient 52, by the acquisition of strong immunity to SARS-CoV-2 midway through the time-course of sample acquisition with concomitant appearance of stronger reactivity with MERS-CoV, HCoV-NL63 and HCoV-OC43 antigens. In contrast, IgA from patients 11, 20, 24, and 26 exhibited strong reactivity with the N proteins of HCoV-NL63 and/or HCoV-OC43 without demonstrating strong reactivity with SARS-CoV-2 N. These patients likely had preexisting IgA directed to one or both HCoVs and had not yet responded strongly to SARS-CoV-2.

SARS-CoV-2 protein antibody levels were not associated with clinical factors

Our assessment of milk IgA associations with the clinical characteristics of the study patients was limited by the modest sample size of 21 women and the observed heterogeneity in IgA and IgG responses. Presence or absence of symptoms during sampling had no notable effect on IgA or IgG antibody levels (Supplementary Figs. S3 and S4). Outlier antigens had elevated IgA responses for days since onset of symptoms and days symptomatic that did not reach statistical significance after correction. These proteins included primarily fragments of S2 and N proteins, as well as the ECLIA RBD and NTD proteins (Supplementary Data). Maternal age had similar outliers, which were not significant before P-value adjustment. Other antigens, such as those associated with infant age, had low or negligible levels of IgA reactivity.

Discussion

Lactating COVID-19 patients exhibited diverse and unique IgA kinetic profiles over the course of follow-up since onset of symptoms (Fig. 3). Mother #14 had the highest IgA levels among all patients, and IgA levels against both protein array antigens and ECLIA proteins were high from the initial milk sample taken. Mother #14 was the only responder to the C-terminal region of S1 spanning aa 551–650, and she was also the strongest responder to the full-length N protein. This patient tested positive for SARS-CoV-2 near the end of her pregnancy and had a pre-term delivery. The early high levels of IgA antibodies suggest an anamnestic response, perhaps due to cross-reactivity with other HCoVs. Indeed, this patient had a high response to HCoV-NL63 N protein; however, she did not respond to NL63 S2 protein. It is possible that the patient began having symptoms at a later stage of the SARS-CoV-2 infection, so that the longitudinal increase in antibodies was missed.

Fig. 3: Unique longitudinal profiles of mothers’ breast milk IgA response to SARS-CoV-2 show heterogeneity in antibody recognition.

a The line plots show breast milk IgA responses to SARS-CoV-2 and human common cold coronavirus selected full-length and protein fragments produced by the cell-free E. coli in vitro transcription and translation system. Antigens were selected to illustrate differences in reactivity profiles of four unique responders to SARS-CoV-2. The timing of breast milk sampling in days since onset of symptoms is shown on a free x-axis, and the normalized IgA signal intensity is shown in the y-axis. Each colored line represents an antigen’s IgA response measured for each of the longitudinal samples of one of the four mothers (each displayed in separate panels). aa amino acid, FL full-length, SCoV2 SARS-CoV-2, HCoV common cold human coronavirus. b The line plots show the log2 signal intensity of IgA responses to SARS-CoV-2 proteins assayed on the ECLIA platform (only SARS-CoV-2 proteins were assayed by ECLIA), where each panel is a unique mother’s responses to the four antigens. The number of days since onset of symptoms when samples were taken is shown on a free x-axis.

Mother #52 exhibited a classical primary immune response with IVTT full-length S2 and N proteins, beginning low and rising at approximately 25 days post-onset of symptoms, which also tracked with NL63 and OC43 S2 proteins. For this patient, it is possible that symptoms began early in the course of SARS-CoV-2 infection. Mother #56 also had a classical primary immune response, with IgA levels peaking at around 25 days post-onset of symptoms, followed by a moderate contraction, particularly with IVTT full-length N protein, but also for the ECLIA proteins (Fig. 3) and purified S protein on the array (Fig. 2). This patient had a protracted period of symptoms, lasting 91 days. Most interestingly, mother #56 responded in the same way with very high levels of IgA to SARS-CoV-2 M protein, but not OC43 or NL63 M protein, perhaps because the immunoreactive N-terminal region of M is poorly conserved among these viruses. Another unique case was mother #26, who was not tested by PCR for SARS-CoV-2 but had a positive PCR result in a breast milk sample. She had a moderate response to SARS-CoV-2 S protein and little else, but responded specifically and solely to a fragment in SARS-CoV-2 N protein spanning aa 251–350. This patient also responded to MERS-CoV N protein and OC43 N protein (Fig. 2). Alignment of this fragment with OC43 N protein showed ~48% sequence identity between the aa 269–328 region of OC43 N protein and aa 257–319 of SARS-CoV-2 N. Alignment of the same region with NL63 N protein showed ~38% sequence identity between aa 232–307 of NL63 N protein and aa 256–331 of SARS-CoV-2 N, although there was no milk IgA reactivity against NL63 N protein. Numerous women in this study, however, responded at low levels or not at all to SARS-CoV-2 antigens with milk IgA or IgG. These patterns, together with published studies on systemic immune responses to SARS-CoV-2, illustrate a complex relationship between arms of the immune system during the host response to SARS-CoV-2 infection. Why one subject responds with milk IgA to different antigenic targets than another subject remains unclear, but it may be related to preferential presentation of epitopes based on MHC haplotype or other host factors.
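The percent identity figures quoted for the N protein alignments can be illustrated with a short calculation over a pairwise alignment. The alignment tool and the identity convention used in the study are not stated, so this sketch assumes identity is the number of identical, non-gap columns divided by the total number of alignment columns, which is only one of several conventions in use. The aligned strings below are toy sequences, not the actual N protein regions.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity between two rows of a pairwise alignment.

    Assumed convention: identical non-gap columns divided by the total
    number of alignment columns (gap columns count in the denominator).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    matches = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != gap
    )
    return 100.0 * matches / len(aligned_a)


# Toy alignment with one gap column; not real coronavirus sequence.
print(round(percent_identity("MKT-LV", "MRTALV"), 1))
```

The OC43 comparison pairs a 60-residue span (aa 269–328) with a 63-residue span (aa 257–319), which suggests that the alignment contained gaps, so the choice of denominator would affect the reported percentage.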

Lack of SARS-CoV-2 testing in most of the infants limited our interpretation of the effect of breast milk antibodies on infant outcomes. Notably, the infant of mother #14, who was the strongest IgA responder to spike protein, tested positive for SARS-CoV-2 and was asymptomatic. It is unknown if there were additional asymptomatic infections among the infants. However, among the five infants with symptoms, unconfirmed for SARS-CoV-2, only one was linked to a mother in the top half of responders to SARS-CoV-2 proteins on the protein array (mother #60). Further study of mothers with confirmed SARS-CoV-2 infection and close monitoring of their infants for symptomatic and asymptomatic infection is needed to examine the association of specific breast milk IgA with infant COVID-19 outcomes.

This study had several limitations. The collection of breast milk samples was not directly observed, and samples were collected with nonstandard sampling time points. Therefore, breast milk samples collected at the onset of symptoms may not reflect duration of exposure to SARS-CoV-2. We relied on maternal reports of SARS-CoV-2 test results, symptoms and treatments received; however, all participants completed a semi-structured interview guided by trained study staff who prompted for specifics with the aid of a calendar. Another limitation was lack of reactivity to the S1 protein and its fragments produced in vitro using an E. coli-based reaction mixture. This was likely due to the lack of eukaryotic post-translational modifications, including N-linked glycosylation, which is abundant in S1. We compensated for this by including in the array purified S protein expressed in insect Sf9 cells and RBD expressed in HEK-293 cells, both systems with post-translational modifications, and by assaying purified S, RBD, and the NTD expressed in EXPi293 cells by ECLIA.

The value of this study is in the longitudinal assessment of milk antibody levels and the breadth of antigens covered by the protein microarray and ECLIA platforms, coupled with COVID-19 patient cases with unique profiles. The data show a diverse repertoire of antibody targets in SARS-CoV-2 proteins and highly heterogeneous profiles among lactating women. Thus, infants exposed to SARS-CoV-2 may benefit from breastfeeding, but they may vary in the quantity and quality of SARS-CoV-2 antibodies received.