Abstract

Si-Wu-Tang (SWT), a traditional Chinese herbal formula, has shown an effect on antibreast cancer. However, the mechanisms and bioactive components of SWT are still unclear. Fortunately, cell type-specific molecular network has provided an effective method. This study integrated the data of formula components, all types of biomolecules in the human body, and nonexpressed protein in breast cancer cells and constructed the breast cancer cell network and the biological network that SWT acted on the breast cancer-related targets by Entity Grammar System (EGS). Biological network showed 59 bioactive components acting on 15 breast cancer-related targets. The antibreast cancer mechanisms were summarized by enrichment analysis: regulation of cell death, response to hormone stimulation, response to organic substance, regulation of phosphorylation of amino acids, regulation of cell proliferation, regulation of signal transmission, and affection of gland development. In addition, we discovered that verbascoside played the role of antibreast cancer by inhibiting cell proliferation, but there was not a report on this effect. The results of CCK8 and western blot were consistent with the antibreast cancer effect of verbascoside based on biological network. Biological network modeling by EGS and network analysis provide an effective way for uncovering the mechanism and identifying the bioactive components of SWT.

1. Introduction

Traditional Chinese medicine (TCM) formula, a herb combination, has shown a better effect on treating complex diseases [1, 2], such as diabetes, hypertension, and cancer. In the TCM formula, each herb contains numerous components that offer multitarget, multicomponent synergy, and multidimensional pharmacological actions. Besides, some diseases are related to more than one target [3]. Taken together, there is a considerable challenge for researchers to study the mechanisms and bioactive components of the formula using conventional pharmacological methods.

Fortunately, with the improvement of molecular biology and pharmaceutical chemistry, more and more public databases related to biomolecule and TCM have been constructed. Meanwhile, the large-scale molecular detection in various cells and tissues provides abundant omics data for network construction with the improvement of high throughput detection technologies. And above all the emergence of cell type-specific molecular network provided novel idea and orientation for the mechanism and bioactive components study of TCM formula [4, 5].

Si-Wu-Tang (SWT), a TCM formula, which consists of four herbs including Rehmanniae Radix Praeparata (Rehmannia glutinosa Libosch.), Angelicae Sinensis Radix (Angelica sinensis (Oliv.) Diels), Paeoniae Radix Alba (Paeonia lactiflora Pall.), and Chuanxiong Rhizoma (Ligusticum chuanxiong Hort.), is used for treatment of breast cancer especially the terminal breast cancer [68]. However, the antibreast cancer mechanism and bioactive components of SWT have not yet been systematically analyzed. This study integrated the data of TCM formula components, targets of components, metabolism data, gene-transcription factors, biological reactions, and signaling pathways and constructed the breast cancer cell network and the biological network that SWT acted on the breast cancer-related targets by Entity Grammar System (EGS). The biological network could clearly show the interaction between bioactive components of SWT and biomolecules of breast cancer cells. Through network and enrichment analysis, we could identify the bioactive components and summarize the antibreast cancer mechanism of SWT. Through single pathway analysis, we found a novel antibreast cancer ingredient-verbascoside. CCK8 and western blot were applied to test the antibreast cancer effect and the specific biological pathway of bioactive components.

2. Methods and Materials

2.1. Data Sources

The components of four herbs in SWT were collected from TCM Database@Taiwan [9] and TCMSP [10]; then the components were supplemented and perfected by the literature in CNKI and PubMed (1979∼2019). The name, structure, and SMILES string of components should be recorded. Repetitive components were deleted by name. For components which possess synonym, repetitive components were deleted by structural similarity. The collected components in each herb of SWT can be seen in Supplemental Information 1.

The proteins which could interact with components were derived from STITCH 5.0 [11] and DrugBank version 5.1.4 [12]. In STITCH, the proteins which could interact with components were discovered by name or SMILES string. The required confidence was higher than 0.400 and the active prediction methods included gene fusion, cooccurrence, coexpression, experiments, databases, neighborhood, predictions, and text mining. Because some components could not find their targets in STITCH, we found them by molecular similarity in DrugBank. The similarity threshold was 0.7 and the other parameters values were “default.” The proteins which could interact with the components were recorded in Supplemental Information 2.

The data of biological reactions and signaling pathways were derived from the Reactome [13]. Metabolism data of small molecules was from KEGG [14] and HumanCyc [15]. The data of gene-transcription factors was derived from TRRD [16] and TRED [17]. The data of specific expression proteins was derived from the Gene Expression Omnibus (GSE23610) [18]. Breast cancer-related targets (Supplemental Information 3) were collected from TTD [19] and DrugBank version 4.3.

2.2. The Construction of Breast Cancer Cell Network and the Biological Network That SWT Acted on the Breast Cancer-Related Targets by EGS

How to integrate the fragmentary information and further much useful knowledge is a crucial work. So a method which could model the interactions of molecules in biological systems and simulate the evolution of systems is required. In this study, we used EGS to construct the breast cancer cell network and the biological network that SWT acted on the breast cancer-related targets.

EGS is a quadri-tuple, G = (V, F, P, S), whereas V is the character set representing a basic element, F is a finite set of relations for V, V and F were viewed as entity, P is a set of rules to deduce relationships between entities, and S is the starting entity [20]. According to different objective, we can write different reasoning engine. Providing a starting condition (starting entity, such as components in herb or TCM formula), EGS can obtain the result of the relationship among entities automatically. With these relationships among entities, we can construct formula-biomolecule interaction networks in a specific cell. Here, we used EGS as a framework to find the relationships between components and other entities. The results, after reasoning and rearranging, were visualized with Cytoscape. Thus, we could construct a cell type-specific molecular network. The application of EGS is more flexible compared to traditional network modeling methods. According to different objectives, we can define different V, F, , and S.

The V, F, , and S of the breast cancer cell network and the biological network that SWT acted on the breast cancer-related targets were described by the following:(1) is the set of components of the corresponding herb, is the set of proteins which can interact with components, is the set of breast cancer-related proteins, is the set of rest proteins in the whole biological network of the human body, and is the set of rest complexes or small molecule metabolites in the whole biological network of the human body. is the set of none-express proteins in breast cancer cells, is the set of the number of reactions or metabolisms, and is the set of chemical constituents.(2)F = {reaction (A, B, C, D, E), metainput (A, B, C), metaoutput (A, B, C), metaenzyme (A, B, C, D), tf (A, B, C, D, E, F), noneexpress (A, B, C), biochemreac (A), metabolism (A), notf (A), link (A, B, C), draw (A, B, X, Y), forward (A, B, X, Y), backward (A, B, X, Y), start (A, B), dis (A, B), pict (A, B, X, Y), length (Y)}.In reaction (A, B, C, D, E), , D ∈ , E ∈ {pos, neg, known, enzyme}. It defines that A acts as the reactant or product of the biochemical reaction D with the action of E. In metainput (A, B, C), metaoutput (A, B, C), A ∈ , B ∈ , C represents the reactant or product of the metabolic reaction of B. In metaenzyme (A, B, C, D), A represents the enzyme of metabolic reaction D. The tf (A, B, C, D, E, F) defines that A acts as the transcript factor of C with the action of F. The noneexpress (A, B, C) defines that protein A is not expressed in cell B and labels C as “none.” In biochemreac (A), no tf (A), and metabolism (A), A labels the number of biochemical reactions, metabolism reactions, or transcriptions. In link (A, B, C), , C {pos, neg, known, enzyme}. The link (A, B, C) defines the interaction of all types of molecules which exist in the whole biological network of the human body network. Draw (A, B, C, Y), forward (A, B, C, Y) and backward (A, B, C, Y), , C {pos, neg, known, enzyme}, Y ∈ Z. That is, A acts on B with an effect described in C through Y reactions and the distance number is Y. If a process takes place in one reaction, Y equals 1. The start (A, B), A ∈ , B ∈ . In other words, A represents the components of the corresponding herbs; B represents the targets of compounds. In dis (A, B), A represents the disease-breast cancer, while B represents the breast cancer-related proteins. In pict (A, B, C, Y), A ∈ , B ∈ , C ∈ {pos, neg, known, enzyme}, Y ∈ Z. The pict (A, B, C, Y) represents that A (targets of components) affects B (breast cancer-related protein) with an effect described in C by the reactions of Y. One thing to note here is that the same letters may represent different meaning in various entities; different letters represent the same meaning because they are located in the same entity.(3) = {reaction (A, D, E, F, G), noneexpress (A, B, C) ⇒ biochemreact (F)}, = {not biochemreac (J), reaction (G, H, I, J, K) ⇒ link (G, H, K)}, = {metaenzyme (A, D, E, F), noneexpress (A, B, C) ⇒ metabolism (F)}, = {metaenzyme (A, D, E, G), not metabolism (G) ⇒ link (A, D, G)}, = {metainput (G, H, I), not metabolism (I) ⇒ link (G, H, I)}, = {metaoutput (G, H, I), not metabolism (I) ⇒ link (G, H, I)}, = {tf (A, D, E, F, G, H), noneexpress (A, B, C) ⇒ notf (H)}, = {tf (I, J, K, L, M, N), not notf (N) ⇒ link (I, K, M)}, = {link (A, B, X), start (A) ⇒ draw (A, B, X, M)}, = {draw (A, B, X, N), link (B, C, D), M = N + 1, N < 15 ⇒ draw (A, C, D, M)}, = {draw (C, E, F, X), dis (A, D), E = D⇒pict (C, D, F, X)}, = {pict (C, B, X, Y) ⇒ length (Y)}, = {link (C, B, D), pict (_, B, _, _) ⇒ backward (C, B, D, 1)}, = {link (D, C, E), backward (C, B, D, N), M = N + 1, N < Y, length (Y) ⇒ backward (D, C, E, F)}, = {pict (A, _, _, _), backward (A, C, X, Y) ⇒ forward (A, C, X, 1)}, = {forward (A, C, X, N), backward (C, D, E, F), M = N + 1, N=YF, length (Y) ⇒ forward (C, D, E, M)}. is a set of rules which were utilized to construct the breast cancer cell network. was utilized to compute all the numbers of disease-related proteins that the components could affect and the distance between them. The distances were limited within 15 steps. labels the starting point and labels the ending point. were used to describe the detailed pathway from components target to disease-related proteins, with the specific steps of Y. Forward (C, D, E, M) was the final result used to construct the network. After connecting each of the forward (C, D, E, M), we could get the detailed pathway from components to disease-related proteins.(4) is the set of entities in the biological network of the whole human body for deduction, such as reaction (A, _, _D, E), noneexpress (A, B, C), biochemreac (F), metaenzyme (A, B, C, D), metabolism (A), metainput (A, B, C), metaoutput (A, B, C), tf (A, _, _D, E, F), and link (A, B, X). S2 is the set of starting and ending point proteins and was described by start (A, B) and dis (A, B). In this study, the starting points were defined as none-express proteins in breast cancer cells and components in SWT, respectively. The ending point was defined as breast cancer-related proteins when constructing the biological network that SWT acted on the breast cancer-related targets.

2.3. The Bioactive Components of SWT

Starting with a disease target, this study could identify the bioactive components of SWT reverse biological pathway. In other words, the letter C in forward (C, D, E, M) represents bioactive components.

2.4. Enrichment Analysis

Gene-GO term enrichment analysis was utilized to highlight the most relevant biopathways associated with a given gene list. It was carried by DAVID 6.7 Functional Annotation Clustering [21]. Values of and FDR <0.05 were considered as significantly enriched biopathways.

2.5. CCK8 Was Used to Detect the Influence of Verbascoside on the Proliferation Activity of MCF-7

Verbascoside and paclitaxel were purchased from the National Institutes for Food and Drug Control (Beijing, China). High DMEM basal medium was purchased from HyClone® (Beijing, China). FBS was purchased from GIBCO®/Invitrogen Life Technologies (Beijing, China). Double-antibody was purchased from ProSpec® (Beijing, China).

The human breast cancer cell line MCF-7 was obtained from Jiamay Biotech Co., Ltd. (Beijing, China). The MCF-7 cells of logarithmic growth were inoculated in 96-well plates (5000/well). Holes without cells were used as blank controls. The plate was preincubated at 37°C in a humidified atmosphere comprised of 95% air and 5% CO2. Verbascoside was diluted in DMSO and then added to each well at a final concentration of 0, 5, 10, 20, 40, 80, and 160 μM, which were incubated for 24 h, 48 h, and 72 h. Paclitaxel (3.83 μg/mL) was used as positive control group. The above groups were all set 5 repeats. CCK-8 solution (10 μL) was added to each well of the plates. The plates was incubated for 1∼4 h. The absorbance was measured at 450 nm using a microplate reader (Thermo).

Cell survival rate = A1 (medication group)/A2 (blank control group) 100%; the results were shown as the mean ± standard error of the mean (SEM). The differences between the two groups of independent samples were analyzed by Student’s t-test. was considered statistically significant. The IC50 in different timings were calculated by GraphPad Prism 6.

2.6. Western Blot Was Used to Detect the Effluences of Verbascoside on the Expression of Cdk6, AP-1, and MYC in MCF-7

The MCF-7 cells of logarithmic growth were inoculated in 96-well plates (5000/well). Holes without cells were used as blank controls. The plate was preincubated at 37°C in a humidified atmosphere comprised of 95% air and 5% CO2. Verbascoside was diluted in DMSO and then added to each well at a final concentration of 0, 1, 2, 4, 8, 16, and 32 μM, which were incubated for 48 h. Paclitaxel (3.83 μg/mL) was used as positive control group. The above groups were all set 5 repeats. Cell lysis buffer was prepared by PMSF, cocktail, Triton-100, NaVO3, and base lysis buffer. Protein lysis buffer (150 μL) was added to the cells of black control group, paclitaxel group, and verbascoside group, vortex 8∼10 times, pyrolysis 35∼40 min on ice; The lysates were then centrifuged at 12000 rpm for 15 min at 4°C; the concentration of the total proteins was determined using BCA kit. The proteins were separated by SDS-polyacrylamide gel electrophoresis (SDS-PAGE) and then were transferred into a PVDF membrane. The membranes were blocked with 5% skimmed milk in TBST for 2 h at room temperature and incubated with specific primary antibodies overnight at 4°C. The membranes were washed three times and incubated with appropriate secondary antibodies within 1 h at room temperature. Protein bands were detected using the chemiluminescence detection system (Applygen, China). All of the western blots were performed at least three times.

The results were shown as the mean ± standard error of the mean (SEM). The differences between the two groups of independent samples were analyzed by Student’s t-test. was considered statistically significant.

3. Results and Discussions

3.1. The Breast Cancer Cell Network

With none-express proteins in the breast cancer cell as the starting point, all the biological reactions, that is, link (I, K, M), in the breast cancer cell, were visualized with Cytoscape. We could obtain the breast cancer cell network (Figure 1). This network can be used as the background network to qualitatively deduce the antibreast cancer mechanism of SWT.

3.2. The Biological Network That SWT Acted on Breast Cancer-Related Targets by EGS

With all the components as the starting point and breast cancer-related targets as the ending point, the biological networks of SWT that acted on the breast cancer-related targets were constructed by EGS and visualized by Cytoscape (Figure 2).

Figure 2 could overall exhibit the bioactive components and the antibreast cancer mechanism on a molecular level. One component could act on multiple targets and multiple components could act on only one target. SWT played a regulatory role through regulating multiple components, multiple targets, and multiple pathways. These illustrated that the targets and pathways of SWT were extensive. That is to say, its targets and pathways were not specific but comprehensive. Besides, some components could act on one target, but the active directions were opposite. For example, the direction of quercetin acted on NFκB1 and p50 was positive, while palmitoleic acid was negative. SWT may keep the cell state in balance through this opposite regulation.

3.3. The Bioactive Components of SWT

The biological networks clearly showed the 15 breast cancer-related targets affected by 59 bioactive components. These 59 components (Table 1) were considered as bioactive components of SWT. These indicated that multiple components of SWT play an antibreast cancer role through multiple biological pathways and targets.

According to the results computed by EGS, the biopathway of each bioactive component could be extracted and described. For example, kaempferol could inhibit the proliferation of MCF-7 by increasing the expression of ESR2, improving the activity of ESR2 and PPARG, and finally inhibiting EGFR activity.

3.4. Enriched Pathways of Breast Cancer-Related Targets Affected by SWT

Single biopathway affected by bioactive components could only illustrate the antibreast cancer mechanism of SWT in one aspect. Enrichment analysis of breast cancer-related targets affected by SWT could uncover the mechanism comprehensively. These targets were highlighted in 17 biopathways (Table 2).

We find that 17 biopathways could be divided into 7 groups. In other words, the antibreast cancer mechanism of SWT could be summarized as follows: response to organic substances, regulation of amino acid phosphorylation, regulation of cell death, response to hormone stimulation, regulation of cell proliferation, regulation of signal transmission, and affection of gland development. Apparently, they are all closely associated with the occurrence and development of breast cancer [2228].

Some reversed regulations existed in the enrichment analysis of SWT, such as negative regulation of programmed cell death and positive regulation of cell proliferation. These pathways appeared to be opposite to antibreast cancer, but this phenomenon was also reasonable. They may be derived from the opposite active direction of some components. Some components acted on one target with the same or opposite active direction, such as quercetin and palmitoleic acid. These various directions may represent the synergism or antagonism between components. Just because of the synergism and antagonism of these bioactive components, SWT could maintain the homeostasis of the cell. This may be the reason why the drug combination had the potential to enhance efficacy and reduce toxicity [29, 30]. All the results fully demonstrated the curative properties of the TCM formula.

3.5. Biopathway Affected by Verbascoside

In order to clearly show and analyze the antibreast cancer mechanism of SWT, we extracted all the biopathways. The following was the biopathway that verbascoside acted on Cdk6 (Figure 3).

Verbascoside is the index component of Rehmanniae Radix Praeparata. It has some effects such as being neuroprotective [31], antipneumonia [32], antifatigue [33], being cardiotonic [34], etc. Research showed that verbascoside could have competitive binding estrogen receptors in Hela with estradiol, antagonizing the upregulation of ERE luciferase activity by estradiol. The effect of verbascoside is similar to known phytoestrogens such as resveratrol [35], indicating that verbascoside has plant estrogenic activity. In addition, verbascoside is associated with the proliferation and differentiation of colorectal cancer and gastric cancer cells [36, 37], but there are no reports on its effect on antibreast cancer and mechanism. We further verified the antibreast cancer effect of verbascoside in Sections 3.6 and 3.7.

From Figure 3, we could see that verbascoside enhanced the phosphorylation of SMAD3, promoted the expression of AP-1, and inhibited the expression of MYC and Cdk6. SMAD3 could control gene activity and inhibit cell proliferation; Ap-1 could control many cell processes, including differentiation, proliferation, and apoptosis. AP-1 inhibited the expression of cellular adhesion proteins by the phosphorylation of SMAD3 and inhibited nuclear transcription factor AP-1 in the biological pathway SMAD3/AP-1. Enhancing the activity of AP-1 could reduce the expression of MYC, and MYC was an oncogene that was closely related to the cell cycle. MYC was involved in the malignant transformation of human and animal cells and overexpression in many malignant tumors, which could lead to apoptosis of cells such as HL60 [38, 39]. The degradation of MYC could reduce the expression of Cdk6 [40]. Cdk6 was a cyclin-dependent kinase that controlled cell cycle; Cdk6 could make positive regulation of cell proliferation during the G1/S phase transition. In conclusion, verbascoside may play the role of antibreast cancer by inhibiting cell proliferation and promoting apoptosis. This study also tested the effect of antibreast cancer on CCK8 and western blot.

3.6. The Influence of Verbascoside on the Proliferation Activity of MCF-7

Figure 4 showed that the positive drug has the highest inhibition rate, and the inhibition rate was 56%∼62%. The inhibition rate of verbascoside was lower than paclitaxel. The highest inhibition rate of verbascoside was 30%∼38% when its concentration was 20 μM. When the concentrations of verbascoside were 5∼20 μM and 40∼160 μM, its inhibition rate increased; when the concentration is 20∼40 μM, its inhibition rate decreased. All these explained that verbascoside could inhibit the proliferation of MCF-7 at lower concentrations.

The IC50 of verbascoside in 24 h, 48 h, and 72 h were, respectively, 8.85, 4.405, and 4.565 μM. Verbascoside had the best inhibition of proliferation at 24 h, and the inhibition rate decreases at 48 h and 72 h. To sum up, this study chose the concentrations of 2 μM, 4 μM, 8 μM, 16 μM, 32 μM, and 24 h to carry out the western blot.

3.7. The Effluences of Verbascoside on the Expression of Cdk6, AP-1, and MYC in MCF-7

From Figure 5, we could see that the expression of AP-1 increased, and the expression of Cdk6 and MYC decreased. That is to say, verbascoside facilitated the expression of AP-1, inhibited the expression of MYC and Cdk6, and achieved the optimum state at 32 μM. Verbascoside and paclitaxel had the same influence tendency on the expression of the three proteins. The results of Figure 5 were consistent with the analysis in Figure 3.

In conclusion, verbascoside’s antibreast cancer action mechanism enhanced the phosphorylation of SMAD3, promoted expression of AP-1, inhibited expression of MYC and Cdk6, and played the role of antibreast cancer by inhibiting the proliferation of MCF-7. The experimental results were consistent with the antibreast cancer effect of verbascoside based on biological network. All these verified the reliability of the bioactive components identification method based on cell type-specific molecular network.

4. Conclusion

The biological network constructed by EGS could exhibit the bioactive components of SWT and its antibreast cancer mechanism. 59 bioactive components of SWT act on 15 breast cancer-related targets. Each component of SWT plays a synergism or antagonism role through acting on similar biopathways and thus maintains the homeostasis of the cell. The antibreast cancer mechanisms of SWT were summarized by enrichment analysis as follows: regulation of cell death, response to hormone stimulation, response to organic substance, regulation of phosphorylation of amino acids, regulation of cell proliferation, regulation of signal transmission, and affection of gland development.

Single pathway could elucidate the antibreast cancer mechanism of each component. In this study, we identified and validated a novel antibreast cancer ingredient-verbascoside. The results of CCK8 and western blot verified the reliability of the bioactive components identification method based on cell type-specific molecular network. This study will provide the basis for the quality control of SWT and the design of combination drugs based on SWT.

Comparing to protein-protein interaction network (PPI), cell type-specific molecular network constructed by EGS is belonging to a multimolecular type network. EGS could integrate the information of gene regulation, metabolic network, cell signal, etc. Effectively, realizing batch processing is an efficient and flexible information fusion method. When combined with experimental data, especially all kinds of omics data, the space scale characteristics of complex systems are fully considered. At present, no studies have been reported on the mechanism of action of TCM using cell type-specific molecular networks. This study was based on the data included in the existing database or literature, and we did not take into account the quantity of components, so this method still has some limitations. With further study of the bioactivity of SWT components and breast cancer-related biomolecules, the data we used will be more complete and the results will be more precise and integrated.

Data Availability

The datasets used during the current study are available from the corresponding author on reasonable request.

Conflicts of Interest

The authors declare that there are no conflicts of interest.

Acknowledgments

This work was supported by the Youth Science Foundation Project of the National Natural Science Foundation of China (Grant Nos. 81903931 and 81673697).

Supplementary Materials

Supplemental Information 1: 656 components of SWT. Supplemental Information 2: the proteins which could interact with the components of SWT. Supplemental Information 3: 60 breast cancer-related targets. (Supplementary Materials)