Introduction

India is considered as one of the high tuberculosis (TB) burden countries and accounts for two-third of the total global cases1. The currently followed drug regimen for TB treatment is Isoniazid (INH), Rifampicin (RIF), Pyrazinamide (PZA), Ethambutol (EMB) daily for 2 months (initial phase) followed by INH and RIF daily for 4 months (continuous phase)2. Although the current anti-TB drug therapy is effective, still it is associated with various side effects predominantly hepatotoxicity and peripheral neuritis.

Anti-tubercular drug-induced liver injury (AT-DILI) is one of the major reason for the development of hepatotoxicity which can lead to treatment failure among TB patients3. Frequency of AT-DILI varies from 2 to 39% among different countries4. Some cross-sectional and retrospective studies have reported 7.9% incidence and 9.48% prevalence of hepatotoxicity among TB patients on anti-tuberculosis therapy5,6. INH, RIF and PZA are the main causative agents for the development of hepatotoxicity in TB patients7,8,9. Additionally, case control studies have shown that TB patients with older age, associated comorbid conditions, chronic alcoholics, suffering from chronic liver diseases and having lower body mass are more susceptible to develop drug-induced hepatotoxicity6,10. Anti-TB drug-induced hepatotoxicity can lead to irreversible liver failure that may require liver transplantation and, if not recognized, can be fatal11. Current diagnosis involves analysis of various liver function parameters like alanine aminotransferase (ALT), aspartate aminotransferase (AST) and total bilirubin (TBIL). However, AT-DILI lacks differential diagnosis and often is characterized similar to other forms of DILI as AST and ALT levels possess poor specificity for the liver.

The advancement in omics-based technologies, including genomics, proteomics and metabolomics can serve as an important tool to unravel the mechanism of DILI12. Mass spectrometry-based high throughput technique is the best approach to profile global protein expression, which can be employed in complex biological samples including serum to quantify proteins13. The efficiency to identify novel biomarkers from clinical specimens has dramatically improved due to quantitative proteomics approaches14. Serum biomarkers, including glutamate dehydrogenase, keratin 18, glutathione S-transferase and many others, have recently been documented as potential biomarkers to characterize DILI but are yet in their infancy and need further exploration15.

Hence, in the present study, we hypothesized that the proteomics approach would identify potential proteins with high discriminative ability in TB patients with and without hepatotoxicity. Further validation of differentially expressed proteins in serum of TB patients on anti-tubercular drugs may lead to the identification of protein signatures which may  play a crucial role in development of AT-DILI and could be plausible therapeutic targets.

Results

Patient recruitment and study groups

Study subjects were recruited based on the inclusion and exclusion criteria and were categorized into different groups (Table 1) based on various liver function parameters. The demographic profile of patients along with biochemical parameters is summarized in Tables 2 and 3 respectively. The mean age of hepatotoxic patients was relatively higher than the other three groups; however, the findings were non significant. The TB patients irrespective of drug treatment or hepatotoxicty, were found to have significant low BMI as compared to healthy control. Patients recruited in the study were not having any other associated comorbidities, like diabetes, HIV or other chronic liver disease. A significant increase was seen in the serum levels of alanine aminotransferase (ALT), aspartate aminotransferase (AST), total bilirubin (TBIL), alkaline phosphatase (ALP) and gamma-glutamyl transferase (GGT) (Table 3) in Group 4 patients as compared to Groups 1, 2 and 3 indicating hepatotoxicity in Group 4 patients.

Table 1 Description of study groups.
Table 2 Demographic details of study subjects enrolled in different groups.
Table 3 Serum biochemical analysis of included study subjects.

Shotgun proteomic analysis and relative protein quantification

After protein quantification in serum samples of defined study groups, 10 samples of each group were pooled taking equal protein concentration and two high abundant serum proteins (albumin and immunoglobulin G) were removed. SDS PAGE demonstrated efficient depletion of 90% of albumin (mol. wt. = 66.5 kDa) and immunoglobulin G (IgG) (mol. wt. = 150 kDa) which together constitute 50–70% and 10–25% of human serum proteins, respectively from pooled serum samples of defined study groups 1, 2, 3 and 4. The representative original gel images are shown in Fig. 1.

Figure 1
figure 1

Representative images of SDS–PAGE: Protein differences of pooled serum samples are shown before and after depletion of two high abundant proteins including albumin and IgG (a) Groups 1 and 2 (Lane 1: Ladder, Lane 2: Pooled Group 1 serum, Lane 3: Two proteins depleted fraction of group 1, Lane 4: Pooled Group 2 serum and Lane 5: Two proteins depleted fraction of group 2) and (b) Groups 3 and 4 (Lane 1: Ladder, Lane 2: Pooled Group 3 serum, Lane 3: Two proteins depleted fraction of group 3, Lane 4: Pooled Group 4 serum and Lane 5: Two proteins depleted fraction of group 4.

The depleted serum samples of each group were run in duplicates on LC–MS/MS. To avoid false protein identification, peptides with high confidence and proteins with a minimum of one unique peptide sequence were allowed for protein grouping. A total of 376 proteins with at least one unique peptide were identified, out of which 276 proteins were filtered out and relatively compared by applying a filter ‘found in samples’ has confidence ‘not found’ in at most 1 sample among defined study groups. By applying this filter, the proteins which were detected in at least 3 sample groups were considered for relative quantification and were further statistically and bioinformatically studied for differential expression and functional analysis.

Comparative proteomic analysis and identification of signature proteins

Comparison of the newly diagnosed TB group (Group 2) versus the healthy group (Group 1) led to the identification of 14 significant differentially expressed proteins. Volcano plots highlighted a significant difference between the healthy and freshly diagnosed TB groups (Fig. 2a). Some of the upregulated proteins present in the serum of TB patients include Coagulation factor IX (F9), Proteoglycan 4 (PRG4), isoform LMW of Kininogen-1 (KNG1) and Leucine-rich alpha-2-glycoprotein (LRG1). In contrast, the proteins such as Apolipoprotein A-IV (APOA4), Mannose-binding protein C (MBL2) and CD44 antigen (Fragment) (CD44) were found to be downregulated in newly diagnosed TB patients (Group 2) as compared to healthy controls (Group 1) (Fig. 2b, c and Table 4).

Figure 2
figure 2

Volcano plot and heatmap representation of newly diagnosed TB (Group 2) versus healthy group (Group 1): Statistically significant normalized grouped abundance ratios of (a) Newly diagnosed TB versus a healthy group with − log10 p-value (Y) and log2 fold change (X). Heatmap of differentially expressed proteins from newly diagnosed TB versus healthy group comparison of (b) upregulated and (c) downregulated proteins where G1 (Healthy), G2 (newly diagnosed TB), G3 (TB without hepatotoxicity) and G4 (TB with hepatotoxicity) represent grouped abundance of each group. A p-value < 0.05 is considered as statistically significant. Heatmaps and volcano plot generated using Proteome Discoverer 2.5 (Thermo Scientific, Bremen, Germany).

Table 4 A comparison of significantly differentially expressed proteins from newly diagnosed TB (G2) versus healthy (G1) versus groups.

A comparison of TB without hepatotoxicity (Group 3) versus newly diagnosed TB group (Group 2) showed 15 significant differentially expressed proteins (Fig. 3a). Some of the upregulated proteins include Mannan-binding lectin serine protease 2 (MASP2), Tetranectin (CLEC3B), Mannose-binding protein C (MBL2), Complement C3 (C3), Complement C5 (C5) and Inter-alpha-trypsin inhibitor heavy chain H1 (ITIH1) (Table 5, Fig. 3b, c). Moreover, a comparison of TB with hepatotoxicity versus newly diagnosed TB groups revealed 13 differentially expressed proteins (Fig. 4a). The upregulated proteins include Apolipoprotein C-I (APOC1), Mannose-binding protein C (MBL2) and Apolipoprotein E (APOE) etc. In contrast, the observed downregulated proteins include LRG1, Serum amyloid P-component (APCS) and Proteoglycan 4 (PRG4) (Table 6, Fig. 4b, c).

Figure 3
figure 3

Volcano plot and heatmap representation of TB without hepatotoxicity (Group 3) versus newly diagnosed TB group (Group 2) groups: Statistically significant normalized grouped abundance ratios of (a) TB without hepatotoxicity versus newly diagnosed TB with − log10 p-value (Y) and log2 fold change (X). Heatmap of differentially expressed proteins from TB without hepatotoxicity versus newly diagnosed TB groups comparison of (b) upregulated and (c) downregulated proteins where G1 (Healthy), G2 (newly diagnosed TB), G3 (TB without hepatotoxicity) and G4 (TB with hepatotoxicity) represent grouped abundance of each group. A p-value < 0.05 is considered as statistically significant. Heatmaps and volcano plot generated using Proteome Discoverer 2.5 (Thermo Scientific, Bremen, Germany).

Table 5 A comparison of significantly differentially expressed proteins from TB without hepatotoxicity (G3) versus newly diagnosed TB (G2) groups.
Figure 4
figure 4

Volcano plot and heatmap representation of TB with hepatotoxicity (Group 4) versus newly diagnosed TB group (Group 2) groups: Statistically significant normalized grouped abundance ratios of (a) TB with hepatotoxicity versus newly diagnosed TB with − log10 p-value (Y) and log2 fold change (X). Heatmap of differentially expressed proteins from TB with hepatotoxicity versus newly diagnosed TB groups comparison of (b) upregulated and (c) downregulated proteins where G1 (Healthy), G2 (newly diagnosed TB), G3 (TB without hepatotoxicity) and G4 (TB with hepatotoxicity) represent grouped abundance of each group. A p-value < 0.05 is considered as statistically significant. Heatmaps and volcano plot generated using Proteome Discoverer 2.5 (Thermo Scientific, Bremen, Germany).

Table 6 The comparison of significantly differentially expressed proteins from TB with hepatotoxicity (G4) and newly diagnosed TB (G2) groups.

Furthermore, TB with hepatotoxicity group (Group 4) in comparison to TB without hepatotoxicity (Group 3) showed 24 significant differentially expressed proteins (Table 7, Fig. 5A a–c). Vitamin K-dependent protein S (PROS), ADP-ribosyl cyclase/cyclic ADP-ribose hydrolase (Fragment) (BST1) and Apolipoprotein B-100 (APOB) were found to be upregulated whereas downregulated proteins include Phosphatidylcholine-sterol acyltransferase (LCAT), Adiponectin (ADIPOQ), Serum amyloid P-component (APCS), Proteoglycan 4 (PRG4) and Transthyretin (TTR). Some of the observed proteins were similar to the identified proteins in TB with hepatotoxicity (Group 4) compared to newly diagnosed TB group (Group 2) (Fig. 4a) or with the healthy group (Group 1) (data shown in supplementary Fig. 1). Noteworthy, some protein signatures like PROS1, KNG1, CFH, LCAT, APCS and ADIPOQ showed the potential to distinguish the hepatotoxicity group from other defined study groups. However, only APCS protein showed significant downregulation in Group 4 as compared to Groups 1,2,3 suggesting its potential role in anti-tubercular drug induced hepatotoxicity (Fig. 5B).

Table 7 The comparison of significantly differentially expressed proteins from TB with hepatotoxicity (G4) versus TB without hepatotoxicity (G3) groups.
Figure 5
figure 5

Volcano plot and heatmap representation of TB with hepatotoxicity (Group 4) versus TB without hepatotoxicity (Group 3) groups: Statistically significant normalized grouped abundance ratios of (A,a) TB with hepatotoxicity versus TB without hepatotoxicity with − log10 p-value (Y) and log2 fold change (X). Heatmap of differentially expressed proteins from TB with hepatotoxicity versus TB without hepatotoxicity groups comparison of (A,b) upregulated and (A,c) downregulated proteins. (B) The trend chart of signature proteins including (a) PROS1, (b) KNG1, (c) CFH, (d) LCAT, (e) APCS and (f) ADIPOQ which showed the potential to distinguish the hepatotoxicity group from other groups where G1 (Healthy), G2 (newly diagnosed TB), G3 (TB without hepatotoxicity) and G4 (TB with hepatotoxicity) represent grouped abundance of each group. A p-value < 0.05 is considered as statistically significant. Heatmaps and volcano plot generated using Proteome Discoverer 2.5 (Thermo Scientific, Bremen, Germany).

GO annotation and pathway analysis of differentially expressed proteins

The gene symbol of differentially expressed proteins identified among TB with hepatotoxicity versus TB without hepatotoxicity groups was listed in String 11.0 for interaction analysis. It showed a strong interaction network (Fig. 6a). The gene ontology (GO) functional enrichment analysis by biological process and cellular component was performed with the differentially expressed proteins (Fig. 6b, c). It showed that proteins were mostly involved in biological processes like complement activation, alternative pathway (CFH, C5 and C3), triglyceride-rich lipoprotein particle remodelling (LCAT and APOB), fibrinolysis (PROS1 and FGA) with platelet degranulation (CLEC3B, PROS1, FGA, TF and SERPINA3) and were part of high-density lipoprotein particle (APOB and LCAT), blood microparticle (PROS1, GSN, CFH, C3, APCS, C4BPA, VTN, FGA, TF and SERPINA3) and platelet alpha granule lumen (PROS1, FGA and SERPINA3) of the cell. Thus, the modulated proteins were involved in complement and coagulation cascades (F9, C5, VTN, C3, FGA, C4BPA, CFH, PROS1, MASP2) and cholesterol metabolism (APOB, LCAT), which might play a role in drug-induced liver injury (Fig. 6d).

Figure 6
figure 6

Bioinformatics analysis of differentially expressed proteins: A strong interactome network (a) of modulated proteins by String 11.0 among TB with hepatotoxicity and TB without hepatotoxicity group comparison42. Functional enrichment analysis utilizing Gene Ontology43 (b and c) shows that most modulated proteins are involved in complement activation (alternative pathway), triglyceride-rich lipoprotein particle remodelling, fibrinolysis with platelet degranulation, and belongs to high-density lipoprotein particle, blood microparticle and platelet alpha granule lumen of the cell. (d) KEGG pathway enrichment analysis44 shows that modulated proteins are primarily engaged in complement and coagulation cascades and cholesterol metabolism.

Validation of serum amyloid P component protein (APCS) in serum samples of different study groups

ELISA was used for validation of APCS protein which was observed to be downregulated specifically in TB with hepatotoxicity group (Group 4) in comparison to TB without hepatotoxicity (Group 3). APCS protein showed a significant downward trend and statistical analysis revealed comparable results to those obtained in discovery phase (Fig. 7a). Receiver Operating Characteristic (ROC) curve analysis showed 90% AUC (area under the curve) values with 100% sensitivity and 87% specificity as revealed by Youden index analysis, confirming its diagnostic potential that can further be explored in larger cohort (Fig. 7b).

Figure 7
figure 7

Boxplot representation and ROC curve analysis of APCS protein: The levels of significantly downregulated (a) serum amyloid P component (APCS) in serum samples of G4 (TB with hepatotoxicity) versus G3 (TB without hepatotoxicity) study subjects. Mann–Whitney test was used to compare serum APCS protein levels. (b) ROC and Youden index analysis was done for the APCS to investigate their discriminatory potential between TB patients with and without hepatotoxicity (n = 8 for each group; Conc. = concentration; CI: Confidence interval).

Discussion

Clinically, an attractive diagnostic tool for characterizing DILI is based on measuring biochemical markers including ALT, AST, ALP, Total Bilirubin and GGT in serum16,17. Despite the availability of these traditional biomarkers, the investigations to find a suitable candidate(s) that can predict DILI at an earlier onset are still in progress as conventional markers lack sensitivity and specificity, which further possess limitations in deciding subsequent treatment regimens. Proteomics, along with other omics technologies, has been explored to find out extensive list of proteins, microRNAs and many other biomarkers related to DILI but still no biomarker is in clinical utility15. In present study, 376 proteins were quantified by label-free quantification in pooled serum samples and the number of quantified proteins is concordant with other proteomics-based studies in serum of TB patients, while because of technical variations, a higher number of proteins were also identified in other studies18,19,20. Prior to mass spectrometry run, the serum samples were depleted of albumin and IgG as the high abundant proteins depletion is one of the common strategies to identify low abundant proteins which holds biomarker potential21,22.

Out of identified 376 proteins, 276 proteins were filtered out and compared among defined study groups. Newly diagnosed TB group versus healthy group comparison presented many upregulated proteins, for instance, F9, PRG4, KNG1 and LRG1. While the proteins such as APOA4, BTD, MBL2, C4A, C5, HBG1 and others were found to be significantly decreased. The findings are consistent with the recent available literature on serum proteomic profiling of healthy and TB patients18,23,24. The levels of Mannose binding lectin-2 (MBL2) were found to be decreased in TB patients versus the healthy group. In contrast, an increasement in its level was observed upon initiation of anti-TB drug treatment, which remained high in the toxic group, suggesting its involvement in the pathogenesis of TB. MBL is known to play a potential role in innate immunity and is decreased in TB patients, further, increasing susceptibility to infections25.

Next, to evaluate the effect of anti-TB treatment on the identified protein profile, the newly diagnosed TB group was compared with subjects with hepatotoxicity and without hepatotoxicity groups. Interestingly, the proteins including IGHG4, ITIH1, LRG1, MBL2 and LRG1 showed an opposite trend of differential expression when the newly diagnosed TB group was compared among TB group without hepatotoxicity and healthy group, respectively, suggesting an influence of anti-TB drug treatment. Other proteins, including CLEC3B, PIGR, FGA and ATRN (upregulated proteins) and C3, C5 and BST1 (downregulated proteins), were also differentially expressed, suggesting the possible effect of anti-TB drugs on the expression of enlisted proteins. The observed experimental findings are similar to the previously reported studies which suggest that reduced C3 levels are correlated with the improved treatment outcome in TB patients26,27.

Furthermore, as the study primarily focused on characterizing the protein signatures significantly modulated in TB patients with hepatotoxicity, a subset of proteins were observed to be differentially expressed among TB with hepatotoxicity versus those without hepatotoxicity groups. Some specific proteins include PROS1, APOB, IGHV3-49, SERPINA3, VTN, KNG1, CFH, F9 (upregulated) and TF, LCAT, FGA, C4BPA, PTPRJ, TTR, APCS, ADIPOQ, GSN, CLEC3B, MASP2 (downregulated). Some of the observed proteins, including SERPINA3, KNG1 and APCS, are involved in TB pathogenesis28,29,30. The Complement C3 (C3) protein, along with Complement factor H (CFH), was found to be upregulated in TB with the hepatotoxicity group. CFH is one of the primary regulators of the alternative pathway. Ultimately, all the three complement pathways (including classical, lectin and alternative) lead to activation of C3 convertase and downstream cleavage of C3. This leads to opsonization of bacteria by macrophages, and activation of C5 convertase which may result in mycobacterial cell lysis by forming a membrane attack complex31. The increased levels of C3 as observed in the hepatotoxicity group might be linked with TB treatment failure and reduced C3 levels have been correlated with the improved treatment outcome26. Our findings are supported by available literature where increased oxidative stress is linked with complement-dependent (particularly C3, CFH and C5 components) inflammatory responses32,33. Vitamin K-dependent protein S (PROS1) levels were also upregulated in the hepatotoxicity group. It is known that PROS1 is synthesized in the liver which plays a central role in blood coagulation and acts as an anticoagulant plasma protein34. A decrease in Adiponectin (ADIPOQ) levels, an adipokine, was found in pulmonary TB patients. At the same time, an increase was observed after initiation of antitubercular therapy35. However, surprisingly in our study, we observed that the levels of Adiponectin showed a downward trend in the hepatotoxicity group suggesting further suggesting its diagnostic potential in hepatotoxicity group. Another protein i.e., serum amyloid P-component (APCS) showed a similar trend in all the three study groups (Group 1, 2 and 3) except in the TB with hepatotoxicity group (Group 4), where its level showed a significant decrease. Serum amyloid P-component (APCS gene) along with CRP are classic pentraxin proteins. APCS protein plays a crucial role in regulating the matrix formation and provides resistance to infectious agents including mycobacteria36,37,38. One targeted proteomics study using extracellular vesicles highlights that levels of APCS and pro-platelet basic protein were significantly decreased in liver fibrosis progression, patients with liver cirrhosis, acute hepatitis and non-alcoholic fatty liver disease patients39. However, no reports are avaibale where levels of APCS have been correlated with anti-tubercular hepatotoxicity. Based on its specific and significant decrease in hepatotoxicity group and being a critical protein in combating the infection, APCS protein was shortlisted for further validation by ELISA. The validation phase data correlated well with the discovery phase suggesting its biomarker potential in differentiating anti-tubercular drug-induced liver injury from other defined study groups. Henceforth, our resuts suggest APSC as a potential candidate protein to further explore its role in anti-tubercular drug induced liver injury. This study can further be strengthened by validating other essential signature proteins (PROS1, KNG1, CFH, LCAT and ADIPOQ) in larger cohort which have shown significant differential expression in the TB group with hepatotoxicity as compared to without hepatotoxicity group. Also, the present study is a cross-sectional study that can be redesigned to a randomized one to diagnose the transition of markers in different stages of the disease.

Conclusion

Serum proteomic analysis revealed significant changes in the expression profile of signature proteins in TB with hepatotoxicity patients versus TB without hepatotoxicity patients. An interplay among differentially regulated proteins highlighted their involvement in complement and coagulation cascades and cholesterol metabolism, which might play a significant role in AT-DILI in TB patients. Among differentially expressed proteins including PROS1, KNG1, CFH (upregulated) and APIPOQ, APCS and LCAT (downregulated), APCS was shortlisted for validation based on functional relevance with hepatotoxicity. The protein showed high discriminative ability with 100% sensitivity and 87% specificity. Henceforth, our results suggest APCS as a potential candidate protein which further can be explored to study its potential role in anti-tubercular drug induced liver injury.

Methods

Protein quantification was performed by Pierce™ BCA Protein Assay Kit (Thermo Fisher Scientific, USA), and high abundant protein depletion was achieved by Aurum™ Serum Protein Mini Kit (Bio-Rad). Trypsin (Promega gold standard), Guanidine Hydrochloride (GnHCl), Dithiothreitol (DTT), iodoacetic acid (IAA), and trifluoroacetic acid (TFA) were procured from Sigma-Aldrich while LC–MS Grade water and Pierce™ C18 Spin columns (Cat. 89870) were procured from Thermo Fisher Scientific Inc, USA. Protein ladder 10–250 kDa (Precision Plus Protein™ Dual Color Standards) was purchased from BioRad.

Study subjects

The Institute Ethics Committee (IEC) of Postgraduate Institute of Medical Education and Research (PGIMER), Chandigarh reviewed and approved the study protocol with ref. no. IEC No. 10/2014-72 and NK/5864/MSc/137 and all procedures were performed as per Declaration of Helsinki guidelines. Study subjects were recruited, and blood samples were withdrawn from patients attending medical OPD and DOTS Center, PGIMER, Chandigarh, considering the inclusion and exclusion criteria of the study. Briefly, patients of either sex with any type of TB in the age range of 15–65 years were enrolled. The subjects were divided into four groups i.e., Healthy control subjects having no signs and symptoms of TB and were free from any liver disease (Group 1), newly diagnosed TB subjects not on TB treatment (Group 2), TB patients on antitubercular drug therapy not having hepatotoxicity (Group 3), and TB patients on anti-TB treatment and having hepatotoxicity (Group 4) (ALT or AST ≥ three-fold rise). Groups 3 and 4 received the same anti-TB drugs (essentially with RIF, PZA and INH);. Along with biochemical investigations, chest X-ray, sputum test for AFB positivity, Montoux test or microscopic examination for TB confirmation was assessed by concerned physician attending chest or medical clinic. Height and weight were also measured for BMI calculation using formula: weight (kg)/height (m2). Patients with human immunodeficiency virus infection, viral hepatitis, simultaneous consumption of other potentially hepatotoxic medications like methotrexate, phenytoin, fluconazole and valproate etc., chronic liver disease, long-term alcoholics and pregnancy were excluded40.

Blood sample collection and categorization of study subjects

Serum was separated from the collected blood samples by centrifugation at 1500 g × 10 min and kept at − 80 °C until further analysis. Biochemical parameters including ALT, AST, ALP, TBIL, GGT and total protein levels were estimated in serum of recruited study subjects on Roche/Hitachi Cobas c-8000 system. Values ≥ three times the upper limit of normal (ULN) of ALT, AST and TBIL were considered as criteria of hepatotoxicity41. After screening, 40 patients were included, categorized, and distributed equally in defined study groups. The demographic data of recruited patients in the respective study groups are summarized in Table 2. The individual serum samples of each representative group were pooled by considering that an equal amount of protein was taken from each sample. The protein concentration was measured by by Nanodrop One Microvolume UV–Vis spectrophotometer (Thermo Scientific™) and by BCA assay using bovine serum albumin as standard. Serum dilutions (1:100) were prepared using distilled water.

Depletion of high abundant proteins and top–down assessment by SDS-PAGE

Two highly abundant proteins (albumin and immunoglobulin G) were removed using a serum-aurum kit as per the manufacturer’s protocol. Briefly, 60 µl of each pooled serum sample was diluted with 180 µl of Aurum serum protein binding buffer and was placed over the resin bed of the column. The column was vortexed and incubated for 15 min., centrifuged at 10,000 × g for 20 s, and depleted fractions were collected. The protein concentration thus obtained from depleted fractions was quantified by BCA assay. For a gross assessment of efficient depletion, 10 µg protein /gel lane was separated on 12% SDS PAGE, followed by coomassie blue staining to visualize protein bands.

In-solution digestion

For LC–MS/MS analysis, 20–25 µg of the depleted protein samples suspended in the binding buffer of the serum-aurum kit was acetone precipitated (3 h at − 20 °C) and centrifuged at 22,000 × g at 4 °C for 10 min. The obtained pellet was air-dried and re-suspended in hot (90 °C) 6 M guanidine hydrochloride (GnHCl)/0.1 M Tris solution. Freshly prepared 10 mM DTT was added as a reducing agent followed by incubation at 65 °C for 45 min. in the dark. Then, 50 mM freshly prepared IAA was added as an alkylating agent and incubated in the dark for 45 min. Samples were diluted with LC–MS grade water to achieve 0.6 M GnHCl concentration, and pH was maintained between 8 and 8.5. The protein mixture was digested with trypsin at an enzyme-to-protein ratio of 1:20 (w/w) to generate peptides and kept at 37 °C for 16 h. Trypsin activity was quenched by adding 10% TFA (2–3 µl), and pH was maintained at 2. The peptide samples were purified and concentrated using the Pierce® C18 Spin columns as per the manufacturer’s instructions. After clean-up, the peptide mixture was quantified by measuring absorption at 280 nm using Nanodrop.

LC–MS/MS analysis

Depleted serum fractions of each group were run in duplicates on Orbitrap Fusion Tribrid Mass Spectrometer (Thermo Fisher Scientific). Peptide samples (~ 1ug) were loaded on reverse phase EASY-Spray Acclaim PepMap C18 Column (15 cm × 75 μm I.D., packed with C18 resin, three μm particle size, 100 Å; Thermo Scientific, Part No ES800), coupled to Easy NanoLC 1200 system (Thermo Fisher Scientific, San Jose, US). The peptide separation was achieved with a constant flow rate of 300 nl/min using a linear gradient of 5–95% solvent B (80% ACN in 0.1% formic acid) with solvent A (2% ACN in 0.1% formic acid) for 95–100 min. MS master scan was acquired at Orbitrap mass analyzer in the m/z range of 375–1600 with a mass resolution of 120,000 at m/z 200. Data-dependent MS/MS acquisition was achieved by IonTrap mass analyzer with 3s as cycle time. Quadrupole with an isolation window of 1.2 m/z was selected as an isolation mode. AGC target was set to 5000, and the maximum injection time was 45 ms. The precursor ion fragmentation with charge states 2–5 was achieved using higher-energy collisional dissociation at 30% collision energy. The lock mass option was enabled for real-time calibration using polycyclodimethylsiloxane (m/z, 445.12) ions.

Data analysis

The raw files generated for each sample were analyzed by Proteome Discoverer 2.5 (Thermo Scientific, Bremen, Germany) for protein identification. Database searches were carried out using Sequest HT algorithms with percolator (Strict FDR = 0.01 and relaxed FDR = 0.05) against UniProt Human database (ID: UP000005640, 96,807 sequences, last modified on 3/9/2020). The precursor mass range of 350–5000 Da and S/N threshold was set as 1.5 for generating the peak list file. The parameters set for database searches include trypsin as a protease with two missed cleavages and a minimum peptide length of 6 aa, Oxidation/+15.995 Da (M) as dynamic oxidation and Carbamidomethyl/+57.021 Da (C) as static Modification. Precursor Mass Tolerance of 10 ppm and fragment Mass Tolerance of 0.6 Da were allowed. Calculation based on the score was used as peptide validator with target FDR (False discovery rate) for Peptide-Spectrum Matches (PSMs) as 1% (Strict) and 5% (Relaxed), and similar target FDR criteria were set for peptides as general validation settings. Peptide and protein filters were set so that peptides with at least high confidence and proteins with a minimum of one peptide sequence were allowed, and the strict parsimony principle was true for protein grouping. Peptides selected for quantification were Unique + Razor. Precursor quantification was based on precursor intensity which was used as precursor abundance value. Generated data was normalized with normalization mode selected as the total peptide amount. Protein abundance calculation was based on summed abundances.

Statistical and bioinformatic analysis

For differential expression, the relative abundance ratio of ≥ two-fold was used for up-regulation and ≤ 0.5-fold abundance ratio for down-regulation, where protein ratio calculation was protein abundance-based. p-value was calculated by applying ANOVA (Individual proteins) with Tukey HSD posthoc analysis as a hypothetical test, and a value < 0.05 was considered a significant abundance ratio. The volcano plots and heatmaps of differentially expressed proteins were exported from Proteome Discoverer 2.5 (Thermo Scientific, Bremen, Germany). The statistically significant differentially expressed proteins were analyzed by STRING 11.0 for visualization of interactome among the proteins42. Moreover, the protein–protein interactions was followed by STRING in-built Gene Ontology (including biological processes and cellular components) and KEGG pathway for functional annotation and enrichment analysis43,44. For Gene ontology analysis, fisher’s exact test was selected to calculate the p-value with a cutoff of < 0.001 as statistical criteria. FDR was calculated as test correction for functional enrichment, and values were represented as − log10 FDR. While for KEGG pathway anaylsis, FDR was calculated according to Benjamini and Hochberg’s test.

ELISA of serum amyloid P component protein and receiver operating characteristic analysis

The shortlisted Serum amyloid P component (APCS) protein from TB with hepatotoxicity versus without hepatotoxicity group were chosen for validation using ELISA kit (Elabscience): E-EL-H1279. Briefly, the collected serum samples (8 TB patients with hepatotoxicity and without hepatotoxicity groups, respectively) were diluted in the sample diluent provided with the kit following the manufacturer’s instructions and immunodetection was achieved. The absorbance was read at 450 nm in a Tecan—Microplate Reader—Infinite 200 Pro (DKSH Holding Ltd.). The diagnostic accuracy of APCS protein was evaluated by the receiver operator characteristics (ROC) curve analysis. The cut-off values were evaluated by calculating the Youden Index in order to maximize the sensitivity and specificity. Statistical analysis (Mann–Whitney) for ELISA was performed using GraphPad Prism 8.0.2.

Ethics approval

The institutional ethical committee (IEC) approved the study with ref No. (IEC No. 10/2014-72 and NK/5864/MSc/137.

Consent to participate

Informed consent was obtained from all the participants included in the study.