Keywords
Androgens, AR, prostate cancer, alternative splicing, alternative promoters, alternative 3' ends, transcription, mRNA isoforms
Androgens, AR, prostate cancer, alternative splicing, alternative promoters, alternative 3' ends, transcription, mRNA isoforms
A single human gene can potentially yield a diverse array of alternative mRNA isoforms, thereby expanding both the repertoire of gene products and subsequently the number of alternative proteins produced. mRNAs with different exon combinations are transcribed from most (up to 90%) human genes, and can generate variants that differ in regulatory untranslated regions, or encode proteins with different sub-cellular localisations and functions1–5. Altered splicing patterns have been suggested as a new hallmark of cancer cells6–8, and in prostate cancer there is emerging evidence that expression of specific mRNA isoforms derived from cancer-relevant genes may contribute to disease progression9–11.
Androgen steroid hormones and the androgen receptor (AR) play a key role in the development and progression of prostate cancer, with alternative splicing enabling cancer cells to produce constitutively active ARs11–13. The AR belongs to the nuclear receptor superfamily of transcription factors, and is essential for prostate cancer cell survival, proliferation and invasion14–16. Classically, androgen binding promotes AR dimerization and its translocation to the nucleus, where it acts as either a transcriptional activator or a transcriptional repressor to dictate prostate specific gene expression patterns17–23. The major focus for prostate cancer therapeutics has been to reduce androgen levels through androgen deprivation therapy (ADT), either with inhibitors of androgen synthesis (for example, abiraterone) or with antagonists that prevent androgen binding to the AR (such as bicalutamide or enzalutamide)24. Although ADT is usually initially effective, most patients ultimately develop lethal castrate resistant disease for which there are limited treatment options11,12.
Androgens and other steroid hormones have also been associated with alternative splicing. Recent RNA-sequencing-based analysis of the androgen response of prostate cancer cells grown in vitro and within patients following ADT identified a set of 700 genes whose transcription is regulated by the AR in prostate cancer cells25. However, in addition to regulating transcriptional levels, steroid hormone receptors can control exon content of mRNA10,26–29. In prostate cancer androgens can modulate the expression of mRNA isoforms via pre-mRNA processing and promoter selection9,10,18,30. The AR can recruit the RNA binding proteins Sam68 and p68 as cofactors to influence alternative splicing of specific genes, and studies using minigenes driven from steroid responsive promoters indicate that the AR can affect both the transcriptional activity and alternative splicing of a subset of target genes11,31,32. Other steroid hormones also coordinate both transcription and splicing decisions29. The thyroid hormone receptor (TR) is known to play a role in coordinating the regulation of transcription and alternative splicing27, and the oestrogen receptor (ER) can both regulate alternative promoter selection and induce alternative splicing of specific gene sets that can influence breast cancer cell behaviour28,33–35.
In previous work we used exon level microarray analysis to identify 7 androgen dependent changes in mRNA isoform expression10. However, to what extent androgen-regulated mRNA isoforms are expressed in clinical prostate cancer is unclear. To address this, here we have used RNA-Sequencing data to globally profile alternative isoform expression in prostate cancer cells exposed to androgens, and correlated the results with transcriptomic data from clinical tissue. Our findings increase the number of known AR regulated mRNA isoforms by 10 fold and imply that pre-mRNA processing is an important mechanism through which androgens regulate gene expression in prostate cancer.
Cell culture was as described previously25,36. All cells were grown at 37°C in 5% CO2. LNCaP cells (CRL-1740, ATCC) were maintained in RPMI-1640 with L-Glutamine (PAA Laboratories, R15-802) supplemented with 10% Fetal Bovine Serum (FBS) (PAA Laboratories, A15-101). For androgen treatment of cells, medium was supplemented with 10% dextran charcoal stripped FBS (PAA Laboratories, A15-119) to produce a steroid-deplete medium. Following culture for 72 hours, 10 nM synthetic androgen analogue methyltrienolone (R1881) (Perkin-Elmer, NLP005005MG) was either added (Androgen +) or absent (Steroid deplete) for the times indicated.
RNA-seq transcript expression analysis of previously generated data25 was performed according to the Tuxedo protocol37. All reads were first mapped to human transcriptome/genome (build hg19) with TopHat38/Bowtie39, followed by per-sample transcript assembly with Cufflinks40. The mapped data was processed with Cuffmerge, Cuffdiff and Cuffcompare, followed by extraction of significantly differentially expressed genes/isoforms; expression changes between cells grown with androgen and cells grown without androgens were assessed. Reference files for the human genome (UCSC build hg19) were downloaded from the Cufflinks pages: (UCSC-hg19 package from June 2012 was used.). The software versions used for the analysis were: TopHat v1.4.1, SAM tools Version: 0.1.18 (r982:295), bowtie version 0.12.8 (64-bit) and cufflinks v1.3.0 (linked against Boost version 104000). The Tuxedo protocol37 was carried out as follows: For steps 1–5, no parameters (except for paths to input/output files) were altered. In step 15, additional switches -s, -R, and -C were used when running cuffcompare. Steps 16–18 (extraction of significant results) were performed on the command line.
Cells were harvested and total RNA extracted using TRIzol (Invitrogen, 15596-026) according to manufacturer's instructions. RNA was treated with DNase 1 (Ambion, AM2222) and cDNA was generated by reverse transcription of 500ng of total RNA using the Superscript VILO cDNA synthesis kit (Invitrogen, 11754-050). Alternative events were analysed by either reverse transcriptase PCR or real-time PCR. Exon profiles were monitored and quantified using the Qiaxcel capillary electrophoresis system (Qiagen) and percentage inclusion was calculated as described previously10. Real time PCR was performed in triplicate on cDNA using SYBR® Green PCR Master Mix (Invitrogen, 4309155) and the QuantStudio 7 Flex Real-Time PCR System (Thermo Fisher Scientific). Samples were normalised using the average of three reference genes, GAPDH, β -tubulin and actin. Ct values for each sample were calculated using SDS 2.4 software (Applied Biosystems) and relative mRNA expression was calculated using the 2-ΔΔCt method. All primer sequences are listed in Supplementary Table 1. Raw Ct values are given in Dataset 141.
The following commercial antibodies were used in the study: anti-RLN2 rabbit monoclonal (Abcam, ab183505 1:1000 dilution), anti-TACC2 rabbit polyclonal antibody (11407-1-AP, Proteintech 1:500 dilution), anti-NDUFV3 rabbit polyclonal antibody (13430-1-AP, Proteintech 1:500 dilution), anti-actin rabbit polyclonal (A2668, Sigma 1:2000 dilution), anti-α-Tubulin mouse monoclonal (Sigma, T5168 1:2000 dilution), normal rabbit IgG (711-035-152, Jackson labs 1:2000 dilution) and normal mouse IgG (715-036-150, Jackson labs 1:2000 dilution).
Gene ontology (GO) analysis of RNA-Seq data was carried out as described previously42. Enrichment of GO terms (with b500 annotations) was calculated using the goseq R package (version 1.18.0). Genes were considered significant at a p-value threshold of 0.05 after adjustment using the Benjamini-Hochberg false discovery rate.
Available clinical and processed RNA-Seq data from The Cancer Genome Atlas (TCGA) prostate adenocarcinoma (PRAD) cohort, comprising 497 tumour samples from as many patients with different stages / Gleason grades and 52 matched samples taken from normal prostate tissue (were downloaded from the Broad Institute TCGA Genome Analysis Center (Firehose 16/01/28 run https://doi.org/10.7908/C11G0KM943). Transcriptome data from the TCGA PRAD cohort were analysed for alternative isoform expression, with transcript models relying on TCGA GAF2.1, corresponding to the University of California, Santa Cruz (UCSC) genome annotation from June 2011 (hg19 assembly). This annotation encompassed 42 of the 73 androgen-regulated alternative mRNA isoform pairs identified. These were studied using two types of analysis: 1) differential transcript expression between tumour and normal prostate tissue and 2) correlation between isoform expression in tumour samples and Gleason score or tumour stage.
Differential isoform and gene expression analysis was performed on estimated read counts using the limma software R package (version 3.7) following its RNA-Seq analysis workflow44. This workflow was also used for differential isoform ratio analysis, relying on logit-transformed ratio (see below). An FDR-adjusted p-value of 0.05 for the moderated t-statistics was used as threshold for significance of differential expression. Individual isoform expression was estimated in TPM (transcripts per million mapped reads). The expression ratio, henceforth called PSI (percent spliced-in), of each annotated androgen-regulated isoform pair in each TCGA sample was calculated as the ratio between the expression of isoform 1 and the total expression of isoforms 1 and 2 combined, i.e. the sum of their expressions. For each isoform pair, ΔPSI is the difference of median PSI between the tumour and the normal groups of samples.
Two-tailed Spearman’s rank correlation tests were used to study the association between isoform expression and both Gleason score and tumour stage (these were used herein as numeric variables). An FDR-adjusted p-value of 0.05 was used as threshold for significance. Isoform expression differences between tumour and normal samples were considered equivalent to those detected in LNCaP cells under androgen stimulation when there was a statistically significant consistent change in the levels of the expected induced or repressed isoform (1 or 2), concomitant with no contradictory change in the PSI. Isoform “switches” were considered equivalent when there was a minimum (ΔPSI > 2.5%) and statistically significant consistent change in the PSI. Equivalent criteria were used to evaluate the equivalence between androgen-dependence and the associations with Gleason score and tumour stage.
Statistical analyses were conducted using the GraphPad Prism software (version 5.04/d). PCR quantification of mRNA isoforms was assessed using the unpaired student’s t-test.
Data is presented as the mean of three independent samples ± standard error of the mean (SEM). Statistical significance is denoted as * p ≤ 0.05, ** p ≤ 0.01, *** p ≤ 0.001 and **** p ≤ 0.0001.
We analysed previously published RNAseq data from LNCaP cells25 to globally profile how frequently androgens drive production of alternative mRNA isoforms in prostate cancer cells. This analysis identified a group of 73 androgen regulated alternative mRNA isoforms, which could be validated by visualisation on the UCSC Genome Browser45 (Table 1). 64 AR regulated mRNA isoforms were novel to this study. Experimental validation in an independent RNA sample set using RT-PCR confirmed 17/17 of these alternative events at the mRNA level (Supplementary Figure 1). 73% of genes (53/73) with identified alternative androgen regulated mRNA isoforms also changed their overall expression levels in response to androgens (Table 2). Some of the androgen regulated alternative events are in genes are already implicated in in either prostate cancer or other cancer types (summarised in Table 3). However, Gene Ontology analysis of these 73 genes did not identify any significantly enriched biological processes.
Isoform 1 | Isoform 2 | TCGA PRAD | |||||||||
---|---|---|---|---|---|---|---|---|---|---|---|
Gene | Event type | Position (hg19) | RefSeq | Position (hg19) | RefSeq | Change with androgens | PCR Validation | Predicted to change protein? | Isoform 1 ID | Isoform 2 ID | Comparable? |
LIG4 | Alternative promoter | chr13:108859792- 108870716 | NM_001098268.1 | chr13:108859792- 108867130 | NM_002312.3 | Induction of promoter 2 | Yes (Qiaxel) | No (5' UTR) | uc001vqp.2 | uc001vqn.2 | Yes |
TACC2 | Alternative promoter | chr10:123748689- 124014060 | NM_206862.3 | chr10:123872554- 124014060 | NM_001291879.1 | Repression of promoter 1 | Yes (Qiaxel) | Yes | uc001lfv.2 | uc001lfx.2 | Yes |
TPD52 | Alternative promoter | chr8:80947103- 81083894 | NM_001287144.1 | chr8:80947103- 80993066 | NM_001025252.2 | Induction of promoter 2 | Yes (Qiaxel) | Yes | uc003ybs.1 | uc003ybr.1 | Yes |
NUP93 | Alternative promoter | chr16:56764017- 56878861 | NM_014669.4 | chr16:56815704- 56878861 | NM_001242795.1 | Induction of promoter 1 | Yes (SYBR) | Yes | uc002eka.2 | uc002ekb.2 | Yes |
RLN1 | Alternative promoter | chr9:5334932- 5339873 | NM_006911.3 | chr9:5335270- 5339396 | Not annotated | Repression of promoter 2 | Yes (Qiaxel) | Yes (change from non- coding) | uc003zjb.1 | Not annotated | No |
AP2S1 | Alternative promoter | chr19:47341415- 47354252 | NM_001301078.1 | chr19:47341415- 47353547 | NM_001301076.1 | Induction of promoter 2 | Yes (SYBR) | Yes | uc002pft.1 | Not annotated | No |
RLN2 | Alternative promoter | chr9:5299866- 5304611 | NM_005059.3 | chr9:5299890- 5304222 | Not annotated | Induction of promoter 1 | Yes (Qiaxel) | Yes (change from non- coding) | uc003ziz.1 | Not annotated | No |
PIK3R1 | Alternative promoter | chr5:67511584- 67597649 | NM_181523.2 | chr5:67584252- 67597649 | NM_181524.1 | Repression of promoter 2 | Yes (SYBR) | Yes | uc003jva.2 | uc003jvc.2 | Yes |
MAPRE2 | Alternative promoter | chr18:32556892- 32723432 | NM_001143826.2 | chr18:32621324- 32723432 | NM_014268.3 | Switch to promoter 2 | Yes (Qiaxel) | Yes | uc010xcb.1 | uc002kyf.2 | Yes |
NDUFAF4 | Alternative promoter | chr6:97337187- 97345767 | NM_014165.3 | chr6:97337227- 97345368 | Not annotated | Repression of promoter 2 | Yes (Qiaxel) | Yes (change from non- coding) | uc003pov.2 | Not annotated | No |
DCXR | Alternative promoter | chr17:79993757- 79995573 | NM_016286.3 | chr17:79993765- 79995217 | Not annotated | Repression of promoter 2 | Yes (Qiaxel) | Yes | uc002kdg.2 | Not annotated | No |
PEX10 | Alternative promoter | chr1:2336241- 2344010 | NM_002617.3 | Not annotated | Switch to promoter 2 | Yes (Qiaxel) | Yes | uc001ajh.2 | Not annotated | No | |
SNAPC2 | Alternative promoter | chr19:7985194- 7988136 | NM_003083.3 | chr19:7985867- 7988136 | NR_030717.1 | Switch to promoter 2 | Yes (SYBR) | Yes (change to non- coding) | uc002miw.1 | uc002mix.1 | Yes |
ATP6V0D1 | Alternative promoter | chr16:67471917- 67515089 | NM_004691.4 | chr16:67471931- 67475338 | Not annotated | Repression of promoter 2 | Yes | uc002ete.1 | Not annotated | No | |
ARRDC1 | Alternative promoter | chr9:140500092- 140509812 | NM_001317968.1 | chr9:140506874- 140509793 | Not annotated | Induction of promoter 2 | Yes (change to non- coding) | uc004cnp.1 | Not annotated | No | |
DENND1A | Alternative promoter | chr9:126141933- 126692417 | NM_020946.1 | chr9:126143408- 126586780 | Not annotated | Repression of promoter 2 | Yes | uc004bnz.1 | Not annotated | No | |
KLHL36 | Alternative promoter | chr16:84682117- 84701292 | NM_024731.3 | chr16:84684274- 84701134 | Not annotated | Induction of promoter 2 | Yes | uc002fig.2 | Not annotated | No | |
RAB3IL1 | Alternative promoter | chr11:61664768- 61687741 | NM_001271686.1 | chr11:61664768- 61685081 | NM_013401.3 | Repression of promoter 2 | Yes | uc001nsp.2 | uc001nso.2 | Yes | |
ACER3 | Alternative promoter | chr11:76571917- 76737841 | NM_018367.6 | chr11:76631206- 76737818 | Not annotated | Repression of promoter 2 | Yes | uc009yum.1 | Not annotated | No | |
OSBPL1A | Alternative promoter | chr18:21742011- 21977833 | NM_080597.3 | chr18:21742011- 21852196 | NM_018030.4 | Induction of promoter 2 | Yes | uc002kve.2 | uc002kvd.2 | Yes | |
TRIM16 | Alternative promoter | chr17:15531280- 15586193 | NM_006470.3 | chr17:15530970- 15555735 | Not annotated | Induction of promoter 2 | Yes | uc002gow.2 | Not annotated | No | |
VSIG10L | Alternative promoter | chr19:51834795- 51845378 | NM_001163922.1 | chr19:51834795- 51843009 | Not annotated | Induction of promoter 1 | Yes | uc002pwf.2 | Not annotated | No | |
SEPT5 | Alternative promoter | chr22:19701987- 19710845 | NM_002688.5 | chr22:19705958- 19710845 | NM_001009939.2 | Repression of promoter 2 | Yes | uc002zpv.1 | uc002zpw.1 | Yes | |
HMGCR | Alternative promoter | chr5:74632154- 74657926 | NM_000859 | chr5:74632993- 74657926 | NM_000859.2 | Repression of promoter 1 | Yes | uc011cst.1 | uc003kdp.2 | Yes | |
RDH13 | Alternative promoter | chr19:55555692- 55580914 | NM_138412.3 | chr19:55555692- 55574585 | NM_001145971.1 | Induction of promoter 1 | Yes | uc002qip.2 | uc010esr.1 | Yes | |
GPRIN2 | Alternative promoter | chr10:46993001- 47000677 | Not annotated | chr10:46993546- 47000568 | NM_014696.3 | Repression of promoter 2 | No (5' UTR) | Not annotated | uc001jec.2 | No | |
CLK3 | Alternative promoter | chr15:74900713- 74922542 | NM_003992.4 | chr15:74,908,246- 74,922,542 | NM_003992 | Repression of promoter 1 | Yes | uc002ayg.3 | uc002ayj.3 | Yes | |
RNH1 | Alternative promoter | chr11:494512- 507283 | NM_203387.2 | chr11:494512- 506821 | NM_002939.3 | Induction of promoter 1 | No (5' UTR) | uc001lpp.1 | uc001lpl.1 | Yes | |
ZFAND6 | Alternative promoter | chr15:80351910- 80430735 | NM_001242911.1 | chr15:80364903- 80430735 | NM_001242916.1 | Repression of promoter 2 | No (5' UTR) | uc002bff.1 | uc002bfh.1 | Yes | |
CDIP1 | Alternative promoter | chr16:4560677- 4588816 | NM_013399.2 | chr16:4560677- 4588471 | NM_001199054.1 | Repression of promoter 2 | No (5' UTR) | uc002cwu.2 | uc002cwv.2 | Yes | |
YIF1B | Alternative promoter | chr19:38794200- 38806606 | NM_001039672.2 | chr19:38794200- 38806445 | NM_001145461.1 | Switch to promoter 2 | Yes | uc002ohz.2 | uc002ohx.2 | Yes | |
LIMK2 | Alternative promoter | chr22:31608250- 31676066 | NM_005569.3 | chr22:31644348- 31676066 | NM_016733.2 | Switch to promoter 2 | Yes | uc003akh.2 | uc003aki.2 | Yes | |
TSC22D3 | Alternative promoter | chrX:106956452- 106959711 | NM_001015881.1 | chrX:106956452- 106960291 | NM_004089.3 | Repression of promoter 1 | Yes | uc004enf.2 | uc004eng.2 | Yes | |
ALDH1A3 | Alternative promoter | chr15:101419897- 101456830 | NM_000693.3 | chr15:101438281- 101457072 | Not annotated | Repression of promoter 1 | Yes | uc002bwn.3 | Not annotated | No | |
TRABD | Alternative promoter | chr22:50624341- 50638028 | NM_001320485.1 | chr22:50628979- 50638028 | NM_001320487.1 | Switch to promoter 2 | No (5' UTR) | uc003bjq.1 | uc003bjs.1 | Yes | |
LIMCH1 | Alternative promoter | chr4:41361624- 41702061 | NM_001289124.1 | chr4:41362648- 41702061 | NM_001289122.2 | Repression of promoter 2 | Yes | uc003gvu.3 | Not annotated | No | |
GMFB | Alternative promoter | chr14:54941209- 54955744 | NM_004124.2 | chr14:54941314- 54955637 | Not annotated | Induction of promoter 2 | Yes (change to non- coding) | uc010tqz.1 | Not annotated | No | |
MLST8 | Alternative promoter | chr16:2255178- 2259418 | NM_022372.4 | chr16:2255732- 2259418 | NM_001199174.1 | Switch to promoter 1 | No (5' UTR) | uc010uvy.1 | uc002cpf.2 | Yes | |
TLE3 | Alternative promoter | chr15:70340130- 70390256 | NM_020908.2 | chr15:70340130- 70387124 | NM_001282982.1 | Induction of promoter 2 | Yes | uc002asn.2 | uc002ask.2 | Yes | |
UBA1 | Alternative promoter | chrX:47050199- 47074527 | NM_153280.2 | chrX:47053201- 47074527 | NM_003334.3 | Repression of promoter 1 | No (5' UTR) | uc004dhj.3 | uc004dhk.3 | Yes | |
TNRC6B | Alternative promoter | chr22:40440821- 40731812 | NM_001024843.1 | chr22:40573929- 40731812 | NM_001162501.1 | Repression of promoter 2 | Yes | uc003aym.2 | uc011aor.1 | Yes | |
FDFT1 | Alternative promoter | chr8:11660120- 11696818 | NM_004462.4 | chr8:11665926- 11696818 | NM_001287750.1 | Repression of promoter 2 | Yes | uc003wui.2 | uc010lsb.2 | Yes | |
GREB1 | Alternative promoter | chr2:11674242- 11782912 | NM_014668.3 | chr2:11680080- 11728355 | NM_148903.2 | Induction of promoter 2 | Yes | uc002rbo.1 | uc002rbl.2 | Yes | |
NCAPD3 | Alternative promoter | chr11:134022337- 134094426 | NM_015261.2 | chr11:134022772- 134093593 | Not annotated | Induction of promoter 2 | Yes | uc001qhd.1 | Not annotated | No | |
SLC36A4 | Alternative promoter | chr11:92877337- 92931141 | NM_152313.3 | chr11:92877337- 92930621 | NM_001286139.1 | Induction of promoter 2 | Yes | uc001pdn.2 | Not annotated | No | |
KLC2 | Alternative promoter | chr11:66024765- 66035331 | NM_001134775.1 | chr11:66025174- 66035331 | NM_022822.2 | Repression of promoter 1 | No (5' UTR) | uc010rov.1 | uc001ohb.2 | Yes | |
RAP1GAP | Alternative promoter | chr1:21922708- 21978348 | NM_001145658.1 | chr1:21922533- 21946950 | Not annotated | Repression of promoter 1 | Yes | uc001bez.1 | Not annotated | No | |
TMEM79 | Alternative promoter | chr1:156252704- 156262234 | NR_026678.1 | chr1:156254070- 156262234 | NM_032323.2 | Repression of promoter 1 | No (5' UTR) | uc001fod.2 | uc010phi.1 | Yes | |
NR4A1 | Alternative promoter | chr12:52416616- 52453291 | NM_001202233.1 | chr12:52445186- 52453291 | NM_173157.2 | Induction of promoter 2 | Yes | uc010sno.1 | uc001rzr.2 | Yes | |
ZNF32 | Alternative promoter | chr10:44139307- 44144326 | NM_001324166.1 | chr10:44139307- 44144326 | NM_001324167.1 | Repression of promoter 2 | No (5' UTR) | uc001jbc.2 | uc001jbb.2 | Yes | |
C1QTNF3 | Alternative promoter | chr5:34017963- 34043371 | NM_181435.5 | chr5:34018571- 34035881 | Not annotated | Induction of promoter 1 | Yes | uc003jio.2 | Not annotated | No | |
UBE2D3 | Alternative promoter | chr4:103715540- 103748710 | NM_181887.2 | chr4:103715540- 103749105 | NM_181886.3 | Switch to promoter 2 | No (5' UTR) | uc003hwk.2 | uc011cet.1 | Yes | |
KRT8 | Alternative promoter | chr12:53290971- 53343650 | NM_001256293.1 | chr12:53,290,971- 53,298,868 | NM_002273 | Repression of promoter 1 | No (5' UTR) | uc009zml.1 | uc001sbd.2 | Yes | |
ELOVL1 | Alternative promoter | chr1:43829068- 43833745 | NM_022821.3 | chr1:43829093- 43832057 | Not annotated | Induction of promoter 2 | Yes (change to non- coding) | uc001cjb.2 | Not annotated | No | |
RCAN1 | Alternative promoter | chr21:35888740- 35987441 | NM_004414.6 | chr21:35888740- 35899308 | NM_203418.2 | Induction of promoter 2 | Yes | uc002yue.2 | uc002yub.2 | Yes | |
SORBS3 | Alternative promoter | chr8:22409251- 22433008 | NM_005775.4 | chr8:22422332- 22433100 | Not annotated | Induction of promoter 2 | Yes | uc003xbv.2 | Not annotated | No | |
MAT2A | Alternative 3' end | chr2:85766101- 85772403 | NM_005911.5 | chr2:85,766,101- 85,770,775 | NM_005911 | Repression of isoform 2 | Yes (Qiaxel) | Yes | uc002spr.2 | uc010ysr.1 | Yes |
CNNM2 | Alternative 3' end | chr10:104678075- 104687375 | NM_199077.2 | chr10:104678075- 104838344 | NM_017649.4 | Induction of isoform 1 | Yes (SYBR) | Yes | uc001kwl.2 | uc001kwm.2 | Yes |
TMEM125 | Alternative 3' end | chr1:43735698- 43736343 | Not annotated | chr1:43735665- 43739673 | NM_144626.2 | Induction of isoform 1 | Yes (change to non- coding) | Not annotated | uc001cir.2 | No | |
CBWD2 | Alternative 3' end | chr2:114195268- 114253781 | NM_172003.3 | chr2:114195169- 114199073 | Not annotated | Induction of isoform 2 | Yes | uc002tju.2 | Not annotated | No | |
NDUFV3 | Alternative exon | chr21:44313378- 44329773 | NM_021075.3 | chr21:44313378- 44329773 | NM_001001503.1 | Switch to isoform 2 (exon excluded) | Yes | uc002zcm.2 | uc002zcn.2 | Yes | |
ZNF678 | Alternative exon | chr1:227751220- 227850164 | NM_178549.3 | Not annotated | Switch to isoform 2 (exon excluded) | Yes (change to non- coding) | uc009xet.1 | Not annotated | No | ||
ZNF121 | Alternative exon | chr19:9676404- 9695209 | NM_001308269.1 | chr19:9676404- 9695209 | NM_001008727.3 | Switch to isoform 2 (exon excluded) | Yes | uc010xkq.1 | uc010xkp.1 | Yes | |
SPATC1L | Alternative exon | chr21:47581062- 47604373 | NM_032261.4 | Not annotated | Induction of isoform 2 (exon included) | Yes | uc002zii.2 | Not annotated | No | ||
MOCOS | Alternative exon | chr18:33767480- 33848685 | NM_017947.2 | Not annotated | Switch to isoform 2 (exon excluded) | Yes | uc002kzq.3 | Not annotated | No | ||
RBM45 | Alternative exon | chr2:178977151- 178994382 | NM_152945.3 | Not annotated | Switch to isoform 2 (exon included) | Yes | uc002ulv.2 | Not annotated | No | ||
MIPEP | Alternative exon | chr13:24304328- 24463587 | NM_005932.3 | Not annotated | Repression of isoform 2 (exon excluded) | Yes | uc001uox.3 | Not annotated | No | ||
BBS4 | Alternative exon | chr15:72978520- 73030817 | NM_001320665.1 | Not annotated | Induction of isoform 2 (exon included) | Yes | uc002avb.2 | Not annotated | No | ||
FAM195A | Alternative exon | chr16:691804- 698474 | NM_138418.3 | chr16:691804- 698474 | NR_138607.1 | Switch to isoform 1 (exon exluded) | Yes (change from non- coding) | uc002cic.1 | uc002cie.2 | Yes | |
LINC01133 | Alternative exon | chr1:159931008- 159948851 | ENST00000443364.6 | chr1:159931014- 159948876 | NR_038849.1 | Induction of isoform 1 (exon excluded) | Both non-coding | Not annotated | uc001fuu.2 | No | |
SS18 | Alternative exon | chr18:23596217- 23670611 | NM_001007559.2 | chr18:23596217- 23670611 | NM_005637.3 | Switch to isoform 2 (exon excluded) | Yes | uc002kvm.2 | uc002kvn.2 | Yes | |
RHOC | Alternative exon | chr1:113243897- 113249757 | ENST00000369638.6 | chr1:113243947- 113249742 | ENST00000369636.6 | Switch to isoform 2 (exon excluded) | No (5' UTR) | uc009wgk.1 | uc001ecr.1 | Yes | |
ZNF226 | Retained intron | chr19:44669215- 44681838 | NM_001319088.1 | chr19:44669249- 44679582 | NM_015919.3 | Switch to isoform 1 (intron included) | Yes | uc002oyo.2 | uc002oyn.2 | Yes |
Gene name | Function | Clinical importance and roles in other cancer types | Clinical importance and roles in prostate cancer |
---|---|---|---|
TACC2 Transforming Acidic Coiled- Coil Containing Protein 2 | centrosome- and microtubule-interacting protein | Growth and prognosis of breast cancer56 | castration-resistant growth of prostate cancer57 |
LIG4 | DNA ligase with role in DNA repair | Prognostic marker in nasopharyngeal cancer58 Upregulated in colorectal cancer with role in wnt signalling59 | Predictor of poor prognosis60 |
RLN1 and RLN2 (Relaxin1 and 2) | Endocrine hormones (part of insulin gene superfamily) | Breast cancer invasiveness61,62 metastasis of human osteosarcoma63 Thyroid cancer oncogenesis64,65 | Well characterised role in the development and progression of prostate cancer5,50–55. |
TPD52 (Tumor Protein D52) | Role in proliferation and exo- and endocytic pathways | Well characterised role in numerous cancer types46,66–69 | Known AR target, overexpressed and amplified in prostate cancer70 Oncogene in prostate cancer71 Neuroendocrine transdifferentiation of prostate cancer72 Isoform produced by alternative promoter known as PrLZ and already linked to prostate cancer47–49,73,74 |
FDFT1 (Farnesyl-Diphosphate Farnesyltransferase 1) | squalene synthase | Role in lung cancer metastasis75 | Linked to prostate cancer risk and aggressiveness76 |
TLE3 (Transducin Like Enhancer Of Split 3) | Negative regulator of Wnt/β- catenin signaling | Predictive marker for response to therapy in ovarian and breast cancer77,78 Represses colon cancer proliferation79 | Upregulated in prostate tumours80 and linked to wnt signalling in castrate resistant disease81 |
CNNM2 (Cyclin & CBS Domain Divalent Metal Cation Transport Mediator 2) | Magnesium transporter | Proposed oncogenic role via increasing magnesium uptake82 | Unknown |
NUP93 | Nucleoporin protein – role in apoptosis | Driver mutation linked to breast cancer83 | Unknown |
MAT2A Methionine adenosyltransferase II | Biosynthesis of S-adenosylmethionine, the principal biological methyl donor and precursor of polyamines and glutathione. | Upregulated in liver and colon cancer, potential drug target84,85 Tumour suppressor in kidney carcinogenesis86 Role in other cancer types87 | Upregulated in prostate cancer and linked to cell migration via miR-34a and miR- 34b87,88 |
PIK3R1 | PI3K regulatory subunit | Underexpressed in breast cancer89 High mutation frequency in endometrial cancer90 | Controlled by androgens and repressed in prostate cancer cells21 |
SNAPC2 (Small Nuclear RNA Activating Complex Polypeptide 2) | Subunit of the snRNA- activating protein complex. Necessary for RNA polymerase II and III dependent small-nuclear RNA gene transcription | Epigenetic silencing is prognostic in glioblastoma91 | Unknown |
ZNF678 (Zinc Finger Protein 678) | Potential role in transcriptional regulation | Unknown | Unknown |
NDUFV3 (NADH:Ubiquinone Oxidoreductase Subunit V3) | Subunit of part of the mitochondrial respiratory chain | Unknown | Androgen regulated alternative splice isoform previously identified by our exon array study10 |
OSBPL1A (Oxysterol Binding Protein Like 1A) | Intracellular lipid receptor | Alternative promoter use in colorectal cancer92 | Unknown |
RDH13 (Retinol Dehydrogenase 13) | Role in retinoic acid production and protection against oxidative stress | Unknown | Unknown |
ZNF121 (Zinc Finger Protein 121) | Potential role in transcriptional regulation | Interacts with MYC. Upregulated in breast cancer93 | Unknown |
SLC36A4.1 (Solute Carrier Family 36 Member 4) | amino acid transporter | Unknown | Unknown |
RCAN1 (Regulator of Calcineurin 1) | Inhibits calcineurin- dependent signaling pathways | Inhibits NF-κB and suppresses lymphoma growth in mice94. Role in cancer cell migration95 | Unknown |
DCXR (Dicarbonyl & l-xylulose reductase) | Role in the uronate cycle of glucose metabolism | Low expression indicates poor prognosis for hepatocellular carcinoma96. Role in cell adhesion97,98 | Upregulated and potential biomarker in prostate cancer99 |
NDUFAF4 (NADH:Ubiquinone Oxidoreductase Complex Assembly Factor 4) | Role in the mitochondrial respiratory chain | Unknown | Unknown |
MAPRE2 (Microtubule Associated Protein RP/EB Family Member 2) | Microtubule-associated protein that is necessary for spindle symmetry during mitosis | Role in the invasion of pancreatic cancer cells100 | Unknown |
PEX10 (Peroxisomal Biogenesis Factor 10) | Involved in import of peroxisomal matrix proteins | Unknown | Unknown |
AP2S1 (Adaptor Related Protein Complex 2 Sigma 1 Subunit) | Function in protein transport across membranes | Unknown | Unknown |
LINC01133 (long non-coding RNA) | Long non-coding RNA | Poor prognosis in colorectal cancer101 Upregulated and linked to poor prognosis in lung cancer102 | Unknown |
ZNF226 (Zinc Finger Protein 226) | Potential role in transcriptional regulation | Unknown | Unknown |
CDIP1 (Cell death inducing p53 target 1) | p53 apoptotic effector Regulates TNF-alpha- mediated apoptosis | sensitivity to TNFα- induced apoptosis in cancer cells103 | Unknown |
The 73 identified mRNA isoforms were generated via androgen-regulated utilisation of 56 alternative promoters, 4 alternative 3′ ends and 13 alternative splicing events (Figure 1A). Of the 56 androgen regulated alternative promoters that were identified, 23 alternative promoters were induced by androgens (including LIG4, Figure 1B), 26 promoters were repressed by androgens, and for 7 genes there was a switch in usage from one promoter to another (Table 1). The alternative splicing events that were under androgen control included 12 alternative exons and one androgen-regulated intron retention (Table 1). 10 of these are novel to this study, including exclusion of an alternative exon in ZNF678 (Figure 1C). Of the alternative exons, six genes contained switches in previously unannotated protein-coding exons in response to androgen-exposure. We also identified four androgen regulated alternative mRNA 3' end isoform switches, including a switch in the 3’ end of the mRNA transcript for the MAT2A gene (Figure 1D).
48/73 (66%) of the androgen regulated alternative events detected in response to androgen stimulation are predicted to change the amino acid sequence of the resulting protein (Table 1). Some of these are already known to have a well characterised role in prostate cancer progression, including an alternative promoter in the oncogene TPD52 that produces a protein isoform called PrLZ (Figure 2A)46–49. Others are not so well characterised. Using western blotting we could detect a novel shorter protein isoform corresponding to androgen-driven selection of an alternative promoter in the TACC2 gene (Figure 2B); and exclusion of a cassette exon in the NDUFV3 gene, which we show also produces a novel shorter protein isoform (Figure 2C). We also detected a switch in the 3' end of the mRNA transcript for the MAT2A gene, which is predicted to produce a protein isoform with a shorter C-terminal domain (Figure 1D); and induction of an alternative 3' isoform of CNNM2, which is predicted to be missing a conserved CBS domain (Table 1 and Supplementary Figure 1).
11 of the remaining identified androgen-regulated alternative events change the expression of mRNAs from coding to non-coding or untranslated (not predicted to produce a protein) (Table 1). These included promoter switches for the RLN1 and RLN2 genes which encode peptide hormones that may be important in prostate cancer5,50–55. Androgens drive a promoter switch in both RLN1 and RLN2 to produce predicted non-coding or untranslated mRNA isoforms, reducing expression of protein-coding RLN1 and RLN2 mRNA isoforms. To test whether prostate cancer cells turn off gene expression by switching between utilisation of promoters that generate coding and noncoding mRNAs, we analysed RLN2 protein levels. Consistent with our hypothesis and a previous study55, RLN2 protein production was negatively regulated by androgens in parallel to the switch to the non-coding mRNA isoform (Figure 2D).
14 of the identified androgen-dependent mRNA isoforms lead to/result in coding mRNAs with altered 5’ untranslated regions (5′ UTR) with no impact on the coding sequence. These include a promoter switch in the LIG4 gene (Figure 1B).
To investigate potential links between androgen-dependent mRNA isoforms and tumourigenesis, we analysed the expression of 41 androgen-regulated mRNA isoform pairs in clinical prostate adenocarcinoma and normal prostate tissues. This analysis utilised transcriptomic data from 497 tumour samples and 52 normal samples in the PRAD TCGA cohort104. The remaining isoform pairs identified within our dataset have not been previously annotated by UCSC, therefore it was not possible to include them in our comparison. A description of the cohort used is summarised in Table 4.
33 of the 42 mRNA isoform pairs exhibited significant differences in the expression of at least one of the isoforms, or in the isoform expression ratio between tumour and normal tissues (Table 5). 13 of those tumour-specific alterations mimicked the effect of androgen stimulation in LNCaP cells: the changes were in form of alternative promoters for TACC2, TPD52, NUP93, PIK3R1, RDH13, ZFAND6, CDIP1, YIF1B, LIMK2, and FDFT1; an alternative 3´ end in CNNM2; and alternative exons in NDUFV3 and SS18 (Figure 3, Table 5 & Supplementary Figure 2). Two of the alternative promoters (ZFAND6 and CDIP1) are predicted to introduce a change in the 5′UTR, whereas all the others are predicted to alter the resulting protein isoform. A number of mRNA isoforms that were androgen responsive in LNCaP cells showed tumour specific alterations opposite to the effect of androgen stimulation. These were LIG4, MAPRE2, OSBPL1A, SEPT5, NR4A1, and RCAN1 (all predicted to alter the resulting protein isoform except LIG4). For the remaining 14 mRNA isoform pairs, the data was inconclusive according to the consistency conditions listed in the methods section (Table 5).
We next investigated whether the identified androgen-dependent mRNA isoforms are differentially expressed during prostate cancer progression by correlating the expression levels of each isoform with Gleason scores and prostate tumour grades within the PRAD TCGA cohort (Figure 4 & Figure 5, Table 6 & Table 7 and Supplementary Figure 3 & Supplementary Figure 4). For 6 of the alternative mRNA isoforms responsive to androgens (made from alternative promoters in LIG4, OSBPL1A, CLK3, TSC22D3 & ZNF32 and utilising an alternative exon in ZNF121), the expression changed significantly with Gleason score and showed specific alterations consistent with the effect of androgen stimulation. Conversely, 9 alternative isoforms (which were androgen responsive in LNCaP cells) showed tumour specific alterations opposite to the effect of androgen stimulation (including an alternative promoters in NUP93 and the alternative 3´end of MAT2A). 3 androgen regulated mRNA isoforms (OSBPL1A, CLK3 and TSC22D3) change significantly with both Gleason grade and tumour stage.
The main function of the androgen receptor (AR) is as a DNA binding transcription factor that regulates gene expression. Here we show the AR can couple hormone induced gene transcription to alternative mRNA isoform expression in prostate cancer. In response to androgens, the AR can induce the use of alternative promoters, induce the expression of alternatively spliced mRNA isoforms, regulate the expression of non-coding mRNA transcripts, and promote the transcription of mRNA isoforms encoding different protein isoforms. Importantly, we also find that some of these alternative mRNA isoforms are differentially regulated in prostate cancer versus normal tissue and also significantly change expression during tumour progression. Our data suggest that most androgen regulated alternative mRNA isoforms are generated through alternative promoter selection rather than through internal alternative exon splicing mechanisms. This suggests expression of alternative isoforms of specific genes can be a consequence of RNA polymerase being recruited to different promoters in response to androgen stimulation. Alternative promoter usage has been observed for many genes and is believed to play a significant role in the control of gene expression4,105,106. Alternative promoter use can also generate mRNA isoforms with distinct functional activities from the same gene, sometimes having opposing functions11.
Androgen exposure further drives a smaller number of alternative splicing events suggesting that the AR could contribute to altered patterns of splicing in prostate cancer cells. Tumour progression is believed to be associated with a coordinated change in splicing patterns which is regulated by several factors including signalling molecules7. We also identified 4 AR regulated alternative mRNA 3′ end isoform switches. This is the first time that regulation of 3′ mRNA end processing has been shown to be controlled by androgens. The selection of alternative 3′ ends can produce mRNA isoforms differing in the length of their 3′ UTRs (which can lead to the inclusion or exclusion of regulatory elements and influence gene expression), or in their C-terminal coding region (which can contribute to proteome diversity)107–114. Defective 3′ mRNA processing of numerous genes has been linked to an oncogenic phenotype115–119, and the 3′ mRNA end profiles of samples from multiple cancer types significantly differ from those of healthy tissue samples115,119–121.
Based on the findings presented in this study, we propose that activated AR has the ability to coordinate both transcriptional activity and mRNA isoform decisions through the recruitment of co-regulators to specific promoters. The genomic action of the AR is influenced by a large number of collaborating transcription factors122–124. Specifically, Sam68 and p68 have been shown to modulate AR dependent alternative splicing of specific genes and are significantly overexpressed in prostate cancer31,32. In future work it will be important to define the role of specific AR co-regulators in AR mediated isoform selection.
Some of the androgen dependent mRNA isoforms identified here are predicted to yield protein isoforms that may be clinically important, or to switch off protein production via generation of noncoding mRNA isoforms. Although the functional significance of the alternative mRNA isoforms identified in this study is yet largely unexplored, as is their role in the cellular response to androgens, the presented results emphasize the importance of analysing gene regulation and function at the mRNA isoform level.
The RNASeq data from LNCaP cells has been published previously https://doi.org/10.1016/j.ebiom.2016.04.01825
The RNAseq custom tracks are available in Supplementary File 1. To view these files please load them onto the UCSC website using the ‘My data’ tab and ‘custom tracks’. Then ‘Paste URLs or data’. The data is aligned to Feb 2009 (GRCh37/hg19).
Prostate adenocarcinoma cohort RNA-Seq data was downloaded from the Broad Institute TCGA Genome Analysis Center: Firehose 16/01/28 run https://doi.org/10.7908/C11G0KM943
Dataset 1: Real-time PCR raw Ct values 10.5256/f1000research.15604.d21287341
Dataset 2: Raw unedited western blot images 10.5256/f1000research.15604.d212874125
This work was funded by Prostate Cancer UK [PG12-34, S13-020 and RIA16-ST2-011].
The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Supplementary Table 1: Details of primer sequences used.
Click here to access the data.
Supplementary File 1: RNA-Seq reads custom tracks for visualisation on UCSC genome browser
Click here to access the data.
Supplementary Figure 1: PCR validation of 17 androgen regulated alternative events.
Click here to access the data.
Supplementary Figure 2: Differential alternative mRNA isoform expression in theTGCA PRAD cohort. Normal vs. tumour (unpaired samples) analysis. Violin-boxplots of expression in transcripts per million mapped reads (TPM) of Isoforms 1 (left panel) and 2 (central panel), and of their expression ratio in PSI (right panel) in normal and tumour samples. The mean log2 fold-change (logFC) in expression between tumour and normal samples and the associated FDR-adjusted p-value for the moderated t-statistic of differential expression are shown for both isoforms (left and central panels). The mean difference in PSI (deltaPSI) between tumour and normal samples and the associated FDR-adjusted p-value for the Mann-Whitney U test of differential splicing are shown (right panel).
Click here to access the data.
Supplementary Figure 3: Differential alternative mRNA isoform expression in the TGCA PRAD cohort across different Gleason grades. Violin-boxplots of expression in transcripts per million mapped reads (TPM) of Isoforms 1 (left panel) and 2 (central panel), and of their expression ratio (right panel) by Gleason grade. Their respective Spearman’s correlation coefficient (Rho) with grade and associated FDR-adjusted p-value are shown.
Click here to access the data.
Supplementary Figure 4: Differential alternative mRNA isoform expression in the TGCA PRAD cohort across different tumour stages. Violin-boxplots of expression in transcripts per million mapped reads (TPM) of Isoforms 1 (left panel) and 2 (central panel), and of their expression ratio (right panel) by tumour stage. Their respective Spearman’s correlation coefficient (Rho) with stage and associated FDR-adjusted p-value are shown.
Views | Downloads | |
---|---|---|
F1000Research | - | - |
PubMed Central
Data from PMC are received and updated monthly.
|
- | - |
Is the work clearly and accurately presented and does it cite the current literature?
Yes
Is the study design appropriate and is the work technically sound?
Yes
Are sufficient details of methods and analysis provided to allow replication by others?
Yes
If applicable, is the statistical analysis and its interpretation appropriate?
Yes
Are all the source data underlying the results available to ensure full reproducibility?
Partly
Are the conclusions drawn adequately supported by the results?
Yes
Competing Interests: No competing interests were disclosed.
Is the work clearly and accurately presented and does it cite the current literature?
Yes
Is the study design appropriate and is the work technically sound?
Yes
Are sufficient details of methods and analysis provided to allow replication by others?
Yes
If applicable, is the statistical analysis and its interpretation appropriate?
Yes
Are all the source data underlying the results available to ensure full reproducibility?
Yes
Are the conclusions drawn adequately supported by the results?
Yes
Competing Interests: No competing interests were disclosed.
Reviewer Expertise: Transcription and alternative splicing, transcriptomics
Is the work clearly and accurately presented and does it cite the current literature?
Yes
Is the study design appropriate and is the work technically sound?
Yes
Are sufficient details of methods and analysis provided to allow replication by others?
Yes
If applicable, is the statistical analysis and its interpretation appropriate?
Yes
Are all the source data underlying the results available to ensure full reproducibility?
Yes
Are the conclusions drawn adequately supported by the results?
Yes
Competing Interests: No competing interests were disclosed.
Reviewer Expertise: alternative splicing; prostate cancer; diabetes (renal complications)
Alongside their report, reviewers assign a status to the article:
Invited Reviewers | |||
---|---|---|---|
1 | 2 | 3 | |
Version 1 03 Aug 18 |
read | read | read |
Click here to access the data.
Spreadsheet data files may not format correctly if your computer is using different default delimiters (symbols used to separate values into separate cells) - a spreadsheet created in one region is sometimes misinterpreted by computers in other regions. You can change the regional settings on your computer so that the spreadsheet can be interpreted correctly.
Click here to access the data.
Spreadsheet data files may not format correctly if your computer is using different default delimiters (symbols used to separate values into separate cells) - a spreadsheet created in one region is sometimes misinterpreted by computers in other regions. You can change the regional settings on your computer so that the spreadsheet can be interpreted correctly.
Provide sufficient details of any financial or non-financial competing interests to enable users to assess whether your comments might lead a reasonable person to question your impartiality. Consider the following examples, but note that this is not an exhaustive list:
Sign up for content alerts and receive a weekly or monthly email with all newly published articles
Already registered? Sign in
The email address should be the one you originally registered with F1000.
You registered with F1000 via Google, so we cannot reset your password.
To sign in, please click here.
If you still need help with your Google account password, please click here.
You registered with F1000 via Facebook, so we cannot reset your password.
To sign in, please click here.
If you still need help with your Facebook account password, please click here.
If your email address is registered with us, we will email you instructions to reset your password.
If you think you should have received this email but it has not arrived, please check your spam filters and/or contact for further assistance.
Comments on this article Comments (0)