Prediction of transcription factor bindings sites affected by SNPs located at the osteopontin promoter

This data contains information related to the research article entitled “Osteopontin splice variants and polymorphisms in Cancer Progression and Prognosis” [1]. Here, we describe an in silico analysis of transcription factors that could have altered binding to their DNA target sequence as a result of SNPs in the osteopontin gene promoter. We concentrated on SNPs associated with cancer risk and development. The analysis was performed with PROMO v3.0.2 software which incorporates TRANSFACT v6.4 of. We also present a figure depicting the putative transcription factor binding according to genotype.


Subject area
Biology, Molecular Biology More specific subject area

Effect of SNPs in binding of transcription factors for the gene osteopontin
Type of data

Value of the data
These data describe how putative DNA-binding sites for transcriptional factors can be created or interrupted by the changes in sequences generated by SNPs in the promoter of osteopontin.
Differential binding among SNPs genotypes can potentially explain why these SNPs have been associated with changes in the risk of cancer for a specific population. This analysis is an example of how important databases, such as those containing SNP genotypes and the predictive tools for DNA-binding sites for transcriptional factors in a specific sequence, could be used to try to select potential signaling pathways modulating the development of cancer.

Data
The table provided in this article is a list of the transcription factors predicted to bind a DNA sequence at the SNPs contained in the osteopontin promoter. We analyzed only those SNPs that statistically in a population have been shown to have an effect on cancer risk and prognosis for the carriers. For each SNP we present both sequences. Each analysis contains the rs ID and the nucleotide position in reference to the osteopontin promoter; a schematic representation of the binding of the transcription factor to their target sequence; and an analysis of how similar the binding site is compared to its canonical binding sequence.

Experimental design, materials and methods
Analysis of SNP sequences was performed using software PROMO v3.0.2, (which utilizes TRANS FAC v6.4) [2,3] For each osteopontin gene promoter SNP, the sequences carrying each allele were loaded as the query sequence to search for potential binding sites. The prediction was carried out considering only sites and only human transcription factors. The output of this analysis is presented in Table 1. Each analysis contains the rs that corresponds to each SNP and its position relative to the transcription start site of osteopontin. For each SNP, we present the respective results for both sequences loaded as the query sequences. A schematic representation (boxes in color, also indicated with numbers) of the binding of the transcription factor to the target sequence, and a list of the putative transcription factors binding to the sequence. For each transcription factor site, several predicted parameters are reported. The transcription Table 1 Transcription factors binding prediction to sequences associated to SNPs genotypes located in the promoter of the osteopontin gene. factor name with the database accession number in brackets; the start and end positions of the putative binding sequences; Dissimilarity (%), which corresponds to the rate of dissimilarity between the putative and consensus sequences for a given transcription factor; Sequence, the nucleotide sequence of potential binding site; Random Expectation (RE) indicating the expected occurrences of the match in a random sequence of the same length as the query sequence according to the dissimilarity index, presented the RE equally (equi-probability for the four nucleotides) and RE query (nucleotide frequencies as in the query sequence). Markedly different changes are highlight in grey and the SNP is highlight in red. In Fig. 1 we depict the integration of information obtained from this predictive analysis and data previously reported for transcription factors binding to the osteopontin promoter.