Published November 13, 2020 | Version 3
Dataset Open

Gene expression and splicing counts from the Kremer et al. study

  • 1. Technical University of Munich

Description

File description:

  1. geneCounts: gene-level counts

  2. k_j: split counts spanning from one exon to another

  3. k_theta: non-split counts covering a splice site

  4. n_psi3: total split counts from a given acceptor site

  5. n_psi5: total split counts from a given donor site

  6. n_theta: total split and non-split counts for a given splice site

  7. Sample annotation describing each sample from the dataset

  8. Description file with global information from the dataset
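The relationship between the split-count files listed above can be illustrated with a small sketch. It assumes that each junction is identified by a (donor, acceptor) coordinate pair, and that the per-site totals are plain sums of the junction counts sharing that site, which is how the descriptions read. The coordinates, counts, and function name below are hypothetical toy values, not taken from the dataset, and the published totals may have been computed before any junction filtering.

```python
from collections import defaultdict

# Hypothetical toy data for one sample:
# (donor_position, acceptor_position) -> split count (k_j)
k_j = {
    (100, 200): 30,
    (100, 300): 10,
    (150, 300): 5,
}

# Hypothetical non-split counts covering each donor site (k_theta)
k_theta = {100: 8, 150: 0}


def site_totals(split_counts):
    """Sum split counts per donor (n_psi5) and per acceptor (n_psi3)."""
    n_psi5 = defaultdict(int)
    n_psi3 = defaultdict(int)
    for (donor, acceptor), count in split_counts.items():
        n_psi5[donor] += count
        n_psi3[acceptor] += count
    return dict(n_psi5), dict(n_psi3)


n_psi5, n_psi3 = site_totals(k_j)

# n_theta: split plus non-split counts at a site (shown for donor sites only)
n_theta = {site: n_psi5[site] + k_theta[site] for site in k_theta}

print(n_psi5)   # {100: 40, 150: 5}
print(n_psi3)   # {200: 30, 300: 15}
print(n_theta)  # {100: 48, 150: 5}
```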

 

The gene counts were generated using the GTF file from GENCODE release 34 (https://www.gencodegenes.org/human/release_34), and the split and non-split counts contain only the annotated junctions from the same release.

Use: The count matrices are intended for researchers who want to use RNA-Seq data for diagnostics. Researchers can merge their own dataset with the downloaded one, provided the tissue, genome build, strand specificity, and paired-end specifications match. Afterwards, the DROP pipeline (https://github.com/gagneurlab/drop) can be used to compute expression and splicing outliers.
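The compatibility condition for merging can be checked before any counts are combined. The sketch below is an illustration only and is not part of DROP: the metadata keys, the function names, and the dictionary layout of the count matrices (gene ID mapped to per-sample counts) are assumptions made for this example. The reference values are the ones stated in this description.

```python
# Fields that must match before merging, per the usage note above.
REQUIRED_KEYS = ("tissue", "genome_assembly", "strand_specific", "paired_end")

# Values as stated in this dataset description.
PUBLISHED = {
    "tissue": "Fibroblast",
    "genome_assembly": "hg19",
    "strand_specific": False,
    "paired_end": True,
}


def mismatches(own, published=PUBLISHED):
    """Return the required fields on which the two datasets disagree."""
    return [key for key in REQUIRED_KEYS if own.get(key) != published[key]]


def merge_counts(own_counts, published_counts):
    """Combine two gene -> {sample: count} tables, keeping shared genes only."""
    merged = {}
    for gene in sorted(own_counts.keys() & published_counts.keys()):
        duplicated = own_counts[gene].keys() & published_counts[gene].keys()
        if duplicated:
            raise ValueError(f"duplicate sample IDs: {sorted(duplicated)}")
        merged[gene] = {**published_counts[gene], **own_counts[gene]}
    return merged


# Hypothetical example data (gene and sample IDs are placeholders).
own_meta = {"tissue": "Fibroblast", "genome_assembly": "hg19",
            "strand_specific": True, "paired_end": True}
print(mismatches(own_meta))  # ['strand_specific']

own = {"GENE_A": {"own_1": 12}, "GENE_B": {"own_1": 0}}
published = {"GENE_A": {"pub_1": 7}, "GENE_C": {"pub_1": 3}}
print(merge_counts(own, published))  # {'GENE_A': {'pub_1': 7, 'own_1': 12}}
```

Keeping only shared genes is one possible design choice; it avoids inventing zero counts for genes that one dataset never quantified.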

Number of samples: 119
Tissue: Fibroblast
Organism: Homo sapiens
Genome assembly: hg19
Gene annotation: gencode34
Disease (ICD-10: N): E75: 1, E79: 13, E88: 84, G31: 9, K72: 3, NONE: 9
Strand specific: FALSE
Paired end: TRUE
Protocol: poly(A) enrichment
Dataset contact: Vicente Yepez, yepez@in.tum.de; Christian Mertes, mertes@in.tum.de; Julien Gagneur, gagneur@in.tum.de; Holger Prokisch, prokisch@helmholtz-muenchen.de

Citation: Cite both the resource, using Zenodo's citation, and the publication listed under References.

Files

Files (133.5 MB)

md5:1d8604836106c7f09b594d480c97cffe
133.5 MB
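After downloading, the archive can be checked against the MD5 checksum listed above. This is a minimal sketch using Python's standard `hashlib`; the file path in the usage comment is a hypothetical placeholder, since the archive name is not given here.

```python
import hashlib

# Checksum as listed on this page.
EXPECTED_MD5 = "1d8604836106c7f09b594d480c97cffe"


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest matches the expected checksum."""
    return md5_of_file(path) == expected


# Usage (hypothetical path, replace with the downloaded file):
# verify_download("path/to/downloaded_archive")
```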

Additional details

References