Hostname: page-component-848d4c4894-p2v8j Total loading time: 0 Render date: 2024-06-11T07:39:52.912Z Has data issue: false hasContentIssue false

Permutations and Computational Power: A Molecular Cascade Analysis to Approach Big Data in Psychiatry

Published online by Cambridge University Press:  23 March 2020

A. Drago*
Affiliation:
Aarhus university- Denmark, department of clinical medicine- Psykiatrisk Forskningsenhed Vest, Herning, Denmark

Abstract

Core share and HTML view are not available for this content. However, as you have access to this content, a full PDF is available via the ‘Save PDF’ action button.

In the last few years, we conducted a number of molecular pathway analyses on the genetic samples provided by the NIMH. The molecular pathway approach accounts for the polygenic nature of the most part of psychiatric disorders. Nevertheless, the limits of this approach including the limited knowledge about the function of the genes, the fact that longer genes have higher probability to harbour variations significantly associated with the phenotype under analysis and the false positive associations for single variations, demand statistical control and bio-statistical knowledge. Permutations are a methodology to control for false positive associations, but their implementation requires that a number of criteria are taken into account: 1) the same number of genes and the same number of variations of the index pathway must be simulated in order to limit the bias of selecting significantly longer or shorter genes; 2) a sufficient number of permutated pathways is created (10E5 to 10E6 depending on computational resources) which demands higher computational power; 3) the correct statistical thresholds are identified and discussed; 4) some pathways might be over-represented and the source of information must be constantly updated. The tools for running a molecular pathway analysis (R Foundation for Statistical Computing, 2013) when interacting with a supercluster PC and the international bioinformatic datasets (Embase, NIMH and others), together with the critical steps of bioinformatics scripting (bash language) are described and discussed.

Disclosure of interest

The author has not supplied his declaration of competing interest.

Type
Workshop: big data in psychiatry. unprecedented opportunities, new strategies
Copyright
Copyright © European Psychiatric Association 2017
Submit a response

Comments

No Comments have been published for this article.