Skip to main content

NuchaRt: Embedding High-Level Parallel Computing in R for Augmented Hi-C Data Analysis

  • Conference paper
  • First Online:
Computational Intelligence Methods for Bioinformatics and Biostatistics (CIBB 2015)

Abstract

Recent advances in molecular biology and Bioinformatics techniques brought to an explosion of the information about the spatial organisation of the DNA in the nucleus. High-throughput chromosome conformation capture techniques provide a genome-wide capture of chromatin contacts at unprecedented scales, which permit to identify physical interactions between genetic elements located throughout the human genome. These important studies are hampered by the lack of biologists-friendly software. In this work we present NuchaRt, an R package that wraps NuChart-II, an efficient and highly optimized C++ tool for the exploration of Hi-C data. By rising the level of abstraction, NuchaRt proposes a high-performance pipeline that allows users to orchestrate analysis and visualisation of multi-omics data, making optimal use of the computing capabilities offered by modern multi-core architectures, combined with the versatile and well known R environment for statistical analysis and data visualisation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    We actually use data.tables as basic data structures for our datasets: data.table is an enhanced version of data.frame that allows to easily optimise operations for speed and memory usage.

  2. 2.

    https://gephi.org/.

References

  1. Dekker, J., Rippe, K., Dekker, M., Kleckner, N.: Capturing chromosome conformation. Science 295(5558), 1306ā€“1311 (2002)

    ArticleĀ  Google ScholarĀ 

  2. Dixon, J., Selvaraj, S., Yue, F., Kim, A., Li, Y., Shen, Y., Hu, M., Liu, J., Ren, B.: Topological domains in mammalian genomes identified by analysis of chromatin interactions. Nature 485(5), 376ā€“80 (2012)

    ArticleĀ  Google ScholarĀ 

  3. Drocco, M., Misale, C., Pezzi, G.P., Tordini, F., Aldinucci, M.: Memory-optimised parallel processing of Hi-C data. In: Proceedings of International Euromicro PDP 2015: Parallel Distributed and Network-Based Processing, pp. 1ā€“8. IEEE, March 2015. http://calvados.di.unipi.it/storage/paper_files/2015_pdp_memopt.pdf

  4. Eddelbuettel, D.: Seamless R and C++ Integration with Rcpp. Springer, New York (2013). ISBN 978-1-4614-6867-7

    BookĀ  MATHĀ  Google ScholarĀ 

  5. Merelli, I., LiĆ², P., Milanesi, L.: Nuchart: An R package to study gene spatial neighbourhoods with multi-omics annotations. PLoS ONE 8(9), e75146 (2013)

    ArticleĀ  Google ScholarĀ 

  6. Tordini, F., Drocco, M., Misale, C., Milanesi, L., LiĆ², P., Merelli, I., Aldinucci, M.: Parallel exploration of the nuclear chromosome conformation with NuChart-II. In: Proceedings of International Euromicro PDP 2015: Parallel Distributed and Network-Based Processing. IEEE, March 2015. http://calvados.di.unipi.it/storage/paper_files/2015_pdp_nuchartff.pdf

  7. Wickham, H.: Advanced R, 1st edn. Chapman and Hall/CRC, Boca Raton (2014)

    Google ScholarĀ 

  8. Winterbach, W., Mieghem, P.V., Reinders, M.J.T., Wang, H., Ridder, D.: Topology of molecular interaction networks. BMC Syst. Biol. 7, 90 (2013)

    ArticleĀ  Google ScholarĀ 

Download references

Acknowledgements

This work has been partially supported by the EC-FP7 STREP project ā€œREPARAā€ (no. 609666), the Italian Ministry of Education and Research Flagship (PB05) ā€œInterOmicsā€, and the EC-FP7 innovation project ā€œMIMOMICSā€.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Fabio Tordini .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

Ā© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Tordini, F., Merelli, I., LiĆ², P., Milanesi, L., Aldinucci, M. (2016). NuchaRt: Embedding High-Level Parallel Computing in R for Augmented Hi-C Data Analysis. In: Angelini, C., Rancoita, P., Rovetta, S. (eds) Computational Intelligence Methods for Bioinformatics and Biostatistics. CIBB 2015. Lecture Notes in Computer Science(), vol 9874. Springer, Cham. https://doi.org/10.1007/978-3-319-44332-4_20

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-44332-4_20

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-44331-7

  • Online ISBN: 978-3-319-44332-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics