Review
Overview of clinical flow cytometry data analysis: recent advances and future challenges

https://doi.org/10.1016/j.tibtech.2013.04.008Get rights and content

Highlights

  • Polychromatic flow cytometry generates increasingly complex n-dimensional data sets.

  • New tools are being created for objective flow cytometry data analysis/interpretation.

  • Full automation of data analysis remains a challenge.

Major technological advances in flow cytometry (FC), both for instrumentation and reagents, have emerged over the past few decades. These advances facilitate simultaneous evaluation of more parameters in single cells analyzed at higher speed. Consequently, larger and more complex data files that contain information about tens of parameters for millions of cells are generated. This increasing complexity has challenged pre-existing data analysis tools and promoted the development of new algorithms and tools for data analysis and visualization. Here, we review the currently available (conventional and newly developed) data analysis and visualization strategies that aim for easier, more objective, and robust interpretation of FC data both in biomedical research and clinical diagnostic laboratories.

Introduction

Together with the microscope, flow cytometers have become invaluable instruments for cell analysis both in research and clinical laboratories, particularly for the evaluation of normal and malignant hematopoietic and lymphoid cells [1]. Since the 1980s, FC has progressively extended from basic research to clinical diagnostic laboratories. Development of the hybridoma technology and massive production of monoclonal antibodies, together with many well-characterized, high-quality reagents and a progressively broad variety of compatible fluorochromes [1], have been crucial in the translation of FC to clinical diagnostic laboratories. In the past two decades, multiparameter FC immunophenotyping of normal, reactive, and malignant cells has been augmented by: (i) a larger number of parameters that can be simultaneously assessed in individual cells [2], and (ii) greater analysis speed of digital cytometers [3] that interrogate tens of thousands of cells per second. However, the exponentially growing, complex information about individual cells (and their products) in a sample, has required changes and developments in both data analysis, representation and visualization tools [4].

For decades, conventional FC data analysis has identified unique cell populations within a sample and characterized them by defining a number of regions (e.g., ‘gates’) in single and bidimensional plots. Such gating strategies are typically set for the selection of the subpopulations of interest by an experienced operator [5]. However, due to the greater multiparameter capabilities of FCs, strategies to extract maximum information from an n-dimensional space and to provide clear, simple and intuitive graphical representation of data, have become a major challenge [4]. In fact, the real challenge is to build FC data mining tools that extract relevant information in an objective, precise, reproducible, and comprehensive way. Moreover, it should be available to users through intuitive graphical representations and user-friendly interpretation-guided tools.

FC data analysis can be roughly divided into three steps: (i) generating FC data (Box 1); (ii) data storage; and (iii) mining stored information. In this paper we review the most relevant developments achieved in FC data analysis, and provide a perspective on current needs and potential future developments in the field.

Section snippets

FC data storage

Early flow cytometers and cell counters were developed before computers were integrated into analytical instruments 4, 6, 7, 8, 9. At that time, screens capable of representing analog signals in real time were used for fast analysis of results associated with individual cell measurements [10]. Progressive development of computer technology rapidly led to its incorporation in FC. Computers were used to control the instrument and to store data derived from the cellular measurements performed with

Number of dimensions in multiparameter FC data

Hematological and lymphoid tissues such as peripheral blood (PB), bone marrow, lymph node, thymus, and other specimens, are composed of different cell populations defined by multiple cell lineages, maturation stages, functional compartments, and activation states, mixed at variable percentages 1, 17, 18, 19, 20, 21, 22, 23, 24. Individual cell populations can be identified by binding high affinity antibodies to protein structures or by unique cell characteristics 24, 25, 26, 27. Although

Towards automated multiparameter FC data analysis and automated identification of cell populations

A common step in FC data analysis is the establishment of an association between each individual cellular event and specific populations of events with biological meaning. For pattern classification, single events, modeled as elements of an n-dimensional space, are linked to a class corresponding to a population of cells of interest. This is based on the principle that cells that belong to the same population will most probably show highly overlapping (e.g., similar) immunophenotypic features.

Visualization of multidimensional FC data

Adequate visualization of FC data strongly contributes to optimal data analysis and interpretation. Common tools currently used for FC data visualization (Figure 2) include single-parameter histograms, bivariate plots, and 3D representations of primary FC parameters – for example, light scatter or fluorescence emissions – (Figure 2A–C, respectively). Such plots (except for 3D views) are used to select (e.g., gate) groups of events and identify cell populations of interest, in an expert-based

Concluding remarks and future perspectives

FC has significantly advanced with the generation of increasingly complex n-dimensional data sets. New tools and procedures are devoted to data manipulation, data analysis, visualization, and interpretation. Nevertheless, further developments are still needed (Box 5), including: (i) increased reproducibility of FC measurements across different reagent and instrument platforms; (ii) optimized computational processing times; (iii) freely available reference data files and profiles for extended

Disclaimer statement

Cytognos S.L. is part of the UE-supported EuroFlow Research Consortium, and has implemented some of the algorithms described in the present study, in its proprietary software INFINICYT; Cytognos S.L. has a contract license of several patents owned by the University of Salamanca, of which A. Orfao, C.E. Pedreira, and E.S. Costa are inventors. Other authors declare no competing financial interests.

Glossary

  • APS1: Automated Population Separator 1; bidimensional dot plot representing the first versus

Acknowledgments

This work has been partially supported by the following grants: Spanish Network of Cancer Research Centers (ISCIII RTICC- RD12/0036/0048-FEDER), FIS 08/90881-FEDER from the ‘Fondo de Investigación Sanitaria’, Ministerio de Ciencia e Innovación (Madrid, Spain); Bilateral International Cooperation (Spain-Brazil), Ministerio de Ciencia e Innovación (Madrid, Spain) and CNPq-Brazilian National Research Council (Brasília, Brazil), Refs. PIB2010BZ-00565 and 560757/2010-7; the Euroflow Consortium

References (78)

  • J. Martini

    Time encoded multicolor fluorescence detection in a microfluidic flow cytometer

    Lab Chip

    (2012)
  • J.P. Robinson

    Computational analysis of high-throughput flow cytometry data

    Expert Opin. Drug Discov.

    (2012)
  • A. Orfao

    Useful information provided by the flow cytometric immunophenotyping of hematological malignancies: current status and future directions

    Clin. Chem.

    (1999)
  • J.W. Stewart

    The use of electronic blood-cell counters in routine haematology

    Br. J. Haematol.

    (1967)
  • H.R. Hulett

    Cell sorting: automated separation of mammalian cells as a function of intracellular fluorescence

    Science

    (1969)
  • W.A. Bonner

    Fluorescence activated cell sorting

    Rev. Sci. Instrum.

    (1972)
  • H.R. Hulett

    Development and application of a rapid cell sorter

    Clin. Chem.

    (1973)
  • M. Stöhr et al.

    Visualization of multidimensional spectra in flow cytometry

    J. Histochem. Cytochem.

    (1979)
  • R.F. Murphy et al.

    A proposal for a flow cytometric data file standard

    Cytometry

    (1984)
  • S.S. Stewart et al.

    Realtime acquisition, storage, and display of correlated three-parameter flow cytometric data

    Cytometry

    (1986)
  • Data file standard for flow cytometry

    Cytometry

    (1990)
  • M.A. Naivar

    Open, reconfigurable cytometric acquisition system: ORCAS

    Cytometry A

    (2007)
  • N.A. Zilmer

    Flow cytometric analysis using digital signal processing

    Cytometry

    (1995)
  • L.C. Seamer

    Proposed new data file standard for flow cytometry, version FCS 3.0

    Cytometry

    (1997)
  • E.E. Gray

    Subcapsular sinus macrophage fragmentation and CD169+ bleb acquisition by closely associated IL-17-committed innate-like lymphocytes

    PLoS ONE

    (2012)
  • C. Arnoulet

    Four- and five-color flow cytometry analysis of leukocyte differentiation pathways in normal bone marrow: a reference document based on a systematic approach by the GTLLF and GEIL

    Cytometry B: Clin. Cytom.

    (2010)
  • T. Szczepański

    Flow-cytometric immunophenotyping of normal and malignant lymphocytes

    Clin. Chem. Lab. Med.

    (2006)
  • A. Caraux

    Circulating human B and plasma cells. Age-associated changes in counts and detailed characterization of circulating normal CD138 and CD138+ plasma cells

    Haematologica

    (2010)
  • M. Perez-Andres

    Human peripheral blood B-cell compartments: a crossroad in B-cell traffic

    Cytometry B: Clin. Cytom.

    (2010)
  • E.S. Costa

    A new automated flow cytometry data analysis approach for the diagnostic screening of neoplastic B-cell disorders in peripheral blood samples with absolute lymphocytosis

    Leukemia

    (2006)
  • M. Vanderlaan

    Improved high-affinity monoclonal antibody to iododeoxyuridine

    Cytometry

    (1986)
  • Y. Phung

    High-affinity monoclonal antibodies to cell surface tumor antigen glypican-3 generated through a combination of peptide immunization and flow cytometry screening

    MAbs

    (2012)
  • S.P. Perfetto

    Seventeen-colour flow cytometry: unravelling the immune system

    Nat. Rev. Immunol.

    (2004)
  • J. Hoffmann

    High-throughput 13-parameter immunophenotyping identifies shifts in the circulating T-cell compartment following reperfusion in patients with acute myocardial infarction

    PLoS ONE

    (2012)
  • F. Gondois-Rey

    Multiparametric cytometry for exploration of complex cellular dynamics

    Cytometry A

    (2012)
  • J.P. Nolan et al.

    Spectral flow cytometry

    Curr. Protoc. Cytom.

    (2013)
  • S. Böttcher

    Flow cytometric MRD detection in selected mature B-cell malignancies

    Methods Mol. Biol.

    (2013)
  • B. Denys

    Improved flow cytometric detection of minimal residual disease in childhood acute lymphoblastic leukemia

    Leukemia

    (2013)
  • C.E. Pedreira

    Generation of flow cytometric data files with a potentially infinite number of dimensions

    Cytometry A

    (2008)
  • Cited by (107)

    • Nonimaging evaluation of patients with nervous system metastases

      2022, Neurological Complications of Systemic Cancer and Antineoplastic Therapy
    View all citing articles on Scopus
    View full text