Review of single-cell RNA-seq data clustering for cell-type identification and characterization

  1. Ka-Chun Wong2
  1. 1School of Computer Science and Technology, Xidian University, Xi'an 710071, China
  2. 2Department of Computer Science, City University of Hong Kong, Hong Kong SAR, China
  3. 3School of Artificial Intelligence, Jilin University, Jilin 130012, China
  4. 4College of Computer Science and Software Engineering, Shenzhen University, Shenzhen 518060, China
  1. Corresponding author: sxzhang7-c{at}my.cityu.edu.hk

Abstract

In recent years, the advances in single-cell RNA-seq techniques have enabled us to perform large-scale transcriptomic profiling at single-cell resolution in a high-throughput manner. Unsupervised learning such as data clustering has become the central component to identify and characterize novel cell types and gene expression patterns. In this study, we review the existing single-cell RNA-seq data clustering methods with critical insights into the related advantages and limitations. In addition, we also review the upstream single-cell RNA-seq data processing techniques such as quality control, normalization, and dimension reduction. We conduct performance comparison experiments to evaluate several popular single-cell RNA-seq clustering approaches on simulated and multiple single-cell transcriptomic data sets.

Keywords

This article, published in RNA, is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.

| Table of Contents
OPEN ACCESS ARTICLE