The Deep Poincaré Map: A Novel Approach for Left Ventricle Segmentation

Mo, Yuanhan; Liu, Fangde; McIlwraith, Douglas; Yang, Guang; Zhang, Jingqing; He, Taigang; Guo, Yike

doi:10.1007/978-3-030-00937-3_64

Yuanhan Mo¹⁸,
Fangde Liu¹⁸,
Douglas McIlwraith¹⁸,
Guang Yang¹⁹,
Jingqing Zhang¹⁸,
Taigang He²⁰ &
…
Yike Guo¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 11073))

Included in the following conference series:

International Conference on Medical Image Computing and Computer-Assisted Intervention

9695 Accesses
7 Citations

Abstract

Precise segmentation of the left ventricle (LV) within cardiac MRI images is a prerequisite for the quantitative measurement of heart function. However, this task is challenging due to the limited availability of labeled data and motion artifacts from cardiac imaging. In this work, we present an iterative segmentation algorithm for LV delineation. By coupling deep learning with a novel dynamic-based labeling scheme, we present a new methodology where a policy model is learned to guide an agent to travel over the image, tracing out a boundary of the ROI – using the magnitude difference of the Poincaré map as a stopping criterion. Our method is evaluated on two datasets, namely the Sunnybrook Cardiac Dataset (SCD) and data from the STACOM 2011 LV segmentation challenge. Our method outperforms the previous research over many metrics. In order to demonstrate the transferability of our method we present encouraging results over the STACOM 2011 data, when using a model trained on the SCD dataset.

You have full access to this open access chapter, Download conference paper PDF

Cardiac MRI Left Ventricular Segmentation and Function Quantification Using Pre-trained Neural Networks

Robust Cardiac MRI Segmentation with Data-Centric Models to Improve Performance via Intensive Pre-training and Augmentation

Cardiac MRI Left Ventricle Segmentation and Quantification: A Framework Combining U-Net and Continuous Max-Flow

1 Introduction

Automatic left ventricle (LV) segmentation from cardiac MRI images is a prerequisite to quantitatively measure cardiac output and perform functional analysis of the heart. However, this task is still challenging due to the requirement for relatively large manually delineated datasets when using statistical shape models or (multi-)atlas based methods. Moreover, as the heart and chest are constantly in motion the resulting images may contain motion artifacts with low signal to noise ratio. Such poor quality images can further complicate the subsequent LV segmentation.

Deep learning based methods have been proved effective for LV segmentation [1,2,3]. A detailed survey of the state-of-the-art lies outside the scope of this paper, but can be found elsewhere [4]. Such approaches are often based on, or extend image recognition research, and thus require large training datasets that are not always available for the cardiac MRI. To the best of our knowledge, there is very limited work using significant prior information to reduce the amount of training data required while maintaining a robust performance for LV segmentation.

In this paper, we propose a novel LV segmentation method called the Deep Poincaré Map (DPM). Our DPM method encapsulates prior information with a dynamical system employed for labeling. Deep learning is then used to learn a displacement policy for traversal around the region of interest (ROI). Given an image, a CNN-based policy model can navigate an agent over the cardiac MRI image, moving toward a path which outlines the LV. At each time step, a next step policy (a 2D displacement) is given by our trained policy model, taking into account the surrounding pixels in a local squared patch. In order to learn the displacement policy, the DPM requires a data transformation step which converts the labeled images into a customized dynamic capturing the prior information around the ROI. An important property of DPM is that no matter where the agent starts, it will finally travel around the ROI. This behavior is guaranteed by the existence of a limit cycle using our customized dynamic.

The main contributions of this work are as follows. (1) The DPM integrates prior information in the form of the context of the image surrounding the ROI. It does this by combining a dynamical system with a deep learning method for building a displacement policy model, and thus requires much less data that traditional deep learning methods. (2) The DPM is rotationally invariant. Because our next step policy predictor is trained with locally oriented patches, the orientation of the image with respect to the ROI is irrelevant. (3) The DPM is strongly transferable. Because the context of the segmentation boundary is considered, our method generalizes well to previously unseen images with the same or similar contexts.

2 Methodology

As shown in Fig. 1, the DPM uses a CNN-based policy model, trained on locally oriented patches from manually segmented data, to navigate an agent over a cardiac MRI image (256 $\times $ 256) using a locally oriented square patch (64 $\times $ 64) as its input. The agent creates a trajectory over the image tracing the boundary of the LV – no matter where the agent starts on the image. A crucial prerequisite of this methodology is the creation of a vector field whose limit cycle is equal to the boundary surrounding the ROI. This can be seen in Fig. 5b. In the following sections we will discuss the DPM methodology in detail, namely (1) the creation of a customized dynamic (i.e. a vector field) with a limit cycle around the ROI of the manually delineated images. (2) The creation of a patch-policy predictor. (3) The stopping criterion using the Poincaré map.

2.1 Generating a Customized Dynamic

A typical training dataset for segmentation consists of many image-to-label pairs. A label is a binary map that has the same resolution as its corresponding image. In each label, pixels of ground truth will be set to 1 while the background will be set to 0. Conversely, in our system, we firstly construct a customized dynamic (a vector field) for each labeled training instance. The constructed dynamic results in a unique limit cycle which is placed exactly on the boundary of the ROI.

To illustrate, let us consider an example indicated in Fig. 2. Consider a label of a training instance as a continuous 2D space $\mathbb {R}^2$ (a label with theoretical infinite resolution), we define the ground truth contour as a subspace $\varOmega \subseteq \mathbb {R}^2$ as shown in step (a) in Fig. 2. To construct a dynamic in $\mathbb {R}^2$ where a limit cycle exists and is exactly the boundary $\partial {\varOmega }$, we firstly introduce the distance function S(p):

$$\begin{aligned} S(p) =\left\{ \begin{array}{ll} d(p, \partial {\varOmega }) \quad &{} \text {if } p \text { is not on } \partial {\varOmega } \\ 0 \quad &{} \text {if } p \text { is on } \partial {\varOmega } \end{array} \right. \end{aligned}$$

(1)

$d(p, \partial {\varOmega })$ denotes the infimum Euclidean distance from p to the boundary $\partial {\varOmega }$. Equation 1 is used to create a scalar field from a binary image as shown in step (b) in Fig. 2. In order to build the customized dynamic, we need to create a vector field from this scalar field. A gradient operator is applied to create dynamic equivalent to the active contour [5] as shown in step (c) in Fig. 2. This gradient operator is expressed as Eq. 2.

$$\begin{aligned} \frac{dp}{dt} = \nabla _p{S(p)}, \end{aligned}$$

(2)

Our final step adds a limit cycle onto the system by gradually rotating the vectors according to the distance between each pixel and the boundary, as shown in Fig. 3b. The rotation function is given by $R(\theta )$,

$$\begin{aligned} R(\theta ) = \begin{bmatrix} \cos {\theta }&-\sin {\theta } \\ \sin {\theta }&\cos {\theta } \end{bmatrix} \end{aligned}$$

(3)

where $\theta $ is defined by Eq. 4.

$$\begin{aligned} \theta = \pi (1 - \mathbf {sigmoid}(S(p))) \end{aligned}$$

(4)

Putting Eqs. 2 and 4 together, we obtain Eq. 5.

$$\begin{aligned} \frac{dp}{dt} = R(\theta ) \nabla _p{S(p)}, \end{aligned}$$

(5)

Equation 5 has an important property: When $p \in \partial {\varOmega }$, $S(p) = 0$ so that $\theta $ is equal to $\frac{\pi }{2}$ according to Eq. 3. This means on the boundary, the direction of $\frac{dp}{dt}$ is equal to the tangent of $p \in \partial {\varOmega }$ as shown in step (d) in Fig. 2.

As opposed to active contour methods [5] where the dynamic is generated from images, we generate the discretized version of Eq. 5 for each label. Then, a vector field is generated from it for each training instance with the property that limit cycle of the field is the boundary of ROI. This process generates a set of tuples (image, label, dynamic). That is, for each cardiac image, we have its associated binary label image, and its corresponding vector field. In the next subsection, we introduce the methodology to learn a CNN which maps an image patch to a vector from our vector field (Fig. 3a). This allows us to create an agent which follows step-by-step displacement predictions.

2.2 Creating a Patch-Policy Predictor Using a CNN

Training. Our CNN operates over patches which are oriented with respect to our created dynamic. In order to prepare data for training, for each training image, we randomly choose a pre-defined proportion of points acting as the center of a rectangular sampling patch. We define a sampling direction which is equal to the velocity vector of the associated point. For example, for a given position $(x_0,y_0)$ on image, its velocity $(\delta x, \delta y)$ in the corresponding vector field is defined as the sampling direction, as shown in Fig. 4. In the training process, such vectors are easily accessible, however they must be predicted during inference (see next Subsect. 2.2). It is worth noting that a coordinate transformation is required to convert the velocity from the coordinate system of the dynamic to that of the patch, as illustrated in Fig. 4. In order to improve robustness, training data augmentation can be performed by adding symmetric offsets to the sampling directions (e.g. (+45$^\circ $, −45$^\circ $)). Our CNN is based on the AlexNet architecture [6] with two output neurons. During training we use Adam optimizer with the mean square error (MSE) loss.

Inference. At the inference stage, before the first time step $t=0$, we determine an initial, rough, starting point using a basic LV detection module and a random sampling direction. This ensures that we don’t start on an image boundary where there is insufficient input to create the first 64$\,\times \,$64 pixel patch, and that we have an initial sampling direction. At each step, given an position $p_t$ and a sampling direction $s_t$ of the agent (which is unknown and is thus inferred as the difference between the current sampling direction and the last), a local patch is extracted and used as the input to the CNN-based policy model. The policy model then predicts the displacement for the agent to move, which in turn leads to the next local patch sample. This process iterates until the limit cycle is reached as illustrated earlier (Fig. 1).

2.3 Stopping Criterion: The Poincaré Map

Instead of identifying the periodic orbit (the limit cycle) from the trajectory itself, we introduce the Poincaré section [7] which is a hyperplane, $\varSigma $, transversal to the trajectory. This cuts through the trajectory of the vector field, as seen in Fig. 5a. The stability of a periodic orbit in the image can be reflected by the procession of corresponding points of intersection in $\varSigma $ (a lower dimensional space). The Poincaré map is the function which maps successive intersection points with the previous point, and thus, when the mapping reaches a small enough value we may say that the procession of the agent in the image has converged to the boundary (the limit cycle). The convergence of customized dynamic has been studied using the Poincaré-Bendixson theorem [7], however the details are beyond the scope of this paper.

3 Experimental Setting and Results

In this study, we evaluate our method on (1) the Sunnybrook Cardiac Dataset (SCD) [8], which contains 45 cases and (2) the STACOM 2011 LV Segmentation Challenge, which contains 100 cases.

SCD Dataset. The DPM was trained on the given training subset. We applied our trained model to the validation and online subsets (800 images from 30 cases in total) to provide a fair comparison with previous research, and we present our findings in Table 1. We report the dice score, average perpendicular distance (APD) (in millimeters) and ‘good’ contour rate (Good) for both the endocardium (i) and epicardium (o). We obtained a mean Dice score of 0.94 with a mean sensitivity of 0.95 and a mean specificity of 1.00.

Transferability to the STACOM2011 Dataset. To demonstrate the strong transferability of our method we train on the training subset of the SCD dataset and test on the STACOM 2011 dataset. We performed myocardium segmentation by segmenting the endocardium and epicardium separately, using 100 randomly selected MRI images from 100 cases. We report the Dice index, sensitivity, specificity, positive and negative predictive values (PPV and NPV) in Table 2. We obtained a mean Dice index of 0.74 with a mean sensitivity of 0.84 and a mean specificity of 0.99.

Table 1. Comparison of LV endocardium and epicardium segmentation performance between DPM and previous research using the Sunnybrook Cardiac Dataset. Number format: mean value (standard deviation).

Full size table

Table 2. Comparison of myocardium segmentation performance by training on SCD data and testing on the STACOM 2011 LVSC dataset. Number format: mean value (standard deviation).

Full size table

4 Conclusion

In this paper we have presented the Deep Poincaré Map as a novel method for LV segmentation and demonstrate its promising performance. The developed DPM method is robust for medical images, which have limited spatial resolution, low SNR and indistinct object boundaries. By encoding prior knowledge of a ROI as a customized dynamic, fine grained learning is achieved resulting in a displacement policy model for iterative segmentation. This approach requires much less training data than traditional methods. The strong transferability and rotational invariance of the DPM can be also attributed to this patch-based policy learning strategy. These two advantages are crucial for clinical applications.

References

Avendi, M., Kheradvar, A., Jafarkhani, H.: A combined deep-learning and deformable-model approach to fully automatic segmentation of the left ventricle in cardiac MRI. Med. Image Anal. 30, 108–119 (2016)
Article Google Scholar
Tan, L.K., et al.: Convolutional neural network regression for short-axis left ventricle segmentation in cardiac cine MR sequences. Med. Image Anal. 39, 78–86 (2017)
Article Google Scholar
Ngo, T.A., Lu, Z., Carneiro, G.: Combining deep learning and level set for the automated segmentation of the left ventricle of the heart from cardiac cine magnetic resonance. Med. Image Anal. 35, 159–171 (2017)
Article Google Scholar
Xue, W., Brahm, G., Pandey, S., Leung, S., Li, S.: Full left ventricle quantification via deep multitask relationships learning. Med. Image Anal. 43, 54–65 (2018)
Article Google Scholar
Cootes, T.F., Edwards, G.J., Taylor, C.J.: Active appearance models
Google Scholar
Krizhevsky, A., et al.: ImageNet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 1–9 (2012)
Google Scholar
Parker, T.S., Chua, L.O.: Practical Numerical Algorithms for Chaotic Systems. Springer, New York (1989). https://doi.org/10.1007/978-1-4612-3486-9
Book MATH Google Scholar
Radau, P., Lu, Y., Connelly, K., Paul, G., Dick, A., Wright, G.: Evaluation framework for algorithms segmenting short axis cardiac MRI. MIDAS J. Card. MR Left Ventricle Segm. Chall. 49 (2009)
Google Scholar
Queirós, S., et al.: Fast automatic myocardial segmentation in 4D cine CMR datasets. Med. Image Anal. 18(7), 1115–1131 (2014)
Article Google Scholar
Ngo, T.A., Carneiro, G.: Left ventricle segmentation from cardiac MRI combining level set methods with deep belief networks. In: 2013 20th IEEE International Conference on Image Processing (ICIP), pp. 695–699. IEEE (2013)
Google Scholar
Hu, H., Liu, H., Gao, Z., Huang, L.: Hybrid segmentation of left ventricle in cardiac MRI using gaussian-mixture model and region restricted dynamic programming. Magn. Reson. Imaging 31(4), 575–584 (2013)
Article Google Scholar
Jolly, M.P., et al.: Automatic segmentation of the myocardium in cine MR images using deformable registration. In: Camara, O. (ed.) STACOM 2011. LNCS, vol. 7085, pp. 98–108. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-28326-0_10
Chapter Google Scholar
Margeta, J., Geremia, E., Criminisi, A., Ayache, N.: Layered spatio-temporal forests for left ventricle segmentation from 4D cardiac MRI data. In: Camara, O. (ed.) STACOM 2011. LNCS, vol. 7085, pp. 109–119. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-28326-0_11
Chapter Google Scholar

Download references

Acknowledgement

Yuanhan Mo is sponsored by Sultan Bin Khalifa International Thalassemia Award. Guang Yang is supported by the British Heart Foundation Project Grant (PG/16/78/32402). Jingqing Zhang is supported by LexisNexis HPCC Systems Academic Program. Thanks to TensorLayer Community.

Author information

Authors and Affiliations

Data Science Institute, Imperial College London, London, UK
Yuanhan Mo, Fangde Liu, Douglas McIlwraith, Jingqing Zhang & Yike Guo
National Heart and Lung Institute, Imperial College London, London, UK
Guang Yang
St George’s Hospital, University of London, London, UK
Taigang He

Authors

Yuanhan Mo
View author publications
You can also search for this author in PubMed Google Scholar
Fangde Liu
View author publications
You can also search for this author in PubMed Google Scholar
Douglas McIlwraith
View author publications
You can also search for this author in PubMed Google Scholar
Guang Yang
View author publications
You can also search for this author in PubMed Google Scholar
Jingqing Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Taigang He
View author publications
You can also search for this author in PubMed Google Scholar
Yike Guo
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yike Guo .

Editor information

Editors and Affiliations

University of Leeds, Leeds, UK
Alejandro F. Frangi
King’s College London, London, UK
Julia A. Schnabel
University of Pennsylvania, Philadelphia, PA, USA
Christos Davatzikos
Universidad de Valladolid, Valladolid, Spain
Carlos Alberola-López
Queen’s University, Kingston, ON, Canada
Gabor Fichtinger

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Mo, Y. et al. (2018). The Deep Poincaré Map: A Novel Approach for Left Ventricle Segmentation. In: Frangi, A., Schnabel, J., Davatzikos, C., Alberola-López, C., Fichtinger, G. (eds) Medical Image Computing and Computer Assisted Intervention – MICCAI 2018. MICCAI 2018. Lecture Notes in Computer Science(), vol 11073. Springer, Cham. https://doi.org/10.1007/978-3-030-00937-3_64

Download citation

DOI: https://doi.org/10.1007/978-3-030-00937-3_64
Published: 13 September 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-00936-6
Online ISBN: 978-3-030-00937-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us