1 Introduction

Lung cancer is one of the leading causes of cancer-related death. Early detection and diagnosis are essential for decreasing lung cancer-related deaths. With the development of high-resolution 3D computed tomography (CT) imaging, more precise treatment of the disease has become feasible. For accurate diagnosis and treatment, extraction of the airway tree from the 3D CT volume is an important step in analyzing various diseases. However, the segmentation of airways from CT volumes is very difficult due to limited image intensity contrast and increased noise.

Many studies have focused on automated bronchus recognition from 3D chest CT volumes. Lo et al. [1] summarized airway segmentation methods within the EXACT09 challenge. Although most of the conventional approaches can extract the trachea and principal bronchi successfully, they fail when it comes to the extraction of the peripheral bronchi. To extract more branches in peripheral areas, we introduce the concept of a “volume of interest” (VOI) that divides a given CT image into subimages, each containing a bronchial branch region [2]. Each bronchus branch is segmented inside its subimage, and the results are unified to form an integrated airway tree.

In recent years, fully convolutional networks (FCNs) have outperformed state-of-the-art methods in many segmentation tasks [3]. U-Net is one example that consists of a contracting encoder part to analyze the whole image and a successive decoder part to generate an integrated segmentation result [4]. 3D U-Net is the 3D extension of U-Net that replaces all 2D operations with their 3D counterparts [5].

In this paper, we build upon a method that segments the airway using an FCN within the VOI tracking framework [2]. The main contribution of our work is the integration of 3D U-Net into VOI tracking along the centerlines of the airway. At the same time, the segmentation results are used to guide airway extraction at the next branching level. Compared to other state-of-the-art airway tracking and segmentation methods, our proposed method increases the detection rate and reduces the false positive (FP) rate.

Fig. 1.

Flowchart of the proposed method

2 Method

Figure 1 shows the workflow of our method. We first segment the trachea by region growing and set the initial VOI based on the segmented result. 3D U-Net is then used to segment the airway region in each VOI sequentially. To obtain the bronchus regions with the 3D U-Net, three kinds of VOI of different sizes are defined. The input volume of the network has a size of \(132\times 132\times 116\) voxels, and the output volume has a size of \(44\times 44\times 28\) voxels. The size of the VOIs used in the airway tracking algorithm is adjusted dynamically depending on the diameter of the bronchi.
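These input and output sizes are consistent with an unpadded 3D U-Net: each pair of \(3\times 3\times 3\) convolutions shrinks a spatial dimension by four, \(2\times 2\times 2\) pooling halves it, and a \(2\times 2\times 2\) up-convolution doubles it. A short Python sketch (illustrative only; the function name is ours) verifies the arithmetic:

```python
def unet3d_output_size(size, levels=4):
    """Propagate one spatial dimension through an unpadded 3D U-Net.

    Two 3x3x3 convolutions shrink a dimension by 4; 2x2x2 max pooling
    halves it; a 2x2x2 up-convolution doubles it.
    """
    s = size
    for _ in range(levels - 1):      # analysis path levels
        s = (s - 4) // 2             # two convolutions, then pooling
    s -= 4                           # bottom-level convolutions
    for _ in range(levels - 1):      # synthesis path levels
        s = s * 2 - 4                # up-convolution, then two convolutions
    return s
```

With four resolution levels this maps 132 to 44 and 116 to 28, matching the VOI sizes above.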

2.1 3D U-Net Based Airway Tracking Algorithm

The employed network architecture is the 3D extension of U-Net proposed by Ronneberger et al. [4], which was initially utilized for biomedical image analysis. While U-Net is an entirely 2D architecture, the network utilized in this paper applies all operations in their 3D versions, such as 3D convolutions, 3D max pooling, and 3D up-convolutions [5]. We use the open-source implementation of 3D U-Net developed in Caffe [6]. The 3D U-Net architecture consists of an analysis and a synthesis path with four resolution levels each. Each resolution level in the analysis path contains two \(3\times 3\times 3\) convolutional layers, each followed by a rectified linear unit (ReLU), and then a \(2\times 2\times 2\) max pooling with strides of two in each dimension. In the synthesis path, each resolution level starts with a \(2\times 2\times 2\) up-convolution with strides of two in each dimension, followed by two \(3\times 3\times 3\) convolutions, each with a ReLU. Furthermore, 3D U-Net employs shortcut (skip) connections from layers of equal resolution in the analysis path in order to provide higher-resolution features to the synthesis path [5]. The last layer contains a \(1\times 1\times 1\) convolution that reduces the number of output channels to the number of class labels, which is \(K = 2\) in our airway extraction case. The model is trained to minimize a weighted voxel-wise cross-entropy loss:

$$\begin{aligned} \small \mathcal {L} \ = \ \frac{-1}{N}\left\{ \lambda \times \left( \sum _{x \in N_0}\log {\left( \hat{p}_{k}(x)\right) } \right) +(1 - \lambda ) \times \left( \sum _{x\in N_1}\log {\left( \hat{p}_{k}(x)\right) } \right) \right\} , \end{aligned}$$
(1)

where \(\hat{p}_{k}(x)\) is the softmax output probability of class \(k\) at voxel \(x\), \(\lambda \) is a weight factor, \(N\) is the total number of voxels, \(N_0\) and \(N_1\) denote the sets of background (lung tissue) and foreground (airway) voxels, respectively, and \(k \in \{0,1\}\) is the corresponding class label. The softmax is applied to the real-valued predictions of the last convolutional layer. The weight \(\lambda \) in Eq. 1 balances the lung tissue background against the airway foreground. We choose \({\lambda } = \frac{N_1}{N}\), where \(N\) is the number of voxels in the lung region and \(N_1\) is the number of airway (foreground) voxels. Figure 2 shows examples of airway regions extracted by 3D U-Net.
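As an illustration, Eq. 1 with \(\lambda = N_1/N\) can be sketched in NumPy as follows (a minimal sketch; the function name and array layout are our assumptions, not taken from the paper's Caffe implementation):

```python
import numpy as np

def weighted_voxel_ce(probs, labels):
    """Weighted voxel-wise cross-entropy of Eq. 1.

    probs:  array (2, D, H, W) of softmax class probabilities p_hat_k
    labels: array (D, H, W), 0 = lung tissue (background), 1 = airway
    """
    n = labels.size                          # N: total number of voxels
    n1 = np.count_nonzero(labels == 1)       # N_1: airway voxels
    lam = n1 / n                             # lambda = N_1 / N
    eps = 1e-12                              # guard against log(0)
    bg = labels == 0                         # voxel set N_0
    fg = labels == 1                         # voxel set N_1
    loss_bg = np.sum(np.log(probs[0][bg] + eps))
    loss_fg = np.sum(np.log(probs[1][fg] + eps))
    return -(lam * loss_bg + (1.0 - lam) * loss_fg) / n
```

Since airway voxels are rare, \(\lambda\) is small, which down-weights the abundant background term relative to the foreground term.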

Fig. 2.

Extraction results by 3D U-Net.

2.2 VOI Placement

Due to the requirements of 3D U-Net, three kinds of VOIs are defined, as shown in Fig. 3. The first is \(V_{TRACK}\), which is set to contain a bronchial branch in the airway tracking algorithm. Since the 3D U-Net requires input images of a constant size, we prepare two wider VOIs, \(V_{SEG}\) and \(V_{3DU}\). The \(V_{SEG}\), with a size of \(44\times 44\times 28\) voxels, stores the segmentation output of the 3D U-Net. The \(V_{3DU}\), with a size of \(132\times 132\times 116\) voxels, is used as the input of the 3D U-Net for training and testing. The height of \(V_{TRACK}\) is 28 voxels, and its length and width are twice the average diameter of the airway region. We resample the segmentation result from \(V_{SEG}\), save it in \(V_{TRACK}\), and reconstruct all of the candidate airway regions sequentially to form a complete airway tree.
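The three VOI sizes can be summarized in a few lines of Python (the constant and function names are ours, for illustration only; only the \(V_{TRACK}\) shape depends on the tracked branch):

```python
V_3DU_SHAPE = (132, 132, 116)   # fixed network input VOI
V_SEG_SHAPE = (44, 44, 28)      # fixed network output VOI

def track_voi_shape(avg_diameter_vox):
    """Edge lengths of V_TRACK in voxels: the height is fixed at 28,
    and length and width are twice the average airway diameter."""
    side = int(round(2 * avg_diameter_vox))
    return (side, side, 28)
```

For example, a branch with an average diameter of 10 voxels yields a \(20\times 20\times 28\) tracking VOI.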

Fig. 3.

Illustration of the three different kinds of VOI. (a) \(V_{SEG}\) and \(V_{TRACK}\) share the same origin point; the orange VOI is \(V_{SEG}\) and the blue VOI is \(V_{TRACK}\). (b) \(V_{3DU}\) corresponds to \(V_{SEG}\); the red VOI is \(V_{3DU}\).

2.3 Airway Tracking Algorithm

In this section, we introduce the 3D U-Net-based VOI airway tracking algorithm (3D U-VOI). The tracking starts by placing a VOI at the trachea, and child VOIs are then placed from the root along the centerlines of the segmented airway tree. In each VOI, an airway region is extracted by 3D U-VOI. The size of each VOI is determined according to the diameter of the airway region, and the VOI direction is determined by the centerline extracted in the previous VOI. The airway tracking algorithm is outlined as follows:

  • 1. Trachea extraction: A trachea region is roughly segmented by a method similar to [7]. The threshold value used in the region growing method is manually determined to extract the trachea region.

  • 2. VOI placement: A VOI is defined as a cuboid specified by two points, \(\mathbf {P} _1\) and \(\mathbf {P} _2\), which are the centers of its two end faces, as shown in Fig. 3(a); the origin point \(\mathbf {P} _1\) determines the position of the VOI.

  • 3. 3D U-Net-based airway region extraction in VOI: After trachea extraction, the 3D U-Net is used to extract the bronchial regions in each VOI.

  • 4. Leakage removal: After obtaining the airway regions in each VOI, 3D connected-component labeling is used to remove small spurious components detected by 3D U-Net.

  • 5. Furcation detection: Furcation regions are detected by analyzing the number of connected components \(N_c\) of the airway region on the VOI surface, which indicates the number of child branches. The following conditions are checked iteratively in each VOI:

    • \(\mathbf{N_c = 0.} \) In this VOI, no bronchus region is detected, and tracing is terminated.

    • \(\mathbf{N_c = 1.} \) In this VOI, no furcation is detected, and tracing of the bronchial region continues.

    • \(\mathbf{N_c = 2}\) or \(\mathbf 3.\) In this VOI, a bifurcation or trifurcation has been detected.

  • 6. VOI extension: We extend the VOI along the bronchus running direction by one voxel. After that, Step 3 is processed again.

  • 7. Branching point detection: We apply the gradient vector flow (GVF) magnitude and tubular-likeness function based on GVF to extract the centerline [9]. The branching point is detected based on this centerline.

  • 8. Child VOI placement: After branching point detection, the child VOIs \(V_{TRACK}\) are placed based on the detected branching point and the center points of the connected components on the VOI surface. In addition to \(V_{TRACK}\), the corresponding \(V_{3DU}\) and \(V_{SEG}\) are placed for the 3D U-Net, as shown in Fig. 3(b). After child VOI placement, the extraction procedure is repeated [8].

  • 9. Airway tree reconstruction: Finally, all of the candidate bronchus regions are reconstructed to form an integrated airway tree.
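The furcation check of Step 5 can be sketched as follows, assuming the tracking direction is the third axis of the VOI array. The helper name is ours, and as a simplification it inspects only the distal face of the VOI rather than the full surface:

```python
import numpy as np
from scipy.ndimage import label

def count_surface_components(seg_voi):
    """Count connected components N_c of the airway mask on the distal
    face of the VOI; N_c indicates the number of child branches.
    (Simplification: the paper analyzes the VOI surface, here only the
    face in the tracking direction is checked.)"""
    face = seg_voi[:, :, -1]        # distal face along the tracking axis
    _, n_c = label(face)            # 2D connected-component labeling
    return n_c
```

With \(N_c = 0\) tracing terminates, \(N_c = 1\) continues a single branch, and \(N_c = 2\) or \(3\) triggers child VOI placement.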

3 Experiment and Results

To evaluate our proposed method, we used 50 clinical CT scans acquired with standard dose. Each slice was \(512\times 512\) pixels with a pixel size in the range of 0.625–0.976 mm. The number of slices per dataset ranged from 239 to 962, with slice thicknesses varying from 0.625 to 2.00 mm. The dataset was randomly divided into two groups: a training set containing 30 cases and a testing set containing 20 cases. We performed two comparisons. The first compares the results of the airway tracking algorithm with 3D U-Net against the results of using 3D U-Net alone in a sliding-window approach without tracking information. In the sliding-window U-Net method, windows are generated randomly around candidate regions obtained by dilating the ground truth data by ten voxels. Two indices are used for evaluation: (a) the Dice similarity coefficient (DSC), and (b) the false positive rate (FPR), measured on extracted voxels that are not bronchial voxels according to the ground truth. The results are given in Fig. 4 and Table 1.
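The two voxel-wise indices can be computed as sketched below (the function name is ours; since the paper does not state the FPR denominator, we normalize by the extracted region as one plausible choice):

```python
import numpy as np

def dice_and_fpr(pred, gt):
    """Voxel-wise Dice similarity coefficient and false positive rate.

    pred, gt: binary 3D arrays (extracted region, ground truth).
    FPR follows the paper's description: extracted voxels that are not
    bronchial in the ground truth, here normalized by the extracted volume.
    """
    pred = pred.astype(bool)
    gt = gt.astype(bool)
    tp = np.count_nonzero(pred & gt)            # correctly extracted voxels
    fp = np.count_nonzero(pred & ~gt)           # falsely extracted voxels
    dsc = 2 * tp / (np.count_nonzero(pred) + np.count_nonzero(gt))
    fpr = fp / max(np.count_nonzero(pred), 1)   # avoid division by zero
    return dsc, fpr
```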

Fig. 4.

Comparison between the result of the airway tracking algorithm with 3D U-Net and the result of using 3D U-Net alone.

Table 1. Comparison of airway tree segmentation results in 20 chest CT examination volumes between the sliding-window-based 3D U-Net and the tracking-based 3D U-Net.

We also compared our proposed method with two other methods: Kitasaka's method [2], based on VOIs and region growing, and Meng's method [10], based on local intensity analysis and machine learning techniques. The segmentation results obtained by applying our proposed method to each of the 20 chest CT examinations are provided in Table 2.

We measured four indices for evaluation: (a) the number of branches extracted; (b) the true positive rate (TPR), the ratio of correctly extracted branches to the total number of branches in the ground truth; (c) the false positive rate (FPR), measured on extracted branches that do not belong to the ground truth; and (d) the Dice similarity coefficient (DSC).

Table 2. Comparison of airway tree segmentation results in 20 chest CT examination volumes. TPR and FPR denote the true positive and false positive rates of detected branches, respectively, and DSC is the Dice similarity coefficient. (Note: the ground truth data were generated manually, and BD indicates branch detection.)

Training: In the training phase, we input subvolumes of the 3D CT data selected from the lung area together with the corresponding ground truth data. We dilated the ground truth by ten voxels to obtain candidate regions and randomly selected subvolume images inside these regions. Data augmentation with random 3D rotations over the full 360\(^\circ \) range was performed on-the-fly; each iteration generates one subvolume image, so the 35k training iterations correspond to 35k subvolumes. Training on 30 cases took two days for 35k iterations on an NVIDIA GeForce GTX TITAN X with 12 GB of memory.
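The on-the-fly sampling and augmentation can be sketched with SciPy as follows. The helper names are ours, and rotating about a single randomly chosen axis pair is a simplified stand-in for a full random 3D rotation:

```python
import numpy as np
from scipy.ndimage import binary_dilation, rotate

def sample_candidate_center(gt, rng):
    """Dilate the binary ground truth by ten voxels to form the candidate
    region, then pick a random voxel inside it as the crop center."""
    candidates = binary_dilation(gt, iterations=10)
    idx = np.argwhere(candidates)
    return idx[rng.integers(len(idx))]

def augment_subvolume(sub, rng):
    """Random rotation by an angle in [0, 360) degrees about a randomly
    chosen axis pair (simplified 3D rotation augmentation)."""
    angle = rng.uniform(0.0, 360.0)
    axes = tuple(int(a) for a in rng.choice(3, size=2, replace=False))
    return rotate(sub, angle, axes=axes, reshape=False, order=1)
```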

Testing: The \(V_{3DU}\) is obtained in the airway tracking algorithm. After fixing its size and direction, we input \(V_{3DU}\) into the trained network to obtain the candidate airway areas and save them into \(V_{SEG}\). The result is then re-sampled into the \(V_{TRACK}\) used in the airway tracking algorithm by cubic spline interpolation.
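The re-sampling from \(V_{SEG}\) into the dynamically sized \(V_{TRACK}\) can be sketched with SciPy's spline-based `zoom` (an illustrative helper; the function name and the 0.5 re-binarization threshold are our assumptions):

```python
import numpy as np
from scipy.ndimage import zoom

def resample_to_track(seg_voi, track_shape):
    """Resample the 3D U-Net output stored in V_SEG (44x44x28) into the
    V_TRACK shape by cubic spline interpolation (order=3), then
    re-binarize the interpolated values."""
    factors = [t / s for t, s in zip(track_shape, seg_voi.shape)]
    resampled = zoom(seg_voi.astype(np.float32), factors, order=3)
    return (resampled > 0.5).astype(np.uint8)
```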

Figure 5 shows the evaluation results by the proposed method, as well as comparison with two other methods [2, 10].

Fig. 5.

Comparison of airway segmentation results of the proposed, previous methods [2, 10], and ground truth. Upper and lower rows show the results of Case 3 and 8, respectively.

4 Discussion

As can be seen in Fig. 4 and Table 1, the sliding-window 3D U-Net can extract many peripheral bronchi but also generates many false positive (FP) regions. In contrast, the proposed method extracts many peripheral bronchi while effectively avoiding FP regions. The 3D U-Net is effective at detecting peripheral bronchi in chest CT images; however, it produces many FP detections when applied in a straightforward sliding-window approach as proposed in [5]. After combining the 3D U-Net with the airway tracking algorithm, the number of false positive regions decreases significantly. Table 2 and Fig. 5 demonstrate that our proposed method outperforms the other two methods. The automatic airway segmentations shown in Fig. 5 further illustrate that our proposed method can extract more peripheral bronchi than the method by Kitasaka et al. [2] and more complete branches than the method by Meng et al. [10]. Our proposed method utilizes 3D U-Net to extract the airway region in each VOI, which effectively extracts the bronchus region and avoids leakage.

5 Conclusion

This paper introduced an effective integration of 3D U-Net into a VOI-based airway tracking and segmentation algorithm. The proposed method improved segmentation accuracy significantly. Due to GPU memory restrictions, the input volume size of the network is fixed; in future work, networks with dynamic input sizes could be explored.