SANet: A Slice-Aware Network for Pulmonary Nodule Detection
Jie Mei¹, Ming-Ming Cheng¹, Gang Xu¹, Lan-Ruo Wan², Huan Zhang²
¹TKLNDST, CS, Nankai University ²InferVision
Abstract
Lung cancer is the most common cause of cancer death worldwide. Timely diagnosis of pulmonary nodules makes it possible to detect lung cancer at an early stage, and thoracic computed tomography (CT) provides a convenient way to diagnose nodules. However, even experienced doctors find it hard to pick nodules out of the large number of CT slices. Existing nodule datasets are limited in both scale and category, which greatly restricts their applications. In this paper, we collect PN9, the largest and most diverse dataset for pulmonary nodule detection to date. Specifically, it contains 8,798 CT scans and 40,439 annotated nodules from 9 common classes. We further propose a slice-aware network (SANet) for pulmonary nodule detection. A slice grouped non-local (SGNL) module is developed to capture long-range dependencies among any positions and any channels of one slice group in the feature map. We also introduce a 3D region proposal network to generate pulmonary nodule candidates with high sensitivity, although this detection stage usually produces many false positives. Subsequently, a false positive reduction (FPR) module is proposed that uses multi-scale feature maps. To verify the performance of SANet and the significance of PN9, we perform extensive experiments comparing against several state-of-the-art 2D CNN-based and 3D CNN-based detection methods. Promising evaluation results on PN9 demonstrate the effectiveness of the proposed SANet.
Paper
- SANet: A Slice-Aware Network for Pulmonary Nodule Detection, Jie Mei, Ming-Ming Cheng, Gang Xu, Lan-Ruo Wan, Huan Zhang, IEEE TPAMI, 2021. [Project Page] [PDF] bib [Code][Dataset]
Comparison with Other Pulmonary Nodule Datasets
Dataset | Year | Scans | Nodules | Class | Available |
---|---|---|---|---|---|
ANODE09 | 2010 | 55 | 710 | 4 | Yes |
LIDC-IDRI | 2011 | 1,018 | 2,562 | 3 | Yes |
LUNA16 | 2016 | 888 | 1,186 | 2 | Yes |
DSB 2017 | 2017 | 2,101 | N/A | 2 | No |
PN9 | 2021 | 8,798 | 40,439 | 9 | Yes |
Table 1. 'Scans' indicates the number of CT scans, 'Nodules' the number of labeled nodules, 'Class' the number of classes, and 'Available' whether the dataset is publicly available.
In Table 1, we compare PN9 with several existing pulmonary nodule datasets. Compared to the widely used LUNA16 dataset, PN9 contains roughly 10 times as many CT scans (8,798 vs. 888) and over 30 times as many annotated nodules (40,439 vs. 1,186). As for class diversity, the other datasets have at most four classes; LIDC-IDRI, for example, has only three: nodule >= 3 mm, nodule < 3 mm, and non-nodule. Due to these limitations, it is difficult to apply most existing nodule datasets in clinical practice. In contrast, PN9 contains many CT scans and 9 classes, which supports both detection and classification of pulmonary nodules and allows researchers to design more effective algorithms for different types of nodules. PN9 also contains more small pulmonary nodules, such as 0-3mm solid nodules and 0-5mm ground-glass nodules. This helps identify small nodules more accurately, so that doctors can diagnose and treat patients earlier. In summary, our dataset is not only larger than previous datasets but also more diverse.
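As a quick check of the scale comparison, the ratios can be recomputed from the table. This is only a sketch: the numbers are copied from the LUNA16 and PN9 rows of the table, and the helper name `ratio` is ours, not part of any released code.

```python
# (scans, nodules) as listed in the comparison table
LUNA16 = {"scans": 888, "nodules": 1186}
PN9 = {"scans": 8798, "nodules": 40439}


def ratio(target, reference):
    """Return how many times larger `target` is than `reference`, per column."""
    return {key: target[key] / reference[key] for key in reference}


r = ratio(PN9, LUNA16)
print(f"scans: {r['scans']:.1f}x, nodules: {r['nodules']:.1f}x")
# scans: 9.9x, nodules: 34.1x

# Average number of annotated nodules per scan in each dataset
density_pn9 = PN9["nodules"] / PN9["scans"]
density_luna16 = LUNA16["nodules"] / LUNA16["scans"]
print(f"nodules per scan: PN9 {density_pn9:.2f}, LUNA16 {density_luna16:.2f}")
# nodules per scan: PN9 4.60, LUNA16 1.34
```

The scan ratio comes out just under 10, while the nodule ratio is about 34, so PN9 is also annotated more densely per scan than LUNA16.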
Dataset Properties
Our PN9 has a hierarchical class structure, and its detailed taxonomy is shown in Fig. 2. According to their properties, all nodules in our dataset are first divided into four upper-level classes (denoted as super-classes): solid nodule (SN), part-solid nodule (PSN), ground-glass nodule (GGN), and calcific nodule (CN). To satisfy the practical demands of doctors and hospitals, we further subdivide the super-classes according to medical guidelines. Each nodule is assigned a subordinate class (denoted as sub-class) within its super-class based on the nodule size. For example, the sub-class 0-3mm solid nodules (denoted as 0-3SN) comprises all nodules identified as solid nodules whose largest in-plane dimension is in the range of 0-3 mm. This yields 9 sub-classes in total. The statistics of nodules in each class are shown in Fig. 2 (a). In Fig. 2 (b-c), we show the mutual dependencies among super-classes and sub-classes, respectively. A wider link between two classes indicates a higher probability that nodules of both classes appear in the same patient. For example, a patient diagnosed with ground-glass nodules is also likely to have solid nodules.
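One plausible way to obtain such class dependencies is to count, for every pair of classes, the fraction of patients in which both occur. The sketch below assumes exactly that definition; the page does not state how the link widths in Fig. 2 were computed, and both the function name `cooccurrence_probabilities` and the toy patient data are made up for illustration.

```python
from collections import Counter
from itertools import combinations


def cooccurrence_probabilities(patient_classes):
    """Estimate how often two nodule classes appear in the same patient.

    patient_classes: iterable of collections of class labels, one per patient.
    Returns a dict mapping an alphabetically sorted pair (a, b) to the
    fraction of patients that have nodules of both classes.
    """
    patients = [set(classes) for classes in patient_classes]
    counts = Counter()
    for classes in patients:
        # Every unordered pair of distinct classes found in this patient
        for pair in combinations(sorted(classes), 2):
            counts[pair] += 1
    total = len(patients)
    return {pair: n / total for pair, n in counts.items()}


# Toy data (hypothetical, not taken from PN9): super-classes per patient
toy_patients = [
    {"SN", "GGN"},
    {"SN"},
    {"GGN", "SN", "CN"},
    {"PSN"},
]
probs = cooccurrence_probabilities(toy_patients)
print(probs[("GGN", "SN")])  # 0.5
```

The same function works for sub-classes if the labels are sub-class names such as 0-3SN.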
Method
We propose a slice-aware network (SANet) for pulmonary nodule detection. We first introduce an encoder-decoder network to learn nodule features, since nodules are much smaller than common objects in natural images. Following the way doctors read CT scans, we propose a slice grouped non-local (SGNL) module and add it to the encoder network. SGNL is able to capture long-range dependencies among any positions and any channels of one slice group in the feature map. A 3D region proposal network is then introduced to generate pulmonary nodule candidates with high sensitivity, although this detection stage usually produces many false positives. Subsequently, we develop a false positive reduction (FPR) module that uses multi-scale feature maps.
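The grouping idea behind SGNL can be illustrated with a small NumPy sketch. It is a simplification under stated assumptions, not the authors' implementation: it assumes a dot-product kernel (so the attention over all positions and channels of a group collapses to one scalar per group), uses plain matrices in place of learned 1x1x1 convolutions, and omits normalization layers. All names (`slice_grouped_non_local`, `w_theta`, `num_groups`, and so on) are hypothetical.

```python
import numpy as np


def conv1x1x1(x, weight):
    """Mix channels at every voxel. x: (C_in, D, H, W), weight: (C_out, C_in)."""
    return np.einsum("oc,cdhw->odhw", weight, x)


def slice_grouped_non_local(x, w_theta, w_phi, w_g, w_z, num_groups):
    """Simplified slice-grouped non-local operation with a dot-product kernel.

    x: feature map of shape (C, D, H, W), where D indexes slices.
    w_theta, w_phi, w_g: (C_mid, C) channel-mixing weights.
    w_z: (C, C_mid) output projection.
    num_groups: number of slice groups along D; must divide D.
    """
    depth = x.shape[1]
    if depth % num_groups != 0:
        raise ValueError("num_groups must divide the number of slices")

    theta = conv1x1x1(x, w_theta)
    phi = conv1x1x1(x, w_phi)
    g = conv1x1x1(x, w_g)

    outputs = []
    for t_grp, p_grp, g_grp in zip(
        np.split(theta, num_groups, axis=1),
        np.split(phi, num_groups, axis=1),
        np.split(g, num_groups, axis=1),
    ):
        n = t_grp.size  # all positions and channels of this slice group
        # Dot-product kernel: sum_j (theta_i * phi_j) * g_j = theta_i * <phi, g>,
        # so the full n-by-n affinity matrix never has to be built.
        att = np.dot(p_grp.ravel(), g_grp.ravel()) / np.sqrt(n)
        outputs.append(att * t_grp)

    y = np.concatenate(outputs, axis=1)
    return x + conv1x1x1(y, w_z)  # residual connection


rng = np.random.default_rng(0)
x = rng.standard_normal((4, 8, 6, 6))  # 4 channels, 8 slices of 6x6
w_theta, w_phi, w_g = (rng.standard_normal((2, 4)) * 0.1 for _ in range(3))
w_z = rng.standard_normal((4, 2)) * 0.1
out = slice_grouped_non_local(x, w_theta, w_phi, w_g, w_z, num_groups=2)
print(out.shape)  # (4, 8, 6, 6)
```

Because attention is computed per slice group, changing the features of one group leaves the output of the other groups untouched, which is the property that distinguishes this from a non-local operation over the whole volume.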
Terms of Use
This dataset belongs to the Media Computing Lab at Nankai University and is licensed under a Creative Commons Attribution 4.0 License.
Citation
It would be highly appreciated if you could cite our paper when using our dataset and code:
@ARTICLE{21PAMI-SANet,
  title={SANet: A Slice-Aware Network for Pulmonary Nodule Detection},
  author={Jie Mei and Ming-Ming Cheng and Gang Xu and Lan-Ruo Wan and Huan Zhang},
  journal={IEEE Transactions on Pattern Analysis and Machine Intelligence},
  year={2021},
  doi={10.1109/TPAMI.2021.3065086}
}
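Hello, in your GitHub code, in the forward function at line 107 of SANet/net/sanet.py, is the returned x a mistake? It should be the first-layer output obtained after passing through resnet50, i.e., x1.
How can I obtain your dataset?
Hello, the dataset has been released. Please click the [Dataset] link on this page for instructions on how to obtain it.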