TY - JOUR
T1 - Lung tumor segmentation methods
T2 - Impact on the uncertainty of radiomics features for non-small cell lung cancer
AU - Owens, Constance A.
AU - Peterson, Christine B.
AU - Tang, Chad
AU - Koay, Eugene J.
AU - Yu, Wen
AU - Mackin, Dennis S.
AU - Li, Jing
AU - Salehpour, Mohammad R.
AU - Fuentes, David T.
AU - Court, Laurence E.
AU - Yang, Jinzhong
N1 - Publisher Copyright:
© 2018 Public Library of Science. All rights reserved.
PY - 2018/10
Y1 - 2018/10
N2 - Purpose To evaluate the uncertainty of radiomics features from contrast-enhanced breath-hold helical CT scans of non-small cell lung cancer for both manual and semi-Automatic segmentation due to intra-observer, inter-observer, and inter-software reliability. Methods Three radiation oncologists manually delineated lung tumors twice from 10 CT scans using two software tools (3D-Slicer and MIM Maestro). Additionally, three observers without formal clinical training were instructed to use two semi-Automatic segmentation tools, Lesion Sizing Toolkit (LSTK) and GrowCut, to delineate the same tumor volumes. The accuracy of the semi-Automatic contours was assessed by comparison with physician manual contours using Dice similarity coefficients and Hausdorff distances. Eighty-Three radiomics features were calculated for each delineated tumor contour. Informative features were identified based on their dynamic range and correlation to other features. Feature reliability was then evaluated using intra-class correlation coefficients (ICC). Feature range was used to evaluate the uncertainty of the segmentation methods. Results From the initial set of 83 features, 40 radiomics features were found to be informative, and these 40 features were used in the subsequent analyses. For both intra-observer and interobserver reliability, LSTK had higher reliability than GrowCut and the two manual segmentation tools. All observers achieved consistently high ICC values when using LSTK, but the ICC value varied greatly for each observer when using GrowCut and the manual segmentation tools. For inter-software reliability, features were not reproducible across the software tools for either manual or semi-Automatic segmentation methods. Additionally, no feature category was found to be more reproducible than another feature category. Feature ranges of LSTK contours were smaller than those of manual contours for all features. Conclusion Radiomics features extracted from LSTK contours were highly reliable across and among observers. With semi-Automatic segmentation tools, observers without formal clinical training were comparable to physicians in evaluating tumor segmentation.
AB - Purpose To evaluate the uncertainty of radiomics features from contrast-enhanced breath-hold helical CT scans of non-small cell lung cancer for both manual and semi-Automatic segmentation due to intra-observer, inter-observer, and inter-software reliability. Methods Three radiation oncologists manually delineated lung tumors twice from 10 CT scans using two software tools (3D-Slicer and MIM Maestro). Additionally, three observers without formal clinical training were instructed to use two semi-Automatic segmentation tools, Lesion Sizing Toolkit (LSTK) and GrowCut, to delineate the same tumor volumes. The accuracy of the semi-Automatic contours was assessed by comparison with physician manual contours using Dice similarity coefficients and Hausdorff distances. Eighty-Three radiomics features were calculated for each delineated tumor contour. Informative features were identified based on their dynamic range and correlation to other features. Feature reliability was then evaluated using intra-class correlation coefficients (ICC). Feature range was used to evaluate the uncertainty of the segmentation methods. Results From the initial set of 83 features, 40 radiomics features were found to be informative, and these 40 features were used in the subsequent analyses. For both intra-observer and interobserver reliability, LSTK had higher reliability than GrowCut and the two manual segmentation tools. All observers achieved consistently high ICC values when using LSTK, but the ICC value varied greatly for each observer when using GrowCut and the manual segmentation tools. For inter-software reliability, features were not reproducible across the software tools for either manual or semi-Automatic segmentation methods. Additionally, no feature category was found to be more reproducible than another feature category. Feature ranges of LSTK contours were smaller than those of manual contours for all features. Conclusion Radiomics features extracted from LSTK contours were highly reliable across and among observers. With semi-Automatic segmentation tools, observers without formal clinical training were comparable to physicians in evaluating tumor segmentation.
UR - http://www.scopus.com/inward/record.url?scp=85054465875&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85054465875&partnerID=8YFLogxK
U2 - 10.1371/journal.pone.0205003
DO - 10.1371/journal.pone.0205003
M3 - Article
C2 - 30286184
AN - SCOPUS:85054465875
SN - 1932-6203
VL - 13
JO - PLoS ONE
JF - PLoS ONE
IS - 10
M1 - e0205003
ER -