TY - GEN
T1 - A dimension reduction strategy for improving the efficiency of computer-aided detection for CT colonography
AU - Song, Bowen
AU - Zhang, Guopeng
AU - Wang, Huafeng
AU - Zhu, Wei
AU - Liang, Zhengrong
PY - 2013
Y1 - 2013
N2 - Various types of features, e.g., geometric features, texture features, projection features etc., have been introduced for polyp detection and differentiation tasks via computer aided detection and diagnosis (CAD) for computed tomography colonography (CTC). Although these features together cover more information of the data, some of them are statistically highly-related to others, which made the feature set redundant and burdened the computation task of CAD. In this paper, we proposed a new dimension reduction method which combines hierarchical clustering and principal component analysis (PCA) for false positives (FPs) reduction task. First, we group all the features based on their similarity using hierarchical clustering, and then PCA is employed within each group. Different numbers of principal components are selected from each group to form the final feature set. Support vector machine is used to perform the classification. The results show that when three principal components were chosen from each group we can achieve an area under the curve of receiver operating characteristics of 0.905, which is as high as the original dataset. Meanwhile, the computation time is reduced by 70% and the feature set size is reduce by 77%. It can be concluded that the proposed method captures the most important information of the feature set and the classification accuracy is not affected after the dimension reduction. The result is promising and further investigation, such as automatically threshold setting, are worthwhile and are under progress.
AB - Various types of features, e.g., geometric features, texture features, projection features etc., have been introduced for polyp detection and differentiation tasks via computer aided detection and diagnosis (CAD) for computed tomography colonography (CTC). Although these features together cover more information of the data, some of them are statistically highly-related to others, which made the feature set redundant and burdened the computation task of CAD. In this paper, we proposed a new dimension reduction method which combines hierarchical clustering and principal component analysis (PCA) for false positives (FPs) reduction task. First, we group all the features based on their similarity using hierarchical clustering, and then PCA is employed within each group. Different numbers of principal components are selected from each group to form the final feature set. Support vector machine is used to perform the classification. The results show that when three principal components were chosen from each group we can achieve an area under the curve of receiver operating characteristics of 0.905, which is as high as the original dataset. Meanwhile, the computation time is reduced by 70% and the feature set size is reduce by 77%. It can be concluded that the proposed method captures the most important information of the feature set and the classification accuracy is not affected after the dimension reduction. The result is promising and further investigation, such as automatically threshold setting, are worthwhile and are under progress.
KW - Computer-aided detection/diagnosis
KW - CTC
KW - FP reduction
KW - Hierarchical clustering
KW - PCA
UR - https://www.scopus.com/pages/publications/84878407477
U2 - 10.1117/12.2006965
DO - 10.1117/12.2006965
M3 - Conference contribution
AN - SCOPUS:84878407477
SN - 9780819494443
T3 - Proceedings of SPIE - The International Society for Optical Engineering
BT - Medical Imaging 2013
T2 - Medical Imaging 2013: Computer-Aided Diagnosis
Y2 - 12 February 2013 through 14 February 2013
ER -