TY - JOUR
T1 - Chronic Lymphocytic Leukemia Progression Diagnosis with Intrinsic Cellular Patterns via Unsupervised Clustering
AU - Chen, Pingjun
AU - El Hussein, Siba
AU - Xing, Fuyong
AU - Aminu, Muhammad
AU - Kannapiran, Aparajith
AU - Hazle, John D.
AU - Medeiros, L. Jeffrey
AU - Wistuba, Ignacio I.
AU - Jaffray, David
AU - Khoury, Joseph D.
AU - Wu, Jia
N1 - Publisher Copyright:
© 2022 by the authors. Licensee MDPI, Basel, Switzerland.
PY - 2022/5/1
Y1 - 2022/5/1
N2 - Identifying the progression of chronic lymphocytic leukemia (CLL) to accelerated CLL (aCLL) or transformation to diffuse large B-cell lymphoma (Richter transformation; RT) has significant clinical implications as it prompts a major change in patient management. However, the differentiation between these disease phases may be challenging in routine practice. Unsupervised learning has gained increased attention because of its substantial potential in data intrinsic pattern discovery. Here, we demonstrate that cellular feature engineering, identifying cellular phenotypes via unsupervised clustering, provides the most robust analytic performance in analyzing digitized pathology slides (accuracy = 0.925, AUC = 0.978) when compared to alternative approaches, such as mixed features, supervised features, unsupervised/mixed/supervised feature fusion and selection, as well as patch-based convolutional neural network (CNN) feature extraction. We further validate the reproducibility and robustness of unsupervised feature extraction via stability and repeated splitting analysis, supporting its utility as a diagnostic aid in identifying CLL patients with histologic evidence of disease progression. The outcome of this study serves as proof of principle using an unsupervised machine learning scheme to enhance the diagnostic accuracy of the heterogeneous histology patterns that pathologists might not easily see.
AB - Identifying the progression of chronic lymphocytic leukemia (CLL) to accelerated CLL (aCLL) or transformation to diffuse large B-cell lymphoma (Richter transformation; RT) has significant clinical implications as it prompts a major change in patient management. However, the differentiation between these disease phases may be challenging in routine practice. Unsupervised learning has gained increased attention because of its substantial potential in data intrinsic pattern discovery. Here, we demonstrate that cellular feature engineering, identifying cellular phenotypes via unsupervised clustering, provides the most robust analytic performance in analyzing digitized pathology slides (accuracy = 0.925, AUC = 0.978) when compared to alternative approaches, such as mixed features, supervised features, unsupervised/mixed/supervised feature fusion and selection, as well as patch-based convolutional neural network (CNN) feature extraction. We further validate the reproducibility and robustness of unsupervised feature extraction via stability and repeated splitting analysis, supporting its utility as a diagnostic aid in identifying CLL patients with histologic evidence of disease progression. The outcome of this study serves as proof of principle using an unsupervised machine learning scheme to enhance the diagnostic accuracy of the heterogeneous histology patterns that pathologists might not easily see.
KW - Richter transformation (RT)
KW - accelerated CLL
KW - cellular feature engineering
KW - chronic lymphocytic leukemia (CLL)
KW - disease progression
KW - feature fusion
KW - feature selection
KW - large cell transformation
KW - unsupervised clustering
UR - http://www.scopus.com/inward/record.url?scp=85129896640&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85129896640&partnerID=8YFLogxK
U2 - 10.3390/cancers14102398
DO - 10.3390/cancers14102398
M3 - Article
C2 - 35626003
AN - SCOPUS:85129896640
SN - 2072-6694
VL - 14
JO - Cancers
JF - Cancers
IS - 10
M1 - 2398
ER -