Abstract 2354P
Background
EMT is a process through which epithelial cells acquire mesenchymal-like traits, such as increased migratory capacity, which facilitate cancer cell metastasis. Here we propose an interpretable, scalable, machine learning (ML)-based method to assess EMT status directly from H&E-stained images. We extracted human interpretable features (HIFs) quantifying cancer cell nuclear morphology and the tumor microenvironment from breast cancer H&E images and confirmed concordance with a known ground-truth EMT expression signature.
Methods
Using a breast cancer-specific EMT signature gene set, we calculated quantile normalized EMT scores for each patient in TCGA-BRCA with RNA-seq data (n = 751). Convolutional neural network models trained using H&E-stained whole slide images were used to extract a set of HIFs across available slides (n = 775). An ordinary least squares regression was run for each HIF against EMT score with tumor purity as a covariate. FDR correction was applied to all p-values.
Results
Lower EMT scores were associated with epithelial attributes, such as high nuclei circularity, while high EMT scores were associated with mesenchymal attributes, such as high nuclei eccentricity and greater variation in nuclei perimeter. Epithelial-type tumors were associated with increased immature stromal content and cancer cell proportion. Mesenchymal-type tumors were associated with increased mature stromal content and stromal cell proportion. Table: 2354P
HIF | Coefficient | Standard error | p-value | R2 | FDR corrected p-value |
Mean cancer cell nuclei circularity | -0.047 | 0.0087 | 8.60E-08 | 0.65 | 2.80E-07 |
Mean cancer cell nuclei eccentricity | 0.042 | 0.0089 | 2.30E-06 | 0.65 | 5.80E-06 |
Standard deviation of cancer cell nuclei perimeter | 0.073 | 0.0089 | 1.20E-15 | 0.67 | 8.30E-15 |
Area proportion immature stroma over all stroma | -0.035 | 0.0083 | 2.50E-05 | 0.65 | 5.20E-05 |
Area proportion mature stroma over all stroma | 0.033 | 0.0082 | 8.40E-05 | 0.65 | 0.00016 |
Count proportion cancer cells over all cells | -0.12 | 0.0075 | 6.50E-51 | 0.75 | 9.80E-49 |
Count proportion stromal cells over all cells | 0.098 | 0.0078 | 1.20E-32 | 0.71 | 3.60E-31 |
Conclusions
ML model-generated HIFs effectively capture cancer cell morphology reflective of EMT status, including nuclei eccentricity and circularity. The relationships observed in these results are concordant with previous studies showing increased EMT in tumors with greater extracellular matrix stiffness and large numbers of cancer associated fibroblasts. Overall these results reflect significant agreement between EMT scores and HIFs, indicating potential for quantification of EMT states directly from H&E images.
Clinical trial identification
Editorial acknowledgement
Legal entity responsible for the study
PathAI, Inc.
Funding
PathAI, Inc.
Disclosure
C. Kirkup, S. Vilchez, S. Srinivasan, M. Lin, J. Abel, M. Drage, J. Conway, A. Khosla, A. Taylor-Weiner: Financial Interests, Full or part-time Employment: PathAI, Inc. All other authors have declared no conflicts of interest.
Resources from the same session
2357P - TRX-221, a 4th generation, mutant-selective, and CNS-active EGFR inhibitor with robust antitumor activity in osimertinib-resistant tumor models harboring C797S mutation
Presenter: Sumin Lim
Session: Poster session 16
2358P - Assessing the efficacy of robot-assisted total gastrectomy for advanced proximal gastric cancer: Results from a three-year clinical trial
Presenter: Zhi-Yu Liu
Session: Poster session 16