Abstract 2354P
Background
EMT is a process through which epithelial cells acquire mesenchymal-like traits, such as increased migratory capacity, which facilitate cancer cell metastasis. Here we propose an interpretable, scalable, machine learning (ML)-based method to assess EMT status directly from H&E-stained images. We extracted human interpretable features (HIFs) quantifying cancer cell nuclear morphology and the tumor microenvironment from breast cancer H&E images and confirmed concordance with a known ground-truth EMT expression signature.
Methods
Using a breast cancer-specific EMT signature gene set, we calculated quantile normalized EMT scores for each patient in TCGA-BRCA with RNA-seq data (n = 751). Convolutional neural network models trained using H&E-stained whole slide images were used to extract a set of HIFs across available slides (n = 775). An ordinary least squares regression was run for each HIF against EMT score with tumor purity as a covariate. FDR correction was applied to all p-values.
Results
Lower EMT scores were associated with epithelial attributes, such as high nuclei circularity, while high EMT scores were associated with mesenchymal attributes, such as high nuclei eccentricity and greater variation in nuclei perimeter. Epithelial-type tumors were associated with increased immature stromal content and cancer cell proportion. Mesenchymal-type tumors were associated with increased mature stromal content and stromal cell proportion. Table: 2354P
HIF | Coefficient | Standard error | p-value | R2 | FDR corrected p-value |
Mean cancer cell nuclei circularity | -0.047 | 0.0087 | 8.60E-08 | 0.65 | 2.80E-07 |
Mean cancer cell nuclei eccentricity | 0.042 | 0.0089 | 2.30E-06 | 0.65 | 5.80E-06 |
Standard deviation of cancer cell nuclei perimeter | 0.073 | 0.0089 | 1.20E-15 | 0.67 | 8.30E-15 |
Area proportion immature stroma over all stroma | -0.035 | 0.0083 | 2.50E-05 | 0.65 | 5.20E-05 |
Area proportion mature stroma over all stroma | 0.033 | 0.0082 | 8.40E-05 | 0.65 | 0.00016 |
Count proportion cancer cells over all cells | -0.12 | 0.0075 | 6.50E-51 | 0.75 | 9.80E-49 |
Count proportion stromal cells over all cells | 0.098 | 0.0078 | 1.20E-32 | 0.71 | 3.60E-31 |
Conclusions
ML model-generated HIFs effectively capture cancer cell morphology reflective of EMT status, including nuclei eccentricity and circularity. The relationships observed in these results are concordant with previous studies showing increased EMT in tumors with greater extracellular matrix stiffness and large numbers of cancer associated fibroblasts. Overall these results reflect significant agreement between EMT scores and HIFs, indicating potential for quantification of EMT states directly from H&E images.
Clinical trial identification
Editorial acknowledgement
Legal entity responsible for the study
PathAI, Inc.
Funding
PathAI, Inc.
Disclosure
C. Kirkup, S. Vilchez, S. Srinivasan, M. Lin, J. Abel, M. Drage, J. Conway, A. Khosla, A. Taylor-Weiner: Financial Interests, Full or part-time Employment: PathAI, Inc. All other authors have declared no conflicts of interest.
Resources from the same session
2325P - Selinexor (XPO1 inhibitor) in combination with tepotinib (MET inhibitor) potentially inhibits SHOC2 and KRAS G12C in KRAS G12C mutant non-small cell lung cancer
Presenter: Rafael Rosell
Session: Poster session 16
2326P - Circulating tumor DNA (ctDNA) dynamics and clinical outcome in metastatic colorectal cancer (mCRC) patients (pts) undergoing front-line chemotherapy
Presenter: Michele Ghidini
Session: Poster session 16
2327P - The impact of the microenvironment on the intratumoral cellular heterogeneity in colorectal cancer
Presenter: Idan Carmi
Session: Poster session 16
2328P - Alternative molecular mechanisms underpinning breast invasive lobular carcinoma identified by genomics-driven artificial intelligence model
Presenter: Fresia Pareja
Session: Poster session 16
2329P - Evidence for the utility of artificial intelligence and image analysis in Ki-67 quantification in solid tumors
Presenter: Xavier Pichon
Session: Poster session 16
2331P - Interest of next generation sequencing (NGS) for integrated molecular diagnosis of HER2-low breast cancer
Presenter: Jean Louis Merlin
Session: Poster session 16
2332P - Overall survival in breast cancer patients with HER2-low status single- center experience
Presenter: Joanna Huszno
Session: Poster session 16
2333P - Associating BRCA1 hypermethylation with clinicopathological and molecular variables in triple-negative breast cancers
Presenter: Anna Karlsson
Session: Poster session 16
2334P - Pathologic patterns following different neoadjuvant therapies in non-small cell lung cancer (NSCLC)
Presenter: Annikka Weissferdt
Session: Poster session 16