Abstract 2354P
Background
EMT is a process through which epithelial cells acquire mesenchymal-like traits, such as increased migratory capacity, which facilitate cancer cell metastasis. Here we propose an interpretable, scalable, machine learning (ML)-based method to assess EMT status directly from H&E-stained images. We extracted human interpretable features (HIFs) quantifying cancer cell nuclear morphology and the tumor microenvironment from breast cancer H&E images and confirmed concordance with a known ground-truth EMT expression signature.
Methods
Using a breast cancer-specific EMT signature gene set, we calculated quantile normalized EMT scores for each patient in TCGA-BRCA with RNA-seq data (n = 751). Convolutional neural network models trained using H&E-stained whole slide images were used to extract a set of HIFs across available slides (n = 775). An ordinary least squares regression was run for each HIF against EMT score with tumor purity as a covariate. FDR correction was applied to all p-values.
Results
Lower EMT scores were associated with epithelial attributes, such as high nuclei circularity, while high EMT scores were associated with mesenchymal attributes, such as high nuclei eccentricity and greater variation in nuclei perimeter. Epithelial-type tumors were associated with increased immature stromal content and cancer cell proportion. Mesenchymal-type tumors were associated with increased mature stromal content and stromal cell proportion. Table: 2354P
HIF | Coefficient | Standard error | p-value | R2 | FDR corrected p-value |
Mean cancer cell nuclei circularity | -0.047 | 0.0087 | 8.60E-08 | 0.65 | 2.80E-07 |
Mean cancer cell nuclei eccentricity | 0.042 | 0.0089 | 2.30E-06 | 0.65 | 5.80E-06 |
Standard deviation of cancer cell nuclei perimeter | 0.073 | 0.0089 | 1.20E-15 | 0.67 | 8.30E-15 |
Area proportion immature stroma over all stroma | -0.035 | 0.0083 | 2.50E-05 | 0.65 | 5.20E-05 |
Area proportion mature stroma over all stroma | 0.033 | 0.0082 | 8.40E-05 | 0.65 | 0.00016 |
Count proportion cancer cells over all cells | -0.12 | 0.0075 | 6.50E-51 | 0.75 | 9.80E-49 |
Count proportion stromal cells over all cells | 0.098 | 0.0078 | 1.20E-32 | 0.71 | 3.60E-31 |
Conclusions
ML model-generated HIFs effectively capture cancer cell morphology reflective of EMT status, including nuclei eccentricity and circularity. The relationships observed in these results are concordant with previous studies showing increased EMT in tumors with greater extracellular matrix stiffness and large numbers of cancer associated fibroblasts. Overall these results reflect significant agreement between EMT scores and HIFs, indicating potential for quantification of EMT states directly from H&E images.
Clinical trial identification
Editorial acknowledgement
Legal entity responsible for the study
PathAI, Inc.
Funding
PathAI, Inc.
Disclosure
C. Kirkup, S. Vilchez, S. Srinivasan, M. Lin, J. Abel, M. Drage, J. Conway, A. Khosla, A. Taylor-Weiner: Financial Interests, Full or part-time Employment: PathAI, Inc. All other authors have declared no conflicts of interest.
Resources from the same session
2335P - Spatial whole exome sequencing reveals the genetic features of highly-aggressive components in lung adenocarcinoma
Presenter: Jianfu Li
Session: Poster session 16
2336P - MiR-155 overexpression regulates epithelial-mesenchymal transition through repressing Quaking in lung cancer cells
Presenter: Jung-Jyh Hung
Session: Poster session 16
2337P - Feasibility of mutation detection in cytological samples from metastasis in patients with metastatic prostate cancer
Presenter: Marina Mencinger
Session: Poster session 16
2338P - The phenotype of tumor-associated macrophages in human high-grade serous ovarian carcinoma
Presenter: Irina Larionova
Session: Poster session 16
2339P - Clinical and genomic correlates of plasma circulating tumor DNA (ctDNA) tumor fraction (TF) in patients with advanced NSCLC
Presenter: Filippo Dall'Olio
Session: Poster session 16
2340P - T-cell immunoglobulin and mucin-domain containing molecule-3 (Tim3) in breast cancer and the impact on penetration of blood brain barrier by cancer cells
Presenter: Xiaoshan Cao
Session: Poster session 16
2341P - Novel therapeutic opportunities in metaplastic breast cancer
Presenter: Fresia Pareja
Session: Poster session 16
2342P - Single tumor cells are a novel prognostic criterion in breast cancer patients
Presenter: Liubov Tashireva
Session: Poster session 16
2343P - Evolution of circulating tumour DNA analysis in France over 5 years
Presenter: Guillaume Herbreteau
Session: Poster session 16
2345P - The possibility of programmed death-ligand 1-related immune modulation in neoadjuvant thermoradiotherapy for rectal cancer
Presenter: seihwan you
Session: Poster session 16