Abstract 1178P
Background
Non-small cell lung cancer (NSCLC) requires multiple genomics testing modalities for optimizing patient outcomes. The foremost of NSCLC biomarkers is EGFR sequencing. Sequencing comes with many challenges, including long turnaround time, high tissue requirements from small biopsies, and cost. An AI model using only digital whole slide images (WSI) can act as a rapid screening test to prioritize tissue for proper sequencing without expending tissue.
Methods
A vision transformer (ViT) base architecture is trained for classification of acinar, solid, lepidic, papillary, and micropapillary morphologies, using 1 million 2242 pixel patches extracted from 3475 WSIs. The training utilizes cross-entropy loss with the Adam optimizer with learning rate of 1e-4 and cosine weight decay scheduler. The pretrained encoder allows for extraction of 768-dimensional feature vectors from the last hidden layer for downstream tasks. For EGFR prediction, each of the 1558 training WSIs are decomposed to 2242 pixel patches and feature embeddings are extracted for each patch. Using a gated attention-based multiple instance learning model, EGFR WSI labels are predicted. The model was optimized using 260 WSIs to obtain best AUC. The best model was evaluated on a held-out set of 6300 WSIs before integration into a mock clinical workflow, enabling in real-time (IRT) EGFR prediction for 7 slides. The informatic backbone identifies WSI at time of scanning and transfers the slide for inference, complted within 30 minutes of scanning.
Results
On the validation dataset of 260 cases, our model exhibited an area under the curve (AUC) of 0.93 with a specificity of 0.90 and sensitivity of 0.88. The model, assessed on an independent validation set of 6300 cases, maintained a high AUC of 0.89 with negative/positive predictive value (NPV/PPV): NPV = 0.90; PPV = 0.71. On IRT cohort, using same threshold: NPV = 1.0; PPV = 0.66.
Conclusions
Implementing such a model that can be ran IRT with clinical WSIs can provide rapid insight and inform ongoing testing protocols (e.g. prioritize tissue for EGFR confirmation when positive or full genomics when negative). Continuous refinement and integration of IRT data will enhance performance to align with clinical process requirements.
Clinical trial identification
Editorial acknowledgement
During the preparation of this work the author(s) used ChatGPT in order to construct the abstract title. After using this tool, the authors reviewed and edited the content as needed and take full responsibility for the content of the publication.
Legal entity responsible for the study
The Warren Alpert Center for Digital and Computational Pathology, Memorial Sloan Kettering Cancer Center.
Funding
The Warren Alpert Foundation, The Warren Alpert Center for Digital and Computational Pathology, Memorial Sloan Kettering Cancer Center.
Disclosure
C.M. Vanderbilt: Financial Interests, Personal, Stocks or ownership: Paige AI. T. Fuchs: Financial Interests, Personal, Advisory Board, Founder, Equity holder, etc: Paige AI. M. Hameed: Financial Interests, Personal, Other, Fiduciary Role/Position: USCAP. A. Dogan: Financial Interests, Personal, Other, Professional Services and Activities: Incyte. All other authors have declared no conflicts of interest.
Resources from the same session
212P - BRGSF-HIS mice as a predictive tool for safety assessment of biologics
Presenter: Kader Thiam
Session: Poster session 09
213P - Constructing a high-definition patient-digital twin (PDT) in treatment-naïve women with advanced cancer
Presenter: Leonardo Garma
Session: Poster session 09
215P - Detection of MUTYH for the prognosis and chemotherapy responsiveness of patients with non-small cell lung cancer
Presenter: Chi Wai Wong
Session: Poster session 09
216P - β-catenin is a potential prognostic biomarker in uterine sarcoma
Presenter: Ying Cai
Session: Poster session 09
218P - Exploiting a unique glycosaminoglycan for novel pan-cancer therapies and diagnostics
Presenter: Mette Agerbæk
Session: Poster session 09
219P - The landscape and prognostic impact of germline HLA-A subtypes in patients with advanced solid cancers
Presenter: Kyrillus Shohdy
Session: Poster session 09
220P - The role of fucosyltransferase 1 (FUT1) in CRC as a putative prognostic and predictive biomarker
Presenter: Lorenz Pammer
Session: Poster session 09
221P - ANGPTL4's role in cancer: A meta analysis and bioinformatics exploration
Presenter: Osama Younis
Session: Poster session 09
222P - Artificial intelligence (AI) based prognostication from baseline computed tomography (CT) scans in a phase III advanced non-small cell lung cancer (aNSCLC) trial
Presenter: Omar Khan
Session: Poster session 09