Abstract 5820
Background
DNA sequencing to identify variants is becoming increasingly valuable in clinical settings; including matching patients to approved targeted therapies, immunotherapies, and/or clinical trials. However, accurate calling of genetic variants from sequencing still remains challenging. With little corroboration between the different tools available, patients are at risk of being treated with therapies that are unsuitable for their cancer.
Methods
Here we present a novel machine learning based method for the accurate identification of somatic variants in cancer patient tumour samples, with a neural network architecture from encoded raw sequencing read information of tumour/normal sample pairings into an image, enabling it to classify whether a variant is germline, somatic, or sequencing error. The model was trained and tested on in-silico spike-in data using bam-surgeon, and then validated on a multi-cancer and multi-center dataset and benchmarked against industry standard variant callers.
Results
The approach, called somaticNET, outperforms existing industry standard tools in sensitivity and specificity, achieving an AUROC of ∼1.00 on the bam-surgeon dataset and an AUROC of ∼0.99 on the multi-cancer multicenter dataset. The model also works faster than other variant callers, in minutes compared to hours.
Conclusions
Using the power of machine learning for accurate somatic variant calling can improve patient matching to approved therapies and clinical trials, thus ensuring patients are given the right therapy at the right time to treat their cancer.
Clinical trial identification
Editorial acknowledgement
Legal entity responsible for the study
The authors.
Funding
Cambridge Cancer Genomics.
Disclosure
G. Dubourg-Felonneau: Shareholder / Stockholder / Stock options, Full / Part-time employment: Cambridge Cancer Genomics. D. Rebergen: Shareholder / Stockholder / Stock options, Full / Part-time employment: Cambridge Cancer Genomics. C. Parsons: Shareholder / Stockholder / Stock options, Full / Part-time employment: Cambridge Cancer Genomics. H. Thompson: Shareholder / Stockholder / Stock options, Full / Part-time employment: Cambridge Cancer Genomics. J.W. Cassidy: Leadership role, Shareholder / Stockholder / Stock options, Full / Part-time employment, Officer / Board of Directors: Cambridge Cancer Genomics. N. Patel: Leadership role, Shareholder / Stockholder / Stock options, Full / Part-time employment: Cambridge Cancer Genomics. H.W. Clifford: Leadership role, Shareholder / Stockholder / Stock options, Full / Part-time employment: Cambridge Cancer Genomics.
Resources from the same session
3664 - Longitudinal changes in cell-free DNA (cfDNA) methylation levels identify early non-responders to treatment in advanced solid tumors
Presenter: Andrew Davis
Session: Poster Display session 3
Resources:
Abstract
3212 - Multigene panel testing results for hereditary breast cancer in 1325 individuals: implications for gene selection and considerations for guidelines.
Presenter: Georgios Tsaousis
Session: Poster Display session 3
Resources:
Abstract
2591 - PIK3R5 genetic predictors of hypertension induced by VEGF-pathway inhibitors
Presenter: Julia Quintanilha
Session: Poster Display session 3
Resources:
Abstract
4377 - ERBB2 mRNA as a predictor in HER2-positive (HER2+)/hormone receptor-positive (HR+) metastatic breast cancer (BC) treated with HER2 blockade in combination with endocrine therapy (ET): a retrospective analysis of the ALTERNATIVE and SOLTI-PAMELA trials.
Presenter: Nuria Chic
Session: Poster Display session 3
Resources:
Abstract
3439 - Early on-treatment vs pre-treatment tumor transcriptomes as predictors of response to neoadjuvant therapy for HER2-positive inflammatory breast cancer
Presenter: Sonia Pernas
Session: Poster Display session 3
Resources:
Abstract
2512 - AXL expression predicts poor prognosis and lack of efficacy of anti-angiogenic and anti-epidermal growth factor receptor (EGFR) agents in patients (pts) with RAS wild type (WT) metastatic colorectal cancer (mCRC)
Presenter: Claudia Cardone
Session: Poster Display session 3
Resources:
Abstract
4061 - Prevalence of EGFR mutations and its correlation with Egyptian patients’ human kinetics (PEEK Study)
Presenter: Adel Ibrahim
Session: Poster Display session 3
Resources:
Abstract
2547 - Evaluation of tumor microenvironment identifies immune correlates of response to combination immunotherapy with margetuximab (M) and pembrolizumab (P) in HER2+ gastroesophageal adenocarcinoma (GEA)
Presenter: Sergio Rutella
Session: Poster Display session 3
Resources:
Abstract
4671 - Clinicopathological and molecular criteria assessment for the screening of hypermutated proficient mismatch repair (pMMR) colorectal cancers (CRC) with exonucleasic domain POLE (edPOLE) mutations (mt).
Presenter: Benoit Rousseau
Session: Poster Display session 3
Resources:
Abstract
3862 - Tumor mutation burden and microsatellite instability in colorectal cancer
Presenter: Francesca Fenizia
Session: Poster Display session 3
Resources:
Abstract