Abstract 5820
Background
DNA sequencing to identify variants is becoming increasingly valuable in clinical settings; including matching patients to approved targeted therapies, immunotherapies, and/or clinical trials. However, accurate calling of genetic variants from sequencing still remains challenging. With little corroboration between the different tools available, patients are at risk of being treated with therapies that are unsuitable for their cancer.
Methods
Here we present a novel machine learning based method for the accurate identification of somatic variants in cancer patient tumour samples, with a neural network architecture from encoded raw sequencing read information of tumour/normal sample pairings into an image, enabling it to classify whether a variant is germline, somatic, or sequencing error. The model was trained and tested on in-silico spike-in data using bam-surgeon, and then validated on a multi-cancer and multi-center dataset and benchmarked against industry standard variant callers.
Results
The approach, called somaticNET, outperforms existing industry standard tools in sensitivity and specificity, achieving an AUROC of ∼1.00 on the bam-surgeon dataset and an AUROC of ∼0.99 on the multi-cancer multicenter dataset. The model also works faster than other variant callers, in minutes compared to hours.
Conclusions
Using the power of machine learning for accurate somatic variant calling can improve patient matching to approved therapies and clinical trials, thus ensuring patients are given the right therapy at the right time to treat their cancer.
Clinical trial identification
Editorial acknowledgement
Legal entity responsible for the study
The authors.
Funding
Cambridge Cancer Genomics.
Disclosure
G. Dubourg-Felonneau: Shareholder / Stockholder / Stock options, Full / Part-time employment: Cambridge Cancer Genomics. D. Rebergen: Shareholder / Stockholder / Stock options, Full / Part-time employment: Cambridge Cancer Genomics. C. Parsons: Shareholder / Stockholder / Stock options, Full / Part-time employment: Cambridge Cancer Genomics. H. Thompson: Shareholder / Stockholder / Stock options, Full / Part-time employment: Cambridge Cancer Genomics. J.W. Cassidy: Leadership role, Shareholder / Stockholder / Stock options, Full / Part-time employment, Officer / Board of Directors: Cambridge Cancer Genomics. N. Patel: Leadership role, Shareholder / Stockholder / Stock options, Full / Part-time employment: Cambridge Cancer Genomics. H.W. Clifford: Leadership role, Shareholder / Stockholder / Stock options, Full / Part-time employment: Cambridge Cancer Genomics.
Resources from the same session
5603 - Development of a comprehensive next-generation targeted sequencing assay for detection of gene-fusions in solid tumors
Presenter: Vinay Mittal
Session: Poster Display session 3
Resources:
Abstract
4952 - Next-generation sequencing for better treatment strategy of cancer of unknown primary (CUP)
Presenter: Kang Kook Lee
Session: Poster Display session 3
Resources:
Abstract
4590 - Circulating-free DNA analysis from long-term surviving metastatic colorectal cancer patients undergoing surgery for resectable disease.
Presenter: Michele Ghidini
Session: Poster Display session 3
Resources:
Abstract
3696 - Ultra-sensitive detection of circulating tumor DNA identifies patients in high risk of recurrence in early stages melanoma
Presenter: Filip Janku
Session: Poster Display session 3
Resources:
Abstract
4295 - Identification of the founder BRCA1 mutation c.4117G>T (p.Glu1373*) recurring in Abruzzo and Lazio regions of Central Italy and predisposing to breast/ovarian and BRCA1-related cancers
Presenter: Daniela Di Giacomo
Session: Poster Display session 3
Resources:
Abstract
2214 - Enzalutamide (ENZA) and Apalutamide (APA) In vitro chemical reactivity studies and Activity in a Mouse Drug Allergy Model (MDAM)
Presenter: Mausumee Guha
Session: Poster Display session 3
Resources:
Abstract
5044 - Influence of genetic variation in COMT on cisplatin-induced nephrotoxicity in cancer patients.
Presenter: Bram Agema
Session: Poster Display session 3
Resources:
Abstract
3293 - Cardioprotective and anti-inflammatory effects of Empagliflozin during treatment with Doxorubicin: a cellular and preclinical study
Presenter: Vincenzo Quagliariello
Session: Poster Display session 3
Resources:
Abstract
3324 - Breast Cancer Organoids Model Treatment Response of HER2 Targeted Therapy in HER2-mutant Breast Cancer
Presenter: Xuelu Li
Session: Poster Display session 3
Resources:
Abstract
2115 - Preclinical in vivo screening to predict responder patients depend on EGFR status
Presenter: Yejin Kim
Session: Poster Display session 3
Resources:
Abstract