Abstract 5820
Background
DNA sequencing to identify variants is becoming increasingly valuable in clinical settings; including matching patients to approved targeted therapies, immunotherapies, and/or clinical trials. However, accurate calling of genetic variants from sequencing still remains challenging. With little corroboration between the different tools available, patients are at risk of being treated with therapies that are unsuitable for their cancer.
Methods
Here we present a novel machine learning based method for the accurate identification of somatic variants in cancer patient tumour samples, with a neural network architecture from encoded raw sequencing read information of tumour/normal sample pairings into an image, enabling it to classify whether a variant is germline, somatic, or sequencing error. The model was trained and tested on in-silico spike-in data using bam-surgeon, and then validated on a multi-cancer and multi-center dataset and benchmarked against industry standard variant callers.
Results
The approach, called somaticNET, outperforms existing industry standard tools in sensitivity and specificity, achieving an AUROC of ∼1.00 on the bam-surgeon dataset and an AUROC of ∼0.99 on the multi-cancer multicenter dataset. The model also works faster than other variant callers, in minutes compared to hours.
Conclusions
Using the power of machine learning for accurate somatic variant calling can improve patient matching to approved therapies and clinical trials, thus ensuring patients are given the right therapy at the right time to treat their cancer.
Clinical trial identification
Editorial acknowledgement
Legal entity responsible for the study
The authors.
Funding
Cambridge Cancer Genomics.
Disclosure
G. Dubourg-Felonneau: Shareholder / Stockholder / Stock options, Full / Part-time employment: Cambridge Cancer Genomics. D. Rebergen: Shareholder / Stockholder / Stock options, Full / Part-time employment: Cambridge Cancer Genomics. C. Parsons: Shareholder / Stockholder / Stock options, Full / Part-time employment: Cambridge Cancer Genomics. H. Thompson: Shareholder / Stockholder / Stock options, Full / Part-time employment: Cambridge Cancer Genomics. J.W. Cassidy: Leadership role, Shareholder / Stockholder / Stock options, Full / Part-time employment, Officer / Board of Directors: Cambridge Cancer Genomics. N. Patel: Leadership role, Shareholder / Stockholder / Stock options, Full / Part-time employment: Cambridge Cancer Genomics. H.W. Clifford: Leadership role, Shareholder / Stockholder / Stock options, Full / Part-time employment: Cambridge Cancer Genomics.
Resources from the same session
2671 - Luminal B breast cancer prognosis prediction by comprehensive analysis of Homeobox genes
Presenter: Ayako Nakashoji
Session: Poster Display session 3
Resources:
Abstract
2650 - Long non-coding RNA E2F4as promotes tumor progression and predicts patient prognosis in human ovarian cancer
Presenter: Sun-Ae Park
Session: Poster Display session 3
Resources:
Abstract
1462 - FGF19 promotes esophageal squamous cell carcinoma progression by inhibiting autophagy
Presenter: Lisha Ying
Session: Poster Display session 3
Resources:
Abstract
5787 - Proof of concept on the role of ex vivo lung cancer spheroids, cytokines expression and PBMCs profiling in monitoring disease history and response to treatments.
Presenter: Raimondo Di Liello
Session: Poster Display session 3
Resources:
Abstract
5253 - Circulating microRNAs related to DNA damage response as predictors of survival in metastatic non- small cell lung cancer patients treated with platinum-based chemotherapy
Presenter: Dimitris Mavroudis
Session: Poster Display session 3
Resources:
Abstract
5286 - Prognostic value of CTCs in advanced NSCLC patients treated with platinum-based chemotherapy
Presenter: Silvia Calabuig-Fariñas
Session: Poster Display session 3
Resources:
Abstract
5781 - Exosomes in NSCLC as a source of biomarkers
Presenter: Elena Duréndez
Session: Poster Display session 3
Resources:
Abstract
1447 - The role of Pim-1 in the development and progression of papillary thyroid carcinoma
Presenter: Xin Zhu
Session: Poster Display session 3
Resources:
Abstract
1323 - Development and Validation of a RNA-Seq Based Prognostic Signature in Neuroblastoma
Presenter: Jian-Guo Zhou
Session: Poster Display session 3
Resources:
Abstract
3290 - Identification of meningioma patients in high risk of tumor recurrence using microRNA profiling
Presenter: Josef Srovnal
Session: Poster Display session 3
Resources:
Abstract