Abstract 2788
Background
Deep learning (DL) is one of the best approaches to predict nonlinear behaviors from high dimensional data. Nevertheless predicting the outcome of patients affected by cancers from transcriptomic data has shown limited performance, even with DL (C-index usually <0.65). Transfer learning is a DL two-step method where a model is pre-trained for a basic task on large amount of data, and then fine-tuned on the aimed task. We hypothesized that using TL with RNAseq may improve the performances of cancer patients’ outcome estimation.
Methods
The model was a Multi-Mayer Perceptron (MLP) with 22913 inputs corresponding to genes bulk tumor whole genome RNAseq expression analysis. An important restriction was applied to the number of units at second layer (N = 100), with further linear decrease across subsequent layers. Architecture of the model (number of layers, skip connections), L1 normalization value and learning rate were optimized by grid search on 30 parallel models. Training was performed using Keras package in R. Data were split into 70% training, 15% cross validation, 15% validation for each step, without contamination between the 2 transfer learning steps. The pre-training step consisted in predicting the organs of sample origin using 17.487 public RNAseq data of normal & cancer tissues (GTEX from gtexportal.org & TCGA from cBioportal.org). Fine-tuning on patients survival used 6401 training tumors. The model’s performance on survival prediction was evaluated by C-index and the area under the survival receiver-operating characteristic curve (AUROC).
Results
The pre-training using GTEx and TCGA reached very high performance with validation accuracy of 0.96 to predict organ of origins for the best model (all models had validation accuracy > 0.9). Fine-tuning on survival, the prognostic performance of the best model on the validation cohort was C-index=0.74 and AUROC= 0.81 (80% of models had a C-index > 0.6). The best model had 8 hidden layers and a small penalization value.
Conclusions
Thanks to this original transfer learning method, we achieved a high performance to estimate cancer patients’ prognostic from whole genome expression, a classically challenging task. Learning on public databases is a valuable method of DL for personalized cancer care.
Clinical trial identification
Legal entity responsible for the study
The authors.
Funding
Has not received any funding.
Disclosure
E. Angevin: Advisory / Consultancy: Amgen; Advisory / Consultancy: Astellas; Advisory / Consultancy: AstraZeneca; Advisory / Consultancy: Bayer; Advisory / Consultancy: BeiGene; Advisory / Consultancy: BMS; Advisory / Consultancy: Celgene; Advisory / Consultancy: DebioPharma; Advisory / Consultancy: Genentech; Advisory / Consultancy: Ipsen; Advisory / Consultancy: Janssen; Advisory / Consultancy: Lilly; Advisory / Consultancy: MedImmune; Advisory / Consultancy: Novartis; Advisory / Consultancy: Pfizer; Advisory / Consultancy: Roche; Advisory / Consultancy: Sanofi; Advisory / Consultancy: Orion. A. Hollebecque: Advisory / Consultancy: Amgen; Advisory / Consultancy: Spectrum Pharmaceuticals; Advisory / Consultancy: Lilly; Advisory / Consultancy: Debiopharm; Travel / Accommodation / Expenses: Servier; Travel / Accommodation / Expenses: Amgen; Travel / Accommodation / Expenses: Lilly; Travel / Accommodation / Expenses: Incyte; Travel / Accommodation / Expenses: Debiopharm. E. Deutsch: Advisory / Consultancy: Boehringer; Advisory / Consultancy: Medimune; Advisory / Consultancy: Amgen; Research grant / Funding (self): AstraZeneca; Research grant / Funding (self): biotrachea; Research grant / Funding (institution): BristolMyersSquidd; Research grant / Funding (self): Clevelex; Research grant / Funding (self): EDF; Research grant / Funding (self): Lilly; Research grant / Funding (self): GlaxoSmisthKline; Research grant / Funding (self): Merk; Research grant / Funding (self): Nanobiotix; Research grant / Funding (self): Oseo; Research grant / Funding (self): Ray Search Laboratory; Research grant / Funding (self): Roche; Research grant / Funding (self): Ipsen; Research grant / Funding (self): Servier; Research grant / Funding (self): Takeda. C. Massard: Advisory / Consultancy: Amgen; Advisory / Consultancy: Astellas; Advisory / Consultancy: AstraZeneca; Advisory / Consultancy: Bayer; Advisory / Consultancy: BeiGene; Advisory / Consultancy: BMS; Advisory / Consultancy: Celgene; Advisory / Consultancy: DebioPharma; Advisory / Consultancy: Genentech; Advisory / Consultancy: Ipsen; Advisory / Consultancy: Janssen; Advisory / Consultancy: Lilly; Advisory / Consultancy: MedImmune; Advisory / Consultancy: Novartis; Advisory / Consultancy: Pfizer; Advisory / Consultancy: Roche; Advisory / Consultancy: Sanofi; Advisory / Consultancy: Orion. L. Verlingue: Research grant / Funding (self): Bristol-Myers Squibb; Advisory / Consultancy: Pierre Fabre; Advisory / Consultancy: Adaptherapy. All other authors have declared no conflicts of interest.
Resources from the same session
5218 - Elevated driver mutational burden or number of perturbed pathways and poor response to abiraterone or enzalutamide in metastatic castration-resistant prostate cancer
Presenter: Bram De Laere
Session: Poster Display session 3
Resources:
Abstract
2452 - High proportion of multiple KRAS mutations in circulating tumor DNA and tumor tissue of pancreatic ductal adenocarcinoma
Presenter: Min Kyeong Kim
Session: Poster Display session 3
Resources:
Abstract
3328 - Biological difference of tumor mutational burden (TMB) and microsatellite instability (MSI) status in patients (pts) with somatic vs. germline BRCA1/2-mutated advanced gastrointestinal (GI) cancers using cell-free DNA (cfDNA) sequencing analysis in the GOZILA study
Presenter: Yasuyuki Kawamoto
Session: Poster Display session 3
Resources:
Abstract
3022 - Cell-Free DNA to Detect Focal Versus Non-Focal MET Amplification in Metastatic Colorectal Cancer Patients: Combined Analysis from Japan and the United States
Presenter: Mishima Saori
Session: Poster Display session 3
Resources:
Abstract
2833 - Presence of circulating tumor DNA in surgically resected renal cell carcinoma is associated with advanced disease and poor patient prognosis
Presenter: Andres Correa
Session: Poster Display session 3
Resources:
Abstract
1376 - Combined genomic and epigenomic assessment of cell-free circulating tumor DNA (cfDNA) for cancer diagnosis and recurrence-risk assessment in early-stage lung cancer
Presenter: Junghee Lee
Session: Poster Display session 3
Resources:
Abstract
4050 - DEMo: a prospective evaluation of a prognostic clinico-molecular composite score in NSCLC patients treated with immunotherapy.
Presenter: Arsela Prelaj
Session: Poster Display session 3
Resources:
Abstract
4727 - Bespoke circulating tumor DNA (ctDNA) analysis as a predictive biomarker in solid tumor patients (pts) treated with single agent pembrolizumab (P)
Presenter: Cindy Yang
Session: Poster Display session 3
Resources:
Abstract
3662 - Dynamic changes in whole-genome cell-free DNA (cfDNA) to identify disease progression prior to imaging in advanced solid tumors
Presenter: Andrew Davis
Session: Poster Display session 3
Resources:
Abstract
3817 - Evaluation of Microsatellite Instability Testing Through cell-free DNA sequencing
Presenter: Shile Zhang
Session: Poster Display session 3
Resources:
Abstract