Abstract 845P
Background
Next Generation sequencing (NGS) has greatly advanced precision oncology. The growing amount and complexity of NGS data, along with other clinical data such as drug responses and measurable residual disease (MRD), present a big challenge for data integration and interpretation. Machine learning, especially deep neural networks, offer a promising approach for efficiently analyzing intricate relationships within large datasets, with the potential of improving clinical outcomes.
Methods
We performed targeted RNA sequencing in routine diagnostics and sequenced 1849 cases with hematological neoplasms, primarily pediatric B-cell precursor acute lymphoblastic leukemia (BCP-ALL). We used featureCounts in megSAP pipeline and Uniform Manifold Approximation and Projection (UMAP) to analyze the gene expression data, and Bokeh to create the web interface for data integration and visualization. Scikit-learn and PyTorch were used to develop shallow machine learning models and deep learning neural networks (NN), respectively.
Results
Our platform integrates various genetic and clinical data based on UMAP analysis of gene expression patterns. It improves the point-of-care decision making and facilitates the discovery of new patterns such as subpopulations with different genetic and clinical features. The platform is supported by machine learning algorithms for cancer subtyping. Six basic machine learning algorithms were coupled with feature selection methods and the best F1 score achieves 98%. We also built a biologically informed deep NN that can accurately predict BCP-ALL subtypes (F1=97%) with a good interpretability, which helps to identify crucial genetic aberrations associated with disease subtypes.
Conclusions
Our machine learning based platform can not only provide support for clinical decision-making but also bring novel translational insights for hematological malignancies.
Clinical trial identification
Editorial acknowledgement
Legal entity responsible for the study
The authors.
Funding
Bundesministerium für Bildung und Forschung (BMBF).
Disclosure
All authors have declared no conflicts of interest.
Resources from the same session
968P - High sensitivity routine blood based detection of HCC: An AI model from 220k patients
Presenter: Kin Nam Kwok
Session: Poster session 18
969P - Establishing a novel routine blood component signature for Hepatocellular Carcinoma (HCC) screening with big clinical data
Presenter: Ka Man Cheung
Session: Poster session 18
970P - Real-world multi-center study of systemic treatment after first-line atezolizumab plus bevacizumab for advanced hepatocellular carcinoma in Asia-Pacific countries
Presenter: Choong-kun Lee
Session: Poster session 18
971P - Effect of preoperative frailty on surgical outcomes following hepatic resection for elderly patients with hepatocellular carcinoma: A multicenter retrospective cohort study from China
Presenter: Zhongqi Fan
Session: Poster session 18
972P - Sequential therapies after atezolizumab plus bevacizumab or lenvatinib first-line treatments in advanced hepatocellular carcinoma
Presenter: Mara Persano
Session: Poster session 18
973P - Clinicopathologic and treatment outcome data in 165 fibrolamellar carcinoma patients
Presenter: Sunyoung Lee
Session: Poster session 18
974P - The barthel index predicts surgical textbook outcomes following hepatectomy for elderly patients with hepatocellular carcinoma: A multicenter cohort study from China
Presenter: Guoyue Lv
Session: Poster session 18
975P - The clinical impact of urinary protein creatinine ratio and AFP at six weeks in patients with unresectable hepatocellular carcinoma treated with atezolizumab plus bevacizumab
Presenter: Kaoru Tsuchiya
Session: Poster session 18
976P - Overall survival in advanced hepatocellular carcinoma treated with concomitant systemic therapy and stereotactic radiation therapy or systemic therapy alone
Presenter: Alexander Piening
Session: Poster session 18