Abstract 845P
Background
Next Generation sequencing (NGS) has greatly advanced precision oncology. The growing amount and complexity of NGS data, along with other clinical data such as drug responses and measurable residual disease (MRD), present a big challenge for data integration and interpretation. Machine learning, especially deep neural networks, offer a promising approach for efficiently analyzing intricate relationships within large datasets, with the potential of improving clinical outcomes.
Methods
We performed targeted RNA sequencing in routine diagnostics and sequenced 1849 cases with hematological neoplasms, primarily pediatric B-cell precursor acute lymphoblastic leukemia (BCP-ALL). We used featureCounts in megSAP pipeline and Uniform Manifold Approximation and Projection (UMAP) to analyze the gene expression data, and Bokeh to create the web interface for data integration and visualization. Scikit-learn and PyTorch were used to develop shallow machine learning models and deep learning neural networks (NN), respectively.
Results
Our platform integrates various genetic and clinical data based on UMAP analysis of gene expression patterns. It improves the point-of-care decision making and facilitates the discovery of new patterns such as subpopulations with different genetic and clinical features. The platform is supported by machine learning algorithms for cancer subtyping. Six basic machine learning algorithms were coupled with feature selection methods and the best F1 score achieves 98%. We also built a biologically informed deep NN that can accurately predict BCP-ALL subtypes (F1=97%) with a good interpretability, which helps to identify crucial genetic aberrations associated with disease subtypes.
Conclusions
Our machine learning based platform can not only provide support for clinical decision-making but also bring novel translational insights for hematological malignancies.
Clinical trial identification
Editorial acknowledgement
Legal entity responsible for the study
The authors.
Funding
Bundesministerium für Bildung und Forschung (BMBF).
Disclosure
All authors have declared no conflicts of interest.
Resources from the same session
842P - Multicenter real-world study of newly diagnosed advanced-stage extranodal natural killer/T cell lymphoma (ENKTL): Proposal for intensive therapy
Presenter: Yu-Ce Wei
Session: Poster session 18
844P - Progression -free survival prediction of multiple myeloma patients in five European countries using machine learning models
Presenter: Maria Luisa Pleguezuelo Witte
Session: Poster session 18
846P - Artificial intelligence-driven identification of onco-hematology patients who may develop severe COVID-19
Presenter: Souad Assaad
Session: Poster session 18
847P - Fatal infections among leukemia patients
Presenter: Huijie Zhou
Session: Poster session 18
848P - Mortality after rasburicase vs allopurinol anti-hyperuricemia monotherapy in patients with liquid tumors
Presenter: Mitchell Cairo
Session: Poster session 18
849P - Long non-coding RNA signatures and their role in the progression of childhood T cell acute lymphoblastic leukemia
Presenter: Pankaj Sharma
Session: Poster session 18
850P - A zebrafish model of MYC-driven acute myeloid leukemia reveals that neutrophil resistance to oncogenic transformation depends on their ability to promote PP2A-mediated MYC proteasomal degradation
Presenter: Anna Maria Luciano
Session: Poster session 18
851P - Characterization of a zebrafish model of MYC-driven acute myeloid leukemia
Presenter: Anna Maria Luciano
Session: Poster session 18
852P - c-MAF-driven metabolic reprogramming mediates H3K27 hyperacetylation to regulate super enhancer-associated genes
Presenter: Phyllis SY Chong
Session: Poster session 18
946P - Sintilimab plus lenvatinib as conversion therapy in patients with unresectable hepatocellular carcinoma: A prospective, non-randomized, open-label, phase II, expansion cohort study
Presenter: Shichun Lu
Session: Poster session 18