Oops, you're using an old version of your browser so some of the features on this page may not be displaying properly.

MINIMAL Requirements: Google Chrome 24+Mozilla Firefox 20+Internet Explorer 11Opera 15–18Apple Safari 7SeaMonkey 2.15-2.23

Poster session 18

845P - A machine learning based clinical platform for cancer subtyping and data integration in hematological malignancies

Date

21 Oct 2023

Session

Poster session 18

Topics

Cancer Diagnostics

Tumour Site

Leukaemias

Presenters

Michelle Tang

Citation

Annals of Oncology (2023) 34 (suppl_2): S543-S553. 10.1016/S0923-7534(23)01263-2

Authors

M. Tang1, Z. Antic2, P. Fardzadeh1, C. Schröder2, S. Pietzsch2, J. Lentes2, W. Hofmann2, L. von Wolffersdorff1, M. Zimmermann3, M. Horstmann4, W. Nejdl1, G. Cario5, M. Stanulla3, A.K. Bergmann2

Author affiliations

  • 1 L3s Institute, Leibniz University of Hannover, 30167 - Hannover/DE
  • 2 Department Of Human Genetics, MHH - Medizinische Hochschule Hannover, 30625 - Hannover/DE
  • 3 Department Of Pediatric Hematology And Oncology, MHH - Medizinische Hochschule Hannover, 30625 - Hannover/DE
  • 4 Department Of Pediatric Hematology And Oncology, University Medical Center Hamburg-Eppendorf, 20246 - Hamburg/DE
  • 5 Department Of Pediatrics, UKSH - Universitätsklinikum Schleswig-Holstein - Campus Kiel, 24105 - Kiel/DE

Resources

Login to get immediate access to this content.

If you do not have an ESMO account, please create one for free.

Abstract 845P

Background

Next Generation sequencing (NGS) has greatly advanced precision oncology. The growing amount and complexity of NGS data, along with other clinical data such as drug responses and measurable residual disease (MRD), present a big challenge for data integration and interpretation. Machine learning, especially deep neural networks, offer a promising approach for efficiently analyzing intricate relationships within large datasets, with the potential of improving clinical outcomes.

Methods

We performed targeted RNA sequencing in routine diagnostics and sequenced 1849 cases with hematological neoplasms, primarily pediatric B-cell precursor acute lymphoblastic leukemia (BCP-ALL). We used featureCounts in megSAP pipeline and Uniform Manifold Approximation and Projection (UMAP) to analyze the gene expression data, and Bokeh to create the web interface for data integration and visualization. Scikit-learn and PyTorch were used to develop shallow machine learning models and deep learning neural networks (NN), respectively.

Results

Our platform integrates various genetic and clinical data based on UMAP analysis of gene expression patterns. It improves the point-of-care decision making and facilitates the discovery of new patterns such as subpopulations with different genetic and clinical features. The platform is supported by machine learning algorithms for cancer subtyping. Six basic machine learning algorithms were coupled with feature selection methods and the best F1 score achieves 98%. We also built a biologically informed deep NN that can accurately predict BCP-ALL subtypes (F1=97%) with a good interpretability, which helps to identify crucial genetic aberrations associated with disease subtypes.

Conclusions

Our machine learning based platform can not only provide support for clinical decision-making but also bring novel translational insights for hematological malignancies.

Clinical trial identification

Editorial acknowledgement

Legal entity responsible for the study

The authors.

Funding

Bundesministerium für Bildung und Forschung (BMBF).

Disclosure

All authors have declared no conflicts of interest.

This site uses cookies. Some of these cookies are essential, while others help us improve your experience by providing insights into how the site is being used.

For more detailed information on the cookies we use, please check our Privacy Policy.

Customise settings
  • Necessary cookies enable core functionality. The website cannot function properly without these cookies, and you can only disable them by changing your browser preferences.