Oops, you're using an old version of your browser so some of the features on this page may not be displaying properly.

MINIMAL Requirements: Google Chrome 24+Mozilla Firefox 20+Internet Explorer 11Opera 15–18Apple Safari 7SeaMonkey 2.15-2.23

Poster Display session

735P - Integrating a data curation artificial intelligence system to identify cancer registry data elements from unstructured electronic health records

Date

07 Dec 2024

Session

Poster Display session

Presenters

Yi-hsin Yang

Citation

Annals of Oncology (2024) 35 (suppl_4): S1679-S1697. 10.1016/annonc/annonc1699

Authors

Y. Yang1, H. Dai2, J. Tsai3, Y. Chang3, T. Wang3, C. Ke4, S. Yu5, Y. Liu5, C. Huang5, C. Lee6, Y. Lee7, J. Liang7, W. Fu8, S. Liao8, S. Kuo8, C. Huang9, L. Lin9, C. Wu9, G. Chang10, I. Chong11

Author affiliations

  • 1 National Institute Of Cancer Research, National Health Research Institutes - National Institute of Cancer Research, 70456 - Tainan/TW
  • 2 Department Of Electrical Engineering, National Kaohsiung University of Science and Technology, 807618 - Kaohsiung/TW
  • 3 National Institute Of Cancer Research, National Health Research Institutes - National Institute of Cancer Research, 704 - Tainan City/TW
  • 4 National Institute Of Cancer Research, National Kaohsiung University of Science and Technology, 807618 - Kaohsiung/TW
  • 5 Cancer Center, Kaohsiung Medical University Chung-Ho Memorial Hospital, 80756 - Kaohsiung City/TW
  • 6 Project Management Office For Intelligence Healthcare, Kaohsiung Medical University Chung-Ho Memorial Hospital, 80756 - Kaohsiung City/TW
  • 7 Department Of Medical Information, Kaohsiung Medical University Chung-Ho Memorial Hospital, 80756 - Kaohsiung City/TW
  • 8 Cancer Center, Chung Shan Medical University Hospital, 40201 - Taichung City/TW
  • 9 Health Promotion Administration, Ministry of Health and Welfare, 10341 - Taipei/TW
  • 10 Institute Of Medicine, Chung Shan Medical University Hospital, 40201 - Taichung City/TW
  • 11 Division Of Chest Medicine, Kaohsiung Medical University Chung-Ho Memorial Hospital, 80756 - Kaohsiung City/TW

Resources

This content is available to ESMO members and event participants.

Abstract 735P

Background

Cancer registry provides essential information to support precision medicine clinical practice and research. Traditionally, it requires cancer registrars’ manual abstraction from unstructured EHRs. The aim of this study is to demonstrate the performance of a hospital-based AI system in supporting cancer registry data element abstraction.

Methods

A natural language processing system was designed with ensemble voting from 3 sub-systems (Hybrid Neural Symbolic System, Hierarchical Attention Network, and Statistical Principle-Based Approach) to incorporate different abstraction rules in cancer diagnosis, staging and treatment data elements. Patient reports were annotated with cancer registry concepts to facilitate the manual coding process. The recommended coding of 40 data elements is provided to cancer registrars for 16 major cancers (oral, salivary gland, nasopharyngeal, esophageal, stomach, colorectal, liver, laryngeal, lung, breast, cervical, uterus, ovarian, prostate, bladder, blood) through a visualization platform. The performance was evaluated using precision, recall, and Fβ-measure (Fβ).

Results

There is total 229,375 reports (pathological, image and surgical notes) from 5451 patients. The average number of reports per person/cancer is 27.5 to 61.5 among different cancers. To emphasize the importance of precision, we use F(0.5)>0.85 as a performance target. There are 4 cancers (bladder, stomach, lung and prostate) with all 40 data elements reaching the target, 9 cancers with 30 to 39 elements, and 3 cancers (oral, salivary gland and breast) with less than 30 data elements. The developed AI system has been incorporated in eight hospitals for further prospective validations.

Conclusions

An ensemble voting system from 3 incorporated sub-systems provides a resolution to the complexity from cancer registry abstraction rules. Our system develops coding recommendations for cancer registrars with feasible performance, and consequently, may improve the quality of coding practices and reduce the labor and time resources for data abstraction.

Clinical trial identification

Editorial acknowledgement

Legal entity responsible for the study

Y-H. Yang.

Funding

Health Promotion Administration, Ministry of Health and Welfare, Taiwan.

Disclosure

All authors have declared no conflicts of interest.

This site uses cookies. Some of these cookies are essential, while others help us improve your experience by providing insights into how the site is being used.

For more detailed information on the cookies we use, please check our Privacy Policy.

Customise settings
  • Necessary cookies enable core functionality. The website cannot function properly without these cookies, and you can only disable them by changing your browser preferences.