From Data to Dialogue: A Communication-Centered Perspective on AI-Based Multimodal Prediction of Neonatal Jaundice
DOI:
https://doi.org/10.52152/RCR.V13.23Keywords:
Artificial Intelligence (AI), Multimodal Prediction, Neonatal Jaundice, Healthcare CommunicationAbstract
Neonatal jaundice is a common issue in the newborn period, and timely, accurate monitoring and prediction are crucial to prevent severe complications (e.g. kernicterus). However, traditional diagnosis relies on invasive serum bilirubin tests or subjective judgment, which limits applicability in developing countries and home settings. In recent years, machine learning techniques have been introduced for non-invasive neonatal jaundice prediction: convolutional neural networks (CNNs) have been used for analyzing skin images of jaundice, and long short-term memory (LSTM) networks for time series prediction of bilirubin changes, achieving some success. Yet, CNNs and LSTMs have limitations such as local receptive fields and difficulty capturing long-term dependencies. This paper comprehensively reviews and reconstructs the machine learning framework for neonatal jaundice prediction by introducing the Transformer model as the core. We use a Vision Transformer (ViT) instead of CNN for processing skin images, and a time-series Transformer in place of LSTM for modeling dynamic data like transcutaneous bilirubin readings, and we fuse multimodal information to improve prediction accuracy. We highlight the potential of this Transformer-based framework in the health communication domain: for example, using attention heatmap visualization to improve model interpretability, and integrating with mobile health (mHealth) applications for remote monitoring and interaction, thereby increasing public awareness and trust in new technology. We detail the methodological architecture, demonstrate its promising application in early prediction of neonatal jaundice, discuss current challenges, and envision future research directions. By leveraging a Transformer-centric multimodal deep learning framework, neonatal jaundice prediction could achieve higher accuracy and interpretability, better serving clinical decision-making and family healthcare.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Junkai Li, Bo Sun, Mohd Rizon Mohamad Juhari, Tiang Sew Sun (Author)

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
The philosophy of the journal is to be open and to make all articles accessible. It is our belief that open access is a must in the future of science.
Authors who publish with RCR accept a slightly modified Creative Commons Attribution-NonCommercial 3.0 Unported License.
You, as author, retain the copyrights for your paper, but the Review of Communication Research is granted exclusivity for publication of the article. The agreement allows others to share the work with an acknowledgement of the work's authorship and publication in this journal. We do not want third parties to make a commercial use of the article, unless we agree it with authors.
The journal will run an open review process as well as a traditional peer review process.
When the manuscript is accepted for publication, it will get a doi number and get available online to facilitate early citation.
The journal will post the published article to many public repositories for further diffusion and permanence.
You, as author, are permitted and encouraged to post your work online (e.g., in institutional repositories or on your website), as it can lead to productive exchanges, as well as earlier and greater citation of published work.
If you have any doubts, please, contact the editor: editor@rcommunication.org
Many thanks for submitting your work to this journal.