Multimodal EHR Data Fusion with Masked Lab-Test Modeling and Large Language Models
Electronic Health Records (EHRs) are multimodal by nature, consisting of structured tabular features like lab tests and unstructured clinical notes. In real-life clinical practice, doctors use complementary multimodal EHR data sources to get a clearer picture of patients' health and support clinical decision-making. However, most EHR predictive models do not reflect these procedures, as they either focus on a single modality or overlook the inter-modality interactions and redundancy.
In this work, we propose MEDFuse, a Multimodal EHR Data Fusion framework that incorporates masked lab-test modeling and large language models (LLMs) to effectively integrate structured and unstructured medical data. MEDFuse leverages multimodal embeddings extracted from two sources: LLMs fine-tuned on free clinical text and masked tabular transformers trained on structured lab test results.
We design a disentangled transformer module, optimized by a mutual information loss to decouple modality-specific and modality-shared information and extract useful joint representation from the noise and redundancy present in clinical notes. Through comprehensive validation on the public MIMIC-III dataset and the in-house FEMH dataset, MEDFuse demonstrates great potential in advancing clinical predictions, achieving over 90% F1 score in the 10-disease multi-label classification task.
A comprehensive framework that effectively combines structured lab tests and unstructured clinical notes
Fine-tuned LLMs process unstructured clinical notes while masked lab-test modeling handles structured tabular data, creating rich semantic representations for each modality.
Extends Masked Autoencoders framework to reconstruct masked lab test components, using asymmetric encoder-decoder architecture to extract meaningful representations from structured data.
Optimized with mutual information loss to separate modality-specific and modality-shared information, extracting useful joint representations while reducing noise and redundancy.
MEDFuse significantly outperforms baseline approaches across all evaluation metrics
Comprehensive evaluation on both public and real-world clinical datasets
Publicly available critical care database used for benchmarking medical AI systems. Focused on top 10 most prevalent conditions including hypertension, cardiac arrhythmias, and diabetes.
Real-world EHR data from Far Eastern Memorial Hospital (2017-2021) including clinical notes, lab results, and comprehensive patient records.
Each component contributes significantly to the overall model performance
@article{phan2024medfuse, title={MEDFuse: Multimodal EHR Data Fusion with Masked Lab-Test Modeling and Large Language Models}, author={Phan, Thao Minh Nguyen and Dao, Cong-Tinh and Wu, Chenwei and Wang, Jian-Zhe and Liu, Shun and Ding, Jun-En and Restrepo, David and Liu, Feng and Hung, Fang-Ming and Peng, Wen-Chih}, journal={Proceedings of the 33rd ACM International Conference on Information and Knowledge Management}, year={2024}, publisher={ACM}, doi={10.1145/3627673.3679962} }