A systematic review of intermediate fusion in multimodal deep learning for biomedical applicationsShow others and affiliations
2025 (English)In: Image and Vision Computing, ISSN 0262-8856, E-ISSN 1872-8138, Vol. 158, article id 105509Article in journal (Refereed) Published
Abstract [en]
Deep learning has revolutionized biomedical research by providing sophisticated methods to handle complex, high-dimensional data. Multimodal deep learning (MDL) further enhances this capability by integrating diverse data types such as imaging, textual data, and genetic information, leading to more robust and accurate predictive models. In MDL, differently from early and late fusion methods, intermediate fusion stands out for its ability to effectively combine modality-specific features during the learning process. This systematic review comprehensively analyzes and formalizes current intermediate fusion methods in biomedical applications, highlighting their effectiveness in improving predictive performance and capturing complex inter-modal relationships. We investigate the techniques employed, the challenges faced, and potential future directions for advancing intermediate fusion methods. Additionally, we introduce a novel structured notation that standardizes intermediate fusion architectures, enhancing understanding and facilitating implementation across various domains. Our findings provide actionable insights and practical guidelines intended to support researchers, healthcare professionals, and the broader deep learning community in developing more sophisticated and insightful multimodal models. Through this review, we aim to provide a foundational framework for future research and practical applications in the dynamic field of MDL.
Place, publisher, year, edition, pages
Elsevier, 2025. Vol. 158, article id 105509
Keywords [en]
Biomedical data, Data fusion, Data integration, Fusion techniques, Healthcare, Joint fusion
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:umu:diva-237396DOI: 10.1016/j.imavis.2025.105509Scopus ID: 2-s2.0-105001226580OAI: oai:DiVA.org:umu-237396DiVA, id: diva2:1951322
Funder
The Kempe Foundations, JCSMK24-00942025-04-102025-04-102025-04-10Bibliographically approved