» Articles » PMID: 32186998

A Survey on Deep Learning for Multimodal Data Fusion

Overview
Journal Neural Comput
Publisher MIT Press
Date 2020 Mar 19
PMID 32186998
Citations 46
Authors
Affiliations
Soon will be listed here.
Abstract

With the wide deployments of heterogeneous networks, huge amounts of data with characteristics of high volume, high variety, high velocity, and high veracity are generated. These data, referred to multimodal big data, contain abundant intermodality and cross-modality information and pose vast challenges on traditional data fusion methods. In this review, we present some pioneering deep learning models to fuse these multimodal big data. With the increasing exploration of the multimodal big data, there are still some challenges to be addressed. Thus, this review presents a survey on deep learning for multimodal data fusion to provide readers, regardless of their original community, with the fundamentals of multimodal deep learning fusion method and to motivate new multimodal data fusion techniques of deep learning. Specifically, representative architectures that are widely used are summarized as fundamental to the understanding of multimodal deep learning. Then the current pioneering multimodal data fusion deep learning models are summarized. Finally, some challenges and future topics of multimodal data fusion deep learning models are described.

Citing Articles

Emerging trends in SERS-based veterinary drug detection: multifunctional substrates and intelligent data approaches.

Yin T, Peng Y, Chao K, Li Y NPJ Sci Food. 2025; 9(1):31.

PMID: 40089516 DOI: 10.1038/s41538-025-00393-z.


A method of deep network auto-training based on the MTPI auto-transfer learning and a reinforcement learning algorithm for vegetation detection in a dry thermal valley environment.

Chen Y, Zhou B, Xiaopeng C, Ma C, Cui L, Lei F Front Plant Sci. 2025; 15:1448669.

PMID: 40017619 PMC: 11864880. DOI: 10.3389/fpls.2024.1448669.


Multimodality Fusion Aspects of Medical Diagnosis: A Comprehensive Review.

Kumar S, Rani S, Sharma S, Min H Bioengineering (Basel). 2025; 11(12.

PMID: 39768051 PMC: 11672922. DOI: 10.3390/bioengineering11121233.


Low-Rank Tensor Fusion for Enhanced Deep Learning-Based Multimodal Brain Age Estimation.

Liu X, Zheng G, Beheshti I, Ji S, Gou Z, Cui W Brain Sci. 2025; 14(12.

PMID: 39766451 PMC: 11674316. DOI: 10.3390/brainsci14121252.


Damage identification based on the inner product matrix and parallel convolution neural network for frame structure.

He Y, Feng J, Sun B, Wang F, Zhang L, Jiang J Sci Rep. 2024; 14(1):30548.

PMID: 39695320 PMC: 11655645. DOI: 10.1038/s41598-024-82058-7.