A Short Survey on Deep Learning for Multimodal Integration: Applications, Future Perspectives and Challenges
A Short Survey on Deep Learning for Multimodal Integration: Applications, Future Perspectives and Challenges
Blog Article
Deep learning has achieved state-of-the-art performances in several research applications nowadays: from computer vision to bioinformatics, from object detection to image generation.In the context of such newly developed deep-learning approaches, we can define the concept of multimodality.The objective of this research kenja 88 2023 field is to implement methodologies which can use several modalities as input features to perform predictions.In this, there is a strong analogy with respect to what happens with human cognition, since we rely on several different senses to make decisions.
In this article, we present a short survey puddles the platypus chocolate on multimodal integration using deep-learning methods.In a first instance, we comprehensively review the concept of multimodality, describing it from a two-dimensional perspective.First, we provide, in fact, a taxonomical description of the multimodality concept.Secondly, we define the second multimodality dimension as the one describing the fusion approaches in multimodal deep learning.
Eventually, we describe four applications of multimodal deep learning to the following fields of research: speech recognition, sentiment analysis, forensic applications and image processing.