62 results for “topic:multimodal-fusion”
Semantic Segmentation for Remote Sensing
SuperYOLO is accepted by TGRS
Multimodal Co-Attention Transformer for Survival Prediction in Gigapixel Whole Slide Images - ICCV 2021
Source code for "Bi-modal Transformer for Dense Video Captioning" (BMVC 2020)
This repository contains the official implementation of the paper "Improving Multimodal Fusion with Hierarchical Mutual Information Maximization for Multimodal Sentiment Analysis", accepted at EMNLP 2021.
MIntRec: A New Dataset for Multimodal Intent Recognition (ACM MM 2022)
E2E-MFD-OOD
Code for selecting an action based on multimodal inputs; in this case, the inputs are voice and text.
[AAAI-2026] Patho-AgenticRAG: Towards Multimodal Agentic Retrieval-Augmented Generation for Pathology VLMs via Reinforcement Learning
Creating multimodal multitask models
Multimodal sentiment analysis using hierarchical fusion with context modeling
[CVAMD 2021] "End-to-End Learning of Fused Image and Non-Image Feature for Improved Breast Cancer Classification from MRI"
Few-shot malware classification using fused features of static and dynamic analysis (a few-shot malware classification framework based on hybrid features from static and dynamic analysis)
Multimodal object tracking and scene analytics for highly actionable, real-world contextualized data
[2025] ModalFormer: Multimodal Transformer for Low-Light Image Enhancement
Multimodal sentiment analysis
[NeurIPS 2025] Implementation of the paper "InfMasking: Unleashing Synergistic Information by Contrastive Multimodal Interactions".
Official implementation of "Multi-scale Bottleneck Transformer for Weakly Supervised Multimodal Violence Detection"
This repository contains the dataset and baselines described in the paper "M2H2: A Multimodal Multiparty Hindi Dataset for Humor Recognition in Conversations".
FusionBrain Challenge 2.0: creating multimodal multitask models
E2E-MFD-HOD
VAPOR: Legged Robot Navigation in Outdoor Vegetation using Offline Reinforcement Learning (ICRA2024)
Deep-HOSeq: Deep Higher-Order Sequence Fusion for Multimodal Sentiment Analysis.
Repo for "Centaur: Robust Multimodal Fusion for Human Activity Recognition"
Source code for the paper "Automatic fused multimodal deep learning for plant identification" (Alfreds Lapkovskis, Natalia Nefedova & Ali Beikmohammadi, 2025)
Code for the paper "A Novel Cross Fusion Model with Fine-grained Detail Reconstruction for Remote Sensing Image Pan-sharpening", TGSI 2024.
A Transferability-guided Protein-Ligand Interaction Prediction Method
Official Pytorch Implementation of our paper: GAF-Net: Video-Based Person Re-Identification via Appearance and Gait Recognitions
[TGRS2025] This is the official PyTorch implementation of "PAD: Phase-Amplitude Decoupling Fusion for Multi-Modal Land Cover Classification"
Contributed to a vision-driven accessibility tool translating sign language into text