Skip to content
/ TMF Public

Multimodal Text-Image Classification code

Notifications You must be signed in to change notification settings

dlutor/TMF

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Preparation

pip install -r requirements.txt

Datasets

MVSA: Download from MVSA kaggle. Put data to datasets/MVSA_Single.

UPMC Food101: Download from UPMC Food101 kaggle. Put images to datasets/food101.

CrisisMMD: Download from CrisisMMD v2.0. Put data_image to datasets/CrisisMMD.

N24News: Download from N24News. Put imgs to datasets/N24News.

Train and Test

bash ./shells/train_MVSA.sh
bash ./shells/trainCrisisMMD_h.sh
bash ./shells/trainfood101.sh
bash ./shells/trainfood101_vit.sh
bash ./shells/trainN24News_a.sh

"# TMF Two-Stage Multimodal Fusion"

About

Multimodal Text-Image Classification code

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published