fastai-multimodal

Purpose:

To create end-to-end multimodal classifers based on Fastai-tabular, Fastai-text and Fastai-vision.

Specifically, I will construct 3 types of multimodal model:

early concat: concatinate cnt, cat, txt, img after data loading and data preprocessing, followed by a learner of choice (e.g. fastai tabular, TabNet, Deep-RF, GSN-VSN).
middle concat: concatinate the embeddings from each of the trained tab (cnt+cat), txt, img models, followed by a learner of choice.
late concat: concatinate the probability predictions from each of the trained tab(cnt+cat), txt, img models, followed by a learner of choice.

Using a few benchmark datasets, I will compare the 3 types of multimodal models on their

Every iteration, I am aiming to make this package 5% better, w.r.t.

Check out these notebooks here and here. Any advices and comments are welcomed. Please shot me an email here.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
.github/workflows		.github/workflows
data		data
docs		docs
nbdev_colab		nbdev_colab
nbs		nbs
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
Makefile		Makefile
README.md		README.md
__init__.py		__init__.py
settings.ini		settings.ini
setup.py		setup.py