Do the Frankenstein, or how to achieve better out-of-distribution performance with manifold mixing model soups

Fassold, Hannes

doi:10.5281/zenodo.8208680

Published August 2, 2023 | Version v1

Conference paper Open

Do the Frankenstein, or how to achieve better out-of-distribution performance with manifold mixing model soups

Fassold, Hannes¹

1. JOANNEUM RESEARCH

The standard recipe applied in transfer learning is to finetune a pretrained model on the task-specific dataset with different hyperparameter settings and pick the model with the highest accuracy on the validation dataset. Unfortunately, this leads to models which do not perform well under distribution shifts, e.g. when the model is given graphical sketches of the object as input instead of photos. In order to address this, we propose the manifold mixing model soup, an algorithm which mixes together the latent space manifolds of multiple finetuned models in an optimal way in order to generate a fused model. We show that the fused model gives significantly better out-of-distribution performance (+3.5 % compared to best individual model) when finetuning a CLIP model for image classification. In addition, it provides also better accuracy on the original dataset where the finetuning has been done.

Files

Manifold_mixing_model_soups_imvip_2023_final.pdf

Files (880.1 kB)

Name	Size	Download all
Manifold_mixing_model_soups_imvip_2023_final.pdf md5:7ebef2bb75bd795e9416aefb6a612e1b	880.1 kB	Preview Download

Additional details

European Commission
AI4Media - A European Excellence Centre for Media, Society and Democracy 951911

567

Views

432

Downloads

Show more details

	All versions	This version
Views	567	563
Downloads	432	429
Data volume	450.6 MB	448.0 MB

More info on how stats are collected....

DOI

Resource type

Conference paper

Publisher

Zenodo

Conference

Irish Machine Vision and Image Processing Conference 2023 (IMVIP 2023) , Galway, Ireland, 30/08/2023 - 01/09/2023 (Session New Datasets and Misc. Applications)

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited. Read more

Technical metadata

Created: August 2, 2023
Modified: July 11, 2024

Do the Frankenstein, or how to achieve better out-of-distribution performance with manifold mixing model soups

Authors/Creators

Description

Files

Manifold_mixing_model_soups_imvip_2023_final.pdf

Files (880.1 kB)

Additional details

Funding