Info: Zenodo’s user support line is staffed on regular business days between Dec 23 and Jan 5. Response times may be slightly longer than normal.

There is a newer version of the record available.

Published August 24, 2022 | Version v0.1-weights-maxx
Software Open

rwightman/pytorch-image-models: MaxxVit (CoAtNet, MaxVit, and related experimental weights)

  • 1. Weights & Biases
  • 2. independent
  • 3. Technical University of Darmstadt
  • 4. @huggingface
  • 5. Stony Brook Medicine
  • 6. MIT
  • 7. Kaggle Competition Master
  • 8. @toss
  • 9. Neuro Event Labs Oy
  • 10. Ajou University
  • 11. Indie developer
  • 12. University of Bern
  • 13. @NVIDIA
  • 14. MarkAny
  • 15. Mobility Technologies Co., Ltd.
  • 16. Xiamen University
  • 17. LG AI Research

Description

CoAtNet (https://arxiv.org/abs/2106.04803) and MaxVit (https://arxiv.org/abs/2204.01697) timm trained weights

Weights were created reproducing the paper architectures and exploring timm sepcific additions such as ConvNeXt blocks, parallel partitioning, and other experiments.

Weights were trained on a mix of TPU and GPU systems. Bulk of weights were trained on TPU via the TRC program (https://sites.research.google/trc/about/).

CoAtNet variants run particularly well on TPU, it's a great combination. MaxVit is better suited to GPU due to the window partitioning, although there are some optimizations that can be made to improve TPU padding/utilization incl using 256x256 image size (8, 8) windo/grid size, and keeping format in NCHW for partition attention when using PyTorch XLA.

Files

rwightman/pytorch-image-models-v0.1-weights-maxx.zip

Files (1.4 MB)

Additional details