Cascaded Cross MLP-Mixer GANs for Cross-View Image Translation

doi:10.5281/zenodo.5901781

Published October 25, 2021 | Version v1

Conference paper Open

Cascaded Cross MLP-Mixer GANs for Cross-View Image Translation

1. University of Trento, Italy
2. ETH

It is hard to generate an image at target view well for previous cross-view image
translation methods that directly adopt a simple encoder-decoder or U-Net structure,
especially for drastically different views and severe deformation cases. To ease this
problem, we propose a novel two-stage framework with a new Cascaded Cross MLPMixer
(CrossMLP) sub-network in the first stage and one refined pixel-level loss in
the second stage. In the first stage, the CrossMLP sub-network learns the latent transformation
cues between image code and semantic map code via our novel CrossMLP
blocks. Then the coarse results are generated progressively under the guidance of those
cues. Moreover, in the second stage, we design a refined pixel-level loss that eases
the noisy semantic label problem with more reasonable regularization in a more compact
fashion for better optimization. Extensive experimental results on Dayton [40]
and CVUSA [42] datasets show that our method can generate significantly better results
than state-of-the-art methods. The source code and trained models are available at
https://github.com/Amazingren/CrossMLP.

Files

BinBMVC21.pdf

Files (5.6 MB)

Name	Size	Download all
BinBMVC21.pdf md5:4cc76a35674d04d5926306ff40b885ea	5.6 MB	Preview Download

Additional details

AI4Media – A European Excellence Centre for Media, Society and Democracy 951911: European Commission

	All versions	This version
Views	55	55
Downloads	51	51
Data volume	285.6 MB	285.6 MB

Cascaded Cross MLP-Mixer GANs for Cross-View Image Translation

Creators

Description

Files

BinBMVC21.pdf

Files (5.6 MB)

Additional details

Funding