Published November 16, 2025 | Version 1.0.0

AoE-MTL: Gate-Free Expert Selection for Efficient Multi-Task Learning

  • 1. ROR icon National Taipei University of Technology
  • 2. ROR icon Institute of Information Science, Academia Sinica
  • 3. Artificial Intelligence Research and Development Department, ELAN Microelectronics Corporation

Description

Multi-gate Mixture-of-Experts (MMoE) architectures use task-specific gating networks for expert selection, but these gates add parameters, tuning overhead, and can suffer from expert imbalance. We propose AoE-MTL, a gate-free expert architecture where experts self-activate using task-specific L2-norms of their outputs. Built on a CGC-style mix of shared and task-specific experts, AoE-MTL enables flexible task routing with zero gating parameters or auxiliary balancing losses. Because activation is computed directly from expert outputs, routing reflects each expert’s actual contribution for a given task. Across NYUv2, PASCAL-Context, Cityscapes, and Office-Home, AoE-MTL matches or surpasses MMoE, CGC, and PLE, achieving average improvements of 9.31%, 2.09%, 5.23%, and 14.21% over MMoE with fewer parameters. These results show that autonomous norm-based expert selection is a simple, stable, and effective alternative to explicit gating in multi-task learning.

Files

main.pdf

Files (426.4 kB)

Name Size Download all
md5:65968fffcf34d9b8b4733ea9dd585c4e
426.4 kB Preview Download

Additional details

Dates

Created
2025-11-16

Software

Repository URL
https://github.com/anand-amon/aoemtl
Programming language
Python
Development Status
Active