AoE-MTL: Gate-Free Expert Selection for Efficient Multi-Task Learning
Authors/Creators
Description
Multi-gate Mixture-of-Experts (MMoE) architectures use task-specific gating networks for expert selection, but these gates add parameters, tuning overhead, and can suffer from expert imbalance. We propose AoE-MTL, a gate-free expert architecture where experts self-activate using task-specific L2-norms of their outputs. Built on a CGC-style mix of shared and task-specific experts, AoE-MTL enables flexible task routing with zero gating parameters or auxiliary balancing losses. Because activation is computed directly from expert outputs, routing reflects each expert’s actual contribution for a given task. Across NYUv2, PASCAL-Context, Cityscapes, and Office-Home, AoE-MTL matches or surpasses MMoE, CGC, and PLE, achieving average improvements of 9.31%, 2.09%, 5.23%, and 14.21% over MMoE with fewer parameters. These results show that autonomous norm-based expert selection is a simple, stable, and effective alternative to explicit gating in multi-task learning.
Files
main.pdf
Files
(426.4 kB)
| Name | Size | Download all |
|---|---|---|
|
md5:65968fffcf34d9b8b4733ea9dd585c4e
|
426.4 kB | Preview Download |
Additional details
Dates
- Created
-
2025-11-16
Software
- Repository URL
- https://github.com/anand-amon/aoemtl
- Programming language
- Python
- Development Status
- Active