Tango: Low Latency Multi-DNN Inference on Heterogeneous Edge Platforms

Taufique, Zain; Vyas, Aman; Miele, Antonio; Liljeberg, Pasi; Kanduri, Anil

doi:10.1109/ICCD63220.2024.00053

Published January 2, 2025 | Version v1

Conference paper Open

Tango: Low Latency Multi-DNN Inference on Heterogeneous Edge Platforms

1. University of Turku
2. Politecnico di Milano

Running deep neural network (DNN) applications on edge platforms requires low-latency inference. However, scheduling multiple DNN workloads with varying compute and latency needs on resource-constrained edge devices is challenging. This work introduces Tango, a framework that optimizes multi-DNN inference on heterogeneous edge platforms. Using a reinforcement learning agent, Tango balances cluster selection, accuracy configuration, and frequency scaling to minimize latency while maintaining acceptable accuracy. Implemented as portable middleware on Jetson TX, Tango achieves 61% lower latency and 48.4% lower energy consumption, with a maximum accuracy loss of 1.59%, outperforming existing scheduling strategies.

Files

tango_iccd24.pdf

Files (1.1 MB)

Name	Size	Download all
tango_iccd24.pdf md5:7e0902b9057f88a2d9c60f14f540c1df	1.1 MB	Preview Download

Additional details

DOI: 10.1109/ICCD63220.2024.00053

European Commission
APROPOS - Approximate Computing for Power and Energy Optimisation 956090

Available: 2025-02-01

Views

Downloads

Show more details

	All versions	This version
Views	27	27
Downloads	33	33
Data volume	42.2 MB	42.2 MB

More info on how stats are collected....

DOI

Resource type

Conference paper

Publisher

Zenodo

Conference

International Conference on Computer Design (ICCD), Milan, Italy, 18-20 November 2024

Languages

English

License: Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited. Read more

Technical metadata

Created: March 17, 2025
Modified: March 17, 2025

tango_iccd24.pdf

Files (1.1 MB)

Identifiers

Funding

Dates

Tango: Low Latency Multi-DNN Inference on Heterogeneous Edge Platforms

Authors/Creators

Description

Files

tango_iccd24.pdf

Files (1.1 MB)

Additional details

Identifiers

Funding

Dates