The ECO4AI partial inference platform
Authors/Creators
- 1. REDS, School of Engineering and Management Vaud, HES-SO University of Applied Sciences and Arts Western Switzerland
Contributors
- 1. REDS, School of Engineering and Management Vaud, HES-SO University of Applied Sciences and Arts Western Switzerland
Description
In recent years, Artificial Intelligence (AI) has seen a remarkable expansion. Consequently, AI adoption has become widespread, spanning domains like autonomous driving, video surveillance, and e-health. Traditionally, applications primarily relied on edge computing, closer to data sources. However, edge devices alone may not be able to handle modern AI applications. The primary objective of this paper is to propose a workload allocation framework that efficiently distributes inference tasks between edge and cloud, enhancing performance for AI-based IoT applications. Our test environment consists of a resource-constrained drone running a custom DNN model for obstacle detection that showcases 80% accuracy and 71% F2-score on the test set. Our proposed online decision algorithm, based on multi-armed contextual bandits, runs on the drone and chooses the optimal cutting point for each iteration on the fly. The decision is based on both predicted back-end delays (i.e. data transfer and inference time on the server) and observed front-end delays (i.e. inference time on the edge device). The algorithm achieves 83% accuracy and 89% Top3 accuracy in predicting the ideal cutting point on simulations. Our findings demonstrate the flexibility of the allocation in response to dynamic real-world environments, allowing computation offloading when the network becomes efficient while relying on edge computing when network quality deteriorates. Our proposed DNN and partitioning algorithms are deployed and tested in an ultra-low power GAP8 microprocessor. This involves key steps including quantization, GAP8 code generation from an ONNX model, and optimization to make our algorithms run on the drone and avoid obstacles.
Files
ECO4AI-main.zip
Files
(26.7 MB)
| Name | Size | Download all |
|---|---|---|
|
md5:2de53f5e29331450197d449b57bc1fe4
|
26.7 MB | Preview Download |
Additional details
Funding
- HES-SO Vaud
- This work has been funded by the ECO4AI project granted by HES-SO, under the call for young researchers 2021. 115090
Software
- Repository URL
- https://github.com/reds-heig/ECO4AI/
- Programming language
- Python , C++
- Development Status
- Active