CEED: Collaborative early exit neural network inference at the edge
File(s)m48489-chen final.pdf (1.03 MB)
Accepted version
Author(s)
Chen, Yichong
Roveri, Manuel
Casale, Giuliano
Type
Conference Paper
Abstract
Collaborative inference at the edge has gained
traction in recent years as one of the main trends within edge computing. The early exit neural network (EENN) architecture supports this by balancing inference time and accuracy with configurable early exit thresholds within the neural network. Such thresholds enable the dynamic tuning of the processing latency of a job based on confidence scores. However, most distributed EENN setups use a preset confidence threshold and assume constant data arrivals. This assumption exposes the system to potential data loss due to finite memory capacity in the edge devices. To address these issues, we propose CEED, an AI-based optimization framework to enable collaborative EENN inference on a multilayer edge infrastructure. CEED
integrates an EENN predictor and a Loss ratio predictor to
rapidly evaluate confidence threshold configurations and job assignment to devices. Experiments conducted on a physical testbed show that CEED significantly improves existing EENN inference methods by striking a better balance between end-toend system loss ratio and EENN inference accuracy.
traction in recent years as one of the main trends within edge computing. The early exit neural network (EENN) architecture supports this by balancing inference time and accuracy with configurable early exit thresholds within the neural network. Such thresholds enable the dynamic tuning of the processing latency of a job based on confidence scores. However, most distributed EENN setups use a preset confidence threshold and assume constant data arrivals. This assumption exposes the system to potential data loss due to finite memory capacity in the edge devices. To address these issues, we propose CEED, an AI-based optimization framework to enable collaborative EENN inference on a multilayer edge infrastructure. CEED
integrates an EENN predictor and a Loss ratio predictor to
rapidly evaluate confidence threshold configurations and job assignment to devices. Experiments conducted on a physical testbed show that CEED significantly improves existing EENN inference methods by striking a better balance between end-toend system loss ratio and EENN inference accuracy.
Date Acceptance
2024-12-06
Publisher
IEEE
Copyright Statement
Subject to copyright. This paper is embargoed until publication. Once published the author’s accepted manuscript will be made available under a CC-BY License in accordance with Imperial’s Research Publications Open Access policy (www.imperial.ac.uk/oa-policy).
License URL
Source
IEEE INFOCOM 2025
Publication Status
Accepted
Start Date
2025-05-19
Finish Date
2025-05-22
Coverage Spatial
London, UK
Rights Embargo Date
10000-01-01