CEED: Collaborative early exit neural network inference at the edge

Chen, Yichong; Roveri, Manuel; Casale, Giuliano

File(s)

m48489-chen final.pdf (1.03 MB)

Accepted version

Author(s)

Chen, Yichong

Roveri, Manuel

Casale, Giuliano

Type

Conference Paper

Abstract

Collaborative inference at the edge has gained
traction in recent years as one of the main trends within edge computing. The early exit neural network (EENN) architecture supports this by balancing inference time and accuracy with configurable early exit thresholds within the neural network. Such thresholds enable the dynamic tuning of the processing latency of a job based on confidence scores. However, most distributed EENN setups use a preset confidence threshold and assume constant data arrivals. This assumption exposes the system to potential data loss due to finite memory capacity in the edge devices. To address these issues, we propose CEED, an AI-based optimization framework to enable collaborative EENN inference on a multilayer edge infrastructure. CEED
integrates an EENN predictor and a Loss ratio predictor to
rapidly evaluate confidence threshold configurations and job assignment to devices. Experiments conducted on a physical testbed show that CEED significantly improves existing EENN inference methods by striking a better balance between end-toend system loss ratio and EENN inference accuracy.

Date Acceptance

2024-12-06

URI

http://hdl.handle.net/10044/1/116253

Publisher

IEEE

Copyright Statement

Subject to copyright. This paper is embargoed until publication. Once published the author’s accepted manuscript will be made available under a CC-BY License in accordance with Imperial’s Research Publications Open Access policy (www.imperial.ac.uk/oa-policy).

License URL

https://creativecommons.org/licenses/by/4.0/

Source

IEEE INFOCOM 2025

Publication Status

Accepted

Start Date

2025-05-19

Finish Date

2025-05-22

Coverage Spatial

London, UK

Rights Embargo Date

10000-01-01