Computer networks

Said, NainaNainaSaidLandsiedel, OlafOlafLandsiedel2025-07-092025-07-092025-09-01Computer Networks 269: 111437 (2025)https://hdl.handle.net/11420/56135Deploying large Deep Neural Networks with state-of-the-art accuracy on edge devices is often impractical due to their limited resources. This paper introduces EdgeBoost, a selective input offloading system designed to overcome the challenges of limited computational resources on edge devices. EdgeBoost trains and calibrates a lightweight model for deployment on the edge and, in addition, deploys a large, complex model on the cloud. During inference, the edge model makes initial predictions for input samples, and if the confidence of the prediction is low, the sample is sent to the cloud model for further processing, otherwise, we accept the local prediction. Through careful calibration, EdgeBoost reduces the communication cost by 55%, 27% and 20% for the CIFAR-100, ImageNet-1k and Stanford Cars datasets, respectively, when compared to an cloud-only solution while achieving on-par classification accuracy. Furthermore, EdgeBoost reduces the total inference latency from 148 ms to 123.84 ms per inference compared to a cloud-only solution. Our evaluation also shows that calibrating the edge model for such a collaborative edge–cloud setup results in accuracy gains of up to 8 percent point, compared to an uncalibrated edge model. Additionally, EdgeBoost, when used as an abstaining classifier, can improve accuracy by up to 9 percent points over an uncalibrated model. Finally, EdgeBoost outperforms the Early Exit and Entropy thresholding baselines and achieves comparable accuracy to state-of-the-art routing-based methods without the need for hosting the router on the edge.en1872-7069Computer networks2025Elsevierhttps://creativecommons.org/licenses/by/4.0/EdgeAI | Inference offloading | Lightweight models | MCU | Model calibration | Temperature scaling | TinyMLComputer Science, Information and General Works::006: Special computer methods::006.3: Artificial IntelligenceEdgeBoost: Confidence boosting for resource constrained inference via selective offloadingJournal Articlehttps://doi.org/10.15480/882.1536110.1016/j.comnet.2025.11143710.15480/882.15361Journal Article