Umeå University's logo

umu.sePublications
Change search
Link to record
Permanent link

Direct link
Nguyen, Chanh Le TanORCID iD iconorcid.org/0000-0002-9156-3364
Publications (10 of 20) Show all publications
Nguyen, C. L., Klein, C. & Elmroth, E. (2026). Efficient online application placement strategies in mobile edge clouds. In: Claus Pahl; Maarten van Steen (Ed.), Cloud Computing and Services Science: 14th International Conference, CLOSER 2024, Revised Selected Papers. Paper presented at 14th International Conference on Cloud Computing and Services Science, CLOSER 2024, Angers, France, May 2–4, 2024. (pp. 98-123). Springer Science+Business Media B.V.
Open this publication in new window or tab >>Efficient online application placement strategies in mobile edge clouds
2026 (English)In: Cloud Computing and Services Science: 14th International Conference, CLOSER 2024, Revised Selected Papers / [ed] Claus Pahl; Maarten van Steen, Springer Science+Business Media B.V., 2026, p. 98-123Conference paper, Published paper (Refereed)
Abstract [en]

Mobile Edge Clouds (MECs) are emerging as a key complement to centralized cloud infrastructures by bringing computing and storage resources closer to the network edge, thereby reducing network bandwidth, latency, and jitter. A critical challenge in leveraging MECs effectively is the application placement problem, which seeks to minimize operational costs while ensuring end-user Quality of Service (QoS). This problem is further complicated by user mobility, as applications must migrate to maintain optimal QoS, yet frequent migrations can lead to unnecessary bandwidth consumption due to state transfer. In this paper, we tackle the application placement problem for stateful applications in MEC environments. We model the dynamic workloads, applications, and infrastructure typical of MECs and define the associated costs: resource utilization, migration, and QoS degradation. Based on this model, we propose two online placement algorithms – Gale-Shapley-based and Follow-me – designed to minimize the total cost of operating applications. These algorithms are compared against an offline benchmark that has complete future knowledge. Experimental results demonstrate that both proposed algorithms efficiently place applications in MECs, achieving operating costs within 8% of the global optimum approximated by the offline algorithm. Furthermore, the Gale-Shapley-based algorithm outperforms the Follow-me algorithm, reducing operating costs by up to 17% and improving load balancing across MECs to mitigate resource scarcity.

Place, publisher, year, edition, pages
Springer Science+Business Media B.V., 2026
Series
Communications in Computer and Information Science, ISSN 1865-0929, E-ISSN 1865-0937 ; 2851
Keywords
Application placement, Mobile edge clouds, Optimization, Service orchestration
National Category
Computer Sciences Computer Systems
Identifiers
urn:nbn:se:umu:diva-250627 (URN)10.1007/978-3-032-17286-0_5 (DOI)2-s2.0-105030273381 (Scopus ID)9783032172853 (ISBN)9783032172860 (ISBN)
Conference
14th International Conference on Cloud Computing and Services Science, CLOSER 2024, Angers, France, May 2–4, 2024.
Available from: 2026-03-13 Created: 2026-03-13 Last updated: 2026-04-10Bibliographically approved
Nguyen, C. L. & Elmroth, E. (2026). Trust-aware routing for distributed generative AI inference at the edge. In: : . Paper presented at The 22nd Annual International Conference on Distributed Computing in Smart Systems and the Internet of Things (DCOSS-IoT 2026), Reykjavik, Iceland, June 22-24, 2026. Reykjavik, Iceland
Open this publication in new window or tab >>Trust-aware routing for distributed generative AI inference at the edge
2026 (English)Conference paper, Oral presentation only (Refereed)
Abstract [en]

Emerging deployments of Generative AI increasingly execute inference across decentralized and heterogeneous edge devices rather than on a single trusted server. In such environments, a single device failure or misbehavior can disrupt the entire inference process, making traditional best-effort peer-to-peer routing insufficient. Coordinating distributed generative inference therefore requires mechanisms that explicitly account for reliability, performance variability, and trust among participating peers.

In this paper, we present G-TRAC, a trust-aware coordination framework that integrates algorithmic path selection with system-level protocol design to ensure robust distributed inference.First, we formulate the routing problem as a Risk-Bounded Shortest Path computation and introduce a polynomial-time solution that combines trust-floor pruning with Dijkstra's search, achieving sub-millisecond median routing latency at practical edge scales, and remaining below 10 ms at larger scales.Second, to operationally support the routing logic in dynamic environments, the framework employs a Hybrid Trust Architecture that maintains global reputation state at stable anchors while disseminating lightweight updates to edge peers via background synchronization.

Experimental evaluation on a heterogeneous testbed of commodity devices demonstrates that G-TRAC significantly improves inference completion rates, effectively isolates unreliable peers, and sustains robust execution even under node failures and network partitions.

Place, publisher, year, edition, pages
Reykjavik, Iceland: , 2026
Keywords
Edge Computing, Edge Intelligence, Distributed LLM Inference, Risk-Bounded Systems, Trust-Aware Routing, Pipeline Parallelism
National Category
Computer Sciences Networked, Parallel and Distributed Computing
Research subject
Computer Science
Identifiers
urn:nbn:se:umu:diva-251612 (URN)
Conference
The 22nd Annual International Conference on Distributed Computing in Smart Systems and the Internet of Things (DCOSS-IoT 2026), Reykjavik, Iceland, June 22-24, 2026
Available from: 2026-03-31 Created: 2026-03-31 Last updated: 2026-04-01
Zaland, O., Nguyen, C. L., Pokorny, F. T. & Bhuyan, M. (2025). Federated learning for large-scale cloud robotic manipulation: opportunities and challenges. In: Proceedings of 2025 International Conferenceon Machine Learning and Cybernetics: . Paper presented at 24th International Conference on Machine Learning and Cybernetics, ICMLC 2025, Bali, Indonesia, 13-15 July, 2025. (pp. 254-261). Institute of Electrical and Electronics Engineers (IEEE)
Open this publication in new window or tab >>Federated learning for large-scale cloud robotic manipulation: opportunities and challenges
2025 (English)In: Proceedings of 2025 International Conferenceon Machine Learning and Cybernetics, Institute of Electrical and Electronics Engineers (IEEE), 2025, p. 254-261Conference paper, Published paper (Refereed)
Abstract [en]

Federated Learning (FL) is an emerging distributed machine learning paradigm, where the collaborative training of a model involves dynamic participation of devices to achieve broad objectives. In contrast, classical machine learning (ML) typically requires data to be located on-premises for training, whereas FL leverages numerous user devices to train a shared global model without the need to share private data. Current robotic manipulation tasks are constrained by the individual capabilities and speed of robots due to limited low-latency computing resources. Consequently, the concept of cloud robotics has emerged, allowing robotic applications to harness the flexibility and reliability of computing resources, effectively alleviating their computational demands across the cloud-edge continuum. Undoubtedly, within this distributed computing context, as exemplified in cloud robotic manipulation scenarios, FL offers manifold advantages while also presenting several challenges and opportunities. In this paper, we present fundamental concepts of FL and their connection to cloud robotic manipulation. Additionally, we envision the opportunities and challenges associated with realizing efficient and reliable cloud robotic manipulation at scale through FL, where researchers adapt to design and verify FL models in either centralized or decentralized settings.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2025
Series
International Conference on Machine Learning and Cybernetics, ISSN 2160-133X, E-ISSN 2160-1348
Keywords
Cloud robotics, Federated learning, Robotic manipulation
National Category
Robotics and automation Computer Sciences
Identifiers
urn:nbn:se:umu:diva-251031 (URN)10.1109/ICMLC66258.2025.11280176 (DOI)2-s2.0-105031639263 (Scopus ID)9798331587369 (ISBN)
Conference
24th International Conference on Machine Learning and Cybernetics, ICMLC 2025, Bali, Indonesia, 13-15 July, 2025.
Funder
Wallenberg AI, Autonomous Systems and Software Program (WASP)
Available from: 2026-04-02 Created: 2026-04-02 Last updated: 2026-04-02Bibliographically approved
Nguyen, C. L., Elmroth, E. & Bhuyan, M. (2025). Silent failures in stateless systems: rethinking anomaly detection for serverless computing. In: 2025 IEEE international conference on service-oriented system engineering (SOSE): . Paper presented at 2025 IEEE International Conference on Service-Oriented System Engineering (SOSE), Tucson, USA, July 21-24, 2025 (pp. 8-19). IEEE
Open this publication in new window or tab >>Silent failures in stateless systems: rethinking anomaly detection for serverless computing
2025 (English)In: 2025 IEEE international conference on service-oriented system engineering (SOSE), IEEE, 2025, p. 8-19Conference paper, Published paper (Refereed)
Abstract [en]

Serverless computing has redefined cloud application deployment by abstracting infrastructure and enabling on-demand, event-driven execution, thereby enhancing developer agility and scalability. However, maintaining consistent application performance in serverless environments remains a significant challenge. The dynamic and transient nature of serverless functions makes it difficult to distinguish between benign and anomalous behavior, which in turn undermines the effectiveness of traditional anomaly detection methods. These conventional approaches, designed for stateful and long-running services, struggle in serverless settings where executions are short-lived, functions are isolated, and observability is limited. 

In this first comprehensive vision paper on anomaly detection for serverless systems, we systematically explore the unique challenges posed by this paradigm, including the absence of persistent state, inconsistent monitoring granularity, and the difficulty of correlating behaviors across distributed functions. We further examine a range of threats that manifest as anomalies, from classical Denial-of-Service (DoS) attacks to serverless-specific threats such as Denial-of-Wallet (DoW) and cold start amplification. Building on these observations, we articulate a research agenda for next-generation detection frameworks that address the need for context-aware, multi-source data fusion, real-time, lightweight, privacy-preserving, and edge-cloud adaptive capabilities.

Through the identification of key research directions and design principles, we aim to lay the foundation for the next generation of anomaly detection in cloud-native, serverless ecosystems.

Place, publisher, year, edition, pages
IEEE, 2025
Series
Proceedings, ISSN 2640-8228, E-ISSN 2642-6587
Keywords
Serverless Computing, Cloud Computing, Edge Computing, Function-as-a-service, Anomaly Detection, DoS, Data Fusion, System Monitoring, Observability
National Category
Computer Sciences
Research subject
Computer Science; Computer Systems
Identifiers
urn:nbn:se:umu:diva-243592 (URN)10.1109/SOSE67019.2025.00006 (DOI)2-s2.0-105016200742 (Scopus ID)979-8-3315-8912-7 (ISBN)979-8-3315-8911-0 (ISBN)
Conference
2025 IEEE International Conference on Service-Oriented System Engineering (SOSE), Tucson, USA, July 21-24, 2025
Funder
Wallenberg AI, Autonomous Systems and Software Program (WASP)EU, Horizon Europe, 101092711
Available from: 2025-08-29 Created: 2025-08-29 Last updated: 2025-10-10Bibliographically approved
Nguyen, C., Bhuyan, M. & Elmroth, E. (2025). Taming cold starts: proactive serverless scheduling with model predictive control. In: Proceedings: 2025 IEEE 33rd International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems. MASCOTS: Paris, France 21-23 October 2025. Paper presented at MASCOTS 2025: 33rd International Symposium on the Modeling, Analysis, and Simulation of Computer and Telecommunication System, Paris, France, October 21-23, 2025 (pp. 1-8). IEEE
Open this publication in new window or tab >>Taming cold starts: proactive serverless scheduling with model predictive control
2025 (English)In: Proceedings: 2025 IEEE 33rd International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems. MASCOTS: Paris, France 21-23 October 2025, IEEE, 2025, p. 1-8Conference paper, Oral presentation only (Refereed)
Abstract [en]

Serverless computing has transformed cloud application deployment by introducing a fine-grained, event-driven execution model that abstracts away infrastructure management. Its on-demand nature makes it especially appealing for latency-sensitive and bursty workloads. However, the cold start problem, i.e., where the platform incurs significant delay when provisioning new containers, remains the Achilles' heel of such platforms. 

This paper presents a predictive serverless scheduling framework based on Model Predictive Control to proactively mitigate cold starts, thereby improving end-to-end response time. By forecasting future invocations, the controller jointly optimizes container prewarming and request dispatching, improving latency while minimizing resource overhead.

We implement our approach on Apache OpenWhisk, deployed on a Kubernetes-based testbed. Experimental results using real-world function traces and synthetic workloads demonstrate that our method significantly outperforms state-of-the-art baselines, achieving up to 85% lower tail latency and a 34% reduction in resource usage.

Place, publisher, year, edition, pages
IEEE, 2025
Series
IEEE International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems, ISSN 1526-7539, E-ISSN 2375-0227
Keywords
Serverless, Cloud Computing, Orchestration, Cold Start, Function-as-a-service, Model Predictive Control, Prediction, Request Shaping
National Category
Computer Sciences
Research subject
Computer Science
Identifiers
urn:nbn:se:umu:diva-243593 (URN)10.1109/MASCOTS67699.2025.11283271 (DOI)2-s2.0-105031750750 (Scopus ID)979-8-3315-5761-4 (ISBN)979-8-3315-5760-7 (ISBN)
Conference
MASCOTS 2025: 33rd International Symposium on the Modeling, Analysis, and Simulation of Computer and Telecommunication System, Paris, France, October 21-23, 2025
Funder
Wallenberg AI, Autonomous Systems and Software Program (WASP)EU, Horizon Europe, 101092711
Available from: 2025-09-03 Created: 2025-09-03 Last updated: 2026-04-02Bibliographically approved
Nguyen, C. L., Seo, E., Zahid, M., Larsson, O., T. Pokorny, F. & Elmroth, E. (2025). tinyKube: a middleware for dynamic resource management in cloud-edge platforms for large-scale cloud robotics. In: The 38th IEEE/IFIP Network Operations and Management Symposium (NOMS): . Paper presented at IEEE/IFIP 2025, The 38th IEEE/IFIP Network Operations and Management Symposium (NOMS), Honolulu, HI, USA, May 12-16, 2025.
Open this publication in new window or tab >>tinyKube: a middleware for dynamic resource management in cloud-edge platforms for large-scale cloud robotics
Show others...
2025 (English)In: The 38th IEEE/IFIP Network Operations and Management Symposium (NOMS), 2025Conference paper, Published paper (Refereed)
Abstract [en]

With the rise of ubiquitous networking and distributed computing, integrating robots with cloud-edge infrastructures offers significant potential. However, challenges remain in resource allocation and scheduling across distributed environments to meet robotics applications' performance demands. 

This paper introduces tinyKube, a middleware tailored for dynamic resource management across the cloud-edge platform for large-scale cloud robotics deployments. Leveraging Kubernetes for orchestration and Prometheus for monitoring, tinyKube enables unified monitoring, task dispatching, and resource provisioning across cloud-edge infrastructures.

We evaluate tinyKube using a robotic gripper application on the CloudGripper testbed in a real-world cloud-edge setup. Results demonstrate its ability to automate task dispatching and resource allocation, dynamically adapting to QoS requirements and workload variations.By simplifying resource management, tinyKube accelerates the development, testing, and deployment of large-scale cloud robotics applications, facilitating more efficient real-world implementation.

Keywords
Cloud Robotics, Cloud-Edge Infrastructure, Dynamic Orchestration, Middleware, Performance Monitoring
National Category
Computer Vision and Robotics (Autonomous Systems) Computer Sciences Computer Systems
Research subject
Computer Science
Identifiers
urn:nbn:se:umu:diva-234935 (URN)10.1109/NOMS57970.2025.11073596 (DOI)2-s2.0-105012200792 (Scopus ID)979-8-3315-3163-8 (ISBN)979-8-3315-3164-5 (ISBN)
Conference
IEEE/IFIP 2025, The 38th IEEE/IFIP Network Operations and Management Symposium (NOMS), Honolulu, HI, USA, May 12-16, 2025
Projects
NEST-project: Cloud-Robotics
Funder
Wallenberg AI, Autonomous Systems and Software Program (WASP)
Available from: 2025-02-16 Created: 2025-02-16 Last updated: 2025-08-08Bibliographically approved
Le Duc, T., Nguyen, C. L. & Östberg, P.-O. (2025). Workload prediction for proactive resource allocation in large-scale cloud-edge applications. Electronics, 14(16), Article ID 3333.
Open this publication in new window or tab >>Workload prediction for proactive resource allocation in large-scale cloud-edge applications
2025 (English)In: Electronics, E-ISSN 2079-9292, Vol. 14, no 16, article id 3333Article in journal (Refereed) Published
Abstract [en]

Accurate workload prediction is essential for proactive resource allocation in large-scale Content Delivery Networks (CDNs), where traffic patterns are highly dynamic and geographically distributed. This paper introduces a CDN-tailored prediction and autoscaling framework that integrates statistical and deep learning models within an adaptive feedback loop. The framework is evaluated using 18 months of real traffic traces from a production multi-tier CDN, capturing realistic workload seasonality, cache–tier interactions, and propagation delays. Unlike generic cloud-edge predictors, our design incorporates CDN-specific features and model-switching mechanisms to balance prediction accuracy with computational cost. Seasonal ARIMA (S-ARIMA), Long Short-Term Memory (LSTM), Bidirectional LSTM (Bi-LSTM), and Online Sequential Extreme Learning Machine (OS-ELM) are combined to support both short-horizon scaling and longer-term capacity planning. The predictions drive a queue-based resource-estimation model, enabling proactive cache–server scaling with low rejection rates. Experimental results demonstrate that the framework maintains high accuracy while reducing computational overhead through adaptive model selection. The proposed approach offers a practical, production-tested solution for predictive autoscaling in CDNs and can be extended to other latency-sensitive edge-cloud services with hierarchical architectures.

Place, publisher, year, edition, pages
MDPI, 2025
Keywords
workload modeling, workload prediction, resource allocation, prediction framework, ARIMA, LSTM, OS-ELM, CDN
National Category
Computer Sciences
Research subject
Computer Science; Computer Systems
Identifiers
urn:nbn:se:umu:diva-243464 (URN)10.3390/electronics14163333 (DOI)2-s2.0-105014373697 (Scopus ID)
Funder
Wallenberg AI, Autonomous Systems and Software Program (WASP)EU, Horizon 2020, 732667
Available from: 2025-08-22 Created: 2025-08-22 Last updated: 2025-09-24Bibliographically approved
Nguyen, C. L., Kidane, L., Vo Nguyen Le, D. & Elmroth, E. (2024). CloudResilienceML: ensuring robustness of machine learning models in dynamic cloud systems. In: Proceedings - 2024 IEEE/ACM 17th International Conference on Utility and Cloud Computing, UCC 2024, IEEE, 2024, p. 73-81: . Paper presented at The 17th IEEE/ACM International Conference on Utility and Cloud Computing (UCC 2024), University of Sharjah, Sharjah, United Arab Emirates, 16-19 December, 2024 (pp. 73-81). Institute of Electrical and Electronics Engineers (IEEE)
Open this publication in new window or tab >>CloudResilienceML: ensuring robustness of machine learning models in dynamic cloud systems
2024 (English)In: Proceedings - 2024 IEEE/ACM 17th International Conference on Utility and Cloud Computing, UCC 2024, IEEE, 2024, p. 73-81, Institute of Electrical and Electronics Engineers (IEEE), 2024, p. 73-81Conference paper, Published paper (Refereed)
Abstract [en]

Machine Learning (ML) models play a crucial role in enabling intelligent decision-making across diverse cloud system management tasks. However, as cloud operational data evolves, shifts in data distributions can occur, leading to a gradual degradation of deployed ML models and, consequently, a reduction in the overall efficiency of cloud systems.

We introduce CloudResilienceML, a framework designed to maintain the resilience of ML models in dynamic cloud environments. CloudResilienceML includes: (1) a performance degradation detection mechanism, using dynamic programming change point detection to identify when a model needs retraining, and (2) a data valuation method to select a minimal, effective training set for retraining, reducing unnecessary overhead.

Evaluated with two ML models on real cloud operational data, CloudResilienceML significantly boosts model resilience and reduces retraining costs compared to incremental learning and data drift-based retraining. In high-drift scenarios (e.g., Wikipedia trace), it reduces overhead by 50% compared to concept drift retraining and by 91% compared to incremental retraining. In stable environments (e.g., Microsoft Azure trace), CloudResilienceML maintains high accuracy with retraining costs 96% lower than concept drift methods and 86% lower than incremental retraining.

Place, publisher, year, edition, pages
Institute of Electrical and Electronics Engineers (IEEE), 2024
Keywords
Autonomous System, Change Point Detection, Cloud Operational Data, Data Drift, Machine Learning, Resource Management, Time series
National Category
Computer Sciences
Research subject
Computer Science
Identifiers
urn:nbn:se:umu:diva-231946 (URN)10.1109/UCC63386.2024.00019 (DOI)2-s2.0-105004740726 (Scopus ID)9798350367201 (ISBN)9798350367218 (ISBN)
Conference
The 17th IEEE/ACM International Conference on Utility and Cloud Computing (UCC 2024), University of Sharjah, Sharjah, United Arab Emirates, 16-19 December, 2024
Funder
Wallenberg AI, Autonomous Systems and Software Program (WASP)eSSENCE - An eScience Collaboration
Available from: 2024-12-21 Created: 2024-12-21 Last updated: 2025-06-17Bibliographically approved
Nguyen, C., Bhuyan, M. & Elmroth, E. (2024). Enhancing machine learning performance in dynamic cloud environments with auto-adaptive models. In: The 15th IEEE International Conference on Cloud Computing Technology and Science (CloudCom 2024): Proceedings. Paper presented at The 15th IEEE International Conference on Cloud Computing Technology and Science (CloudCom 2024), Khalifa University, Abu Dabi, United Arab Emirates, 9-11 Dec, 2024 (pp. 184-191). IEEE
Open this publication in new window or tab >>Enhancing machine learning performance in dynamic cloud environments with auto-adaptive models
2024 (English)In: The 15th IEEE International Conference on Cloud Computing Technology and Science (CloudCom 2024): Proceedings, IEEE, 2024, p. 184-191Conference paper, Published paper (Refereed)
Abstract [en]

Autonomous resource management is essential for large-scale cloud data centers, where Machine Learning~(ML) enables intelligent decision-making. However, shifts in data patterns within operational streams pose significant challenges to sustaining model accuracy and system efficiency.

This paper proposes an auto-adaptive ML approach to mitigate the impact of data drift in cloud systems. A knowledge base of distinct time-series batches and corresponding ML models is constructed and clustered using the Hierarchical Density-Based Spatial Clustering of Applications with Noise (HDBSCAN) algorithm. When model performance degrades, the system uses Dynamic Time Warping (DTW) to retrieve matching hyperparameters from the knowledge base and apply them to the deployed model, optimizing inference accuracy on new data streams.

Experiments with two real-world cloud data traces -- representing both stable and highly fluctuating environments - demonstrate that the proposed approach maintains high model accuracy (over 89%) while minimizing retraining costs. Specifically, for the Wikipedia trace with frequent data drift, retraining overhead is reduced by 22.9% compared to drift detection-based retraining and by 97\% compared to incremental retraining. In stable environments, like the Google cluster trace, retraining costs decrease by 96.3% and 88.9%, respectively.

Place, publisher, year, edition, pages
IEEE, 2024
Series
Proceedings (IEEE International Conference on Cloud Computing Technology and Science. Online), ISSN 2380-8004, E-ISSN 2330-2186
Keywords
Autonomous Cloud, Self-Adaptation, Cloud Operational Data, Data Drift, Machine Learning, Retraining
National Category
Computer Systems Computer Sciences
Research subject
Computer Science; Computer Systems
Identifiers
urn:nbn:se:umu:diva-231945 (URN)10.1109/CloudCom62794.2024.00024 (DOI)2-s2.0-85217015975 (Scopus ID)979-8-3315-0758-9 (ISBN)
Conference
The 15th IEEE International Conference on Cloud Computing Technology and Science (CloudCom 2024), Khalifa University, Abu Dabi, United Arab Emirates, 9-11 Dec, 2024
Funder
Wallenberg AI, Autonomous Systems and Software Program (WASP)
Available from: 2024-12-21 Created: 2024-12-21 Last updated: 2025-03-26Bibliographically approved
Iyengar, R., Dong, Q., Nguyen, C. L., Pillai, P. & Satyanarayanan, M. (2024). Offload shaping for wearable cognitive assistance. Electronics, 13(20), Article ID 4083.
Open this publication in new window or tab >>Offload shaping for wearable cognitive assistance
Show others...
2024 (English)In: Electronics, E-ISSN 2079-9292, Vol. 13, no 20, article id 4083Article in journal (Refereed) Published
Abstract [en]

Edge computing has much lower elasticity than cloud computing because cloudlets have much smaller physical and electrical footprints than a data center. This hurts the scalability of applications that involve low-latency edge offload. We show how this problem can be addressed by leveraging the growing sophistication and compute capability of recent wearable devices. We investigate four Wearable Cognitive Assistance applications on three wearable devices, and show that the technique of offload shaping can significantly reduce network utilization and cloudlet load without compromising accuracy or performance. Our investigation considers the offload shaping strategies of mapping processes to different computing tiers, gating, and decluttering. We find that all three strategies offer a significant bandwidth savings compared to transmitting full camera images to a cloudlet. Two out of the three devices we test are capable of running all offload shaping strategies within a reasonable latency bound.

Place, publisher, year, edition, pages
MDPI, 2024
Keywords
computer vision, machine learning, offloading, cyber foraging, wearable computing, mobile computing, edge computing, IoT, cloudlet, augmented reality
National Category
Computer Systems Communication Systems
Research subject
Computer Science
Identifiers
urn:nbn:se:umu:diva-231083 (URN)10.3390/electronics13204083 (DOI)
Available from: 2024-10-21 Created: 2024-10-21 Last updated: 2024-10-22Bibliographically approved
Organisations
Identifiers
ORCID iD: ORCID iD iconorcid.org/0000-0002-9156-3364

Search in DiVA

Show all publications