Risk-aware resilient cloud orchestration under correlated faults using adaptive dual-regime search with self-healing control

Close-up view of server rack network infrastructure with blue and yellow ethernet cables connected to network equipment and dark server chassis, with blurred server racks and indicator lights visible in the background.
Image Credit: Photo by Brett Sayles on Pexels (SourceLicense)

AI Summary of Peer-Reviewed Research

This page presents an AI-generated summary of a published research paper. The original authors did not write or review this article. See full disclosure ↓

Engineering Science and Technology an International Journal·2026-04-02·Peer-reviewed·View original paper ↗·Follow this topic (RSS)
Publication Signals show what we were able to verify about where this research was published.STRONGWe verified multiple publication signals for this source, including independently confirmed credentials. Publication Signals reflect the source’s verifiable credentials, not the quality of the research.
  • ✔ Peer-reviewed source
  • ✔ Published in indexed journal
  • ✔ No retraction or integrity flags

Key findings from this study

This research indicates that:

  • Conditional value-at-risk penalisation in multi-objective orchestration successfully balances fault tolerance and latency constraints under correlated hardware failures.
  • Event-triggered self-healing policies with bounded-switching guarantees reduce repair cost volatility without excessive controller switching.
  • Statistically validated performance gains emerge across five stress scenarios combining correlated faults with workload surges, relative to established cloud orchestration baselines.

Overview

The paper presents a fuzzy hybrid reptile–mamba optimisation framework addressing resource allocation in cloud platforms subject to correlated hardware faults and competing performance demands. The framework formulates orchestration as a multi-objective risk-constrained optimisation problem integrating fault tolerance, response latency, and recovery overhead. A novel black mamba operator, derived from predatory strike kinematics, combines with reptile search under interval type-2 fuzzy control to balance exploration and intensification.

Methods and approach

The framework employs conditional value-at-risk penalisation to model risk across multiple competing objectives. A black mamba operator provides local intensification, fused with reptile search for exploration under interval type-2 fuzzy mode selection. An event-triggered self-healing policy incorporates a local sufficient-decrease guarantee for repair costs and hysteresis-based bounded switching for controller selection. Experimental validation used CloudSim Plus simulation with Google Cluster Trace data across 30 independent runs per scenario.

Results

Experiments across five stress scenarios—including correlated faults and bursty workloads—demonstrated statistically significant improvements in the proposed framework relative to baselines. Friedman and Wilcoxon nonparametric tests confirmed performance gains across resource allocation, latency minimisation, and recovery overhead reduction. The framework maintained feasibility under combined fault and workload stresses that induced constraint violations in comparative methods.

Implications

The risk-aware formulation via conditional value-at-risk extends cloud orchestration beyond nominal conditions, addressing practical scenarios involving simultaneous hardware failures and load surges. The integration of self-healing policies with formal guarantees on repair cost decrease and controller stability reduces manual intervention in degraded operational states. The approach establishes a pathway for incorporating bio-inspired operators with rigorous control-theoretic guarantees in cloud resource management.

Scope and limitations

This summary is based on the study abstract and available metadata. It does not include a full analysis of the complete paper, supplementary materials, or underlying datasets unless explicitly stated. Findings should be interpreted in the context of the original publication.

Disclosure

  • Research title: Risk-aware resilient cloud orchestration under correlated faults using adaptive dual-regime search with self-healing control
  • Authors: Michaelraj Kingston Roberts, Sampath Kumar Shanmugam, Sarah M. Alhammad, Doaa Sami Khafaga
  • Institutions: Princess Nourah bint Abdulrahman University, Sri Eshwar College of Engineering
  • Publication date: 2026-04-02
  • DOI: https://doi.org/10.1016/j.jestch.2026.102365
  • OpenAlex record: View
  • Image credit: Photo by Brett Sayles on Pexels (SourceLicense)
  • Disclosure: This post was generated by Claude (Anthropic). The original authors did not write or review this post.

Get the weekly research newsletter

Stay current with peer-reviewed research without reading academic papers — one filtered digest, every Friday.

More posts