Repository navigation

#

self-healing

CloudNativePG is a comprehensive platform designed to seamlessly manage PostgreSQL databases within Kubernetes environments, covering the entire operational lifecycle from initial deployment to ongoing maintenance

Go
6613
1 小时前
linkedin/cruise-control

Cruise-control is the first of its kind to fully automate the dynamic workload rebalance and self-healing of a Kafka cluster. It provides great value to Kafka users by simplifying the operation of Kafka clusters.

Java
2901
21 天前

Infrastructure monitoring framework turning DevOps runbooks into automated actions

Python
394
7 年前

A collection of cluster reliability tools for Kubernetes

Go
129
6 小时前
C++
87
7 小时前

A collection of nodes for improve Node-RED dependability by enabling self-healing capabilities.

JavaScript
39
14 天前

An Opined Self Healing System

Python
25
2 年前

An experimental cluster brings Prometheus and OpenStack together

Go
19
2 年前

Side project, customizable trading bot.

Go
18
4 年前

The DeMoD Communications Framework (DCF) is an open-source evolution of the DeMoD Secure Protocol (DSP), designed as a shareware version to promote community-driven development

C
16
几秒前

A Docker Healer - Auto Restarting Unhealthy Containers

Python
12
9 年前

Java application for orchestrating AEM infrastructure created using aem-aws-stack-builder

Java
12
1 年前

Aegis is an LLM-powered AI cluster autonomous operations system, focused on intelligent capabilities such as Fault Diagnosis, Self-healing, Root Cause Analysis, Cluster Inspection, and Alert Optimization.

Go
9
6 天前

A Terraform module to quickly deploy a secure, persistent, highly available, self healing, efficient, cost effective and self managed single-node or multi-node MongoDB NoSQL document database cluster on AWS ECS cluster with monitoring and alerting enabled.

HCL
9
9 个月前

Turn photos into forcefield particle animations in real-time. js + canvas, open source

JavaScript
9
9 个月前

A monitoring and healing framework. It uses Pester tests (TDD) to determine the state of an IT system entity. Then, if the entity is in a faulted state HealOps will try to repair it. All along HealOps reports metrics to a backend report system and HealOps status is sent to stakeholders. In order to e.g. trigger alarms and get on-call personnel on an issue that could not be repaired.

PowerShell
7
6 年前

OpenFero is intended as an event-triggered job scheduler for code agnostic recovery jobs.

Go
7
4 小时前