Repository navigation

#

sre-team

SLOs, Error windows and alerts are complicated. Here an attempt to make it easy

Go
132
7 个月前

A simple framework for sharing Bash profiles, reusable shell libraries, and commands across hosts and teams. Contains builtin libraries for common functions like logging, error handling, and assertions. Built with SRE / DevOps teams in mind.

Shell
110
2 年前

Welcome To The World of DevOps. An ongoing & curated collection of awesome software, libraries, learning tutorials, tools and resources and cool stuff about DevOps.

Python
90
2 年前

An ongoing & curated collection of awesome SRE software and tools, libraries and frameworks, engineering books and blogs, philosophical principles, technical guidelines, practical tools about the field of Site Reliablity Engineering (SRE)

26
4 年前

Kubernetes operator that manages access and secrets for MongoDB clusters

Go
6
4 个月前

Circleci orb for the Prometheus tools cli. Useful for CI/CD of the Prometheus configuration.

4
3 年前

A ChatOps SRE toil elimination tool

Go
4
2 年前

Cloud-Army-Secret-Injector read secrets from GCP Secret Manager and automatically injects the values as environment variables to the application subprocess.

Go
4
2 年前

InfraGenius is a comprehensive AI-powered platform designed specifically for DevOps, SRE, Cloud, and Platform Engineering professionals. It provides industry-level expertise through advanced AI models, optimized for infrastructure operations, reliability engineering, and cloud architecture.

Python
2
6 天前

In the RahBia Live Coding Series, we’ll walk through a complete DevOps journey from start to finish. Together, we'll cover every step—from initial server configuration to final production-ready service deployment that mr AhmadRafiee is hosting it

Python
2
3 个月前

Site Reliability Engineer Space

HCL
1
5 个月前

A user-friendly tool for real-time tracking of infrastructure release versions and environment health status. It checks the health of environments in real-time from the PostgreSQL database.

TypeScript
1
9 个月前

Config files for my GitHub profile.

0
3 年前

DevProbe is a progressive web application that provides a platform for Site Reliability Engineers to monitor their websites. The app is built with , IONIC, Angular and Firebase.

TypeScript
0
10 个月前

This project focuses on how to build a scalable and highly available cloud infrastructure on Microsoft Azure using an Azure Load Balancer and Virtual Machine Scale Sets (VMSS). It distributes traffic across multiple VMs to ensure high availability and performance.

PowerShell
0
1 个月前

An all-in-one platform for alert management that integrates seamlessly with your observability solutions, regardless of the underlying technology.

TypeScript
0
3 个月前