Welcome To The World of DevOps. An ongoing & curated collection of awesome software, libraries, learning tutorials, tools and resources and cool stuff about DevOps.
-
Updated
Feb 6, 2024 - Python
Welcome To The World of DevOps. An ongoing & curated collection of awesome software, libraries, learning tutorials, tools and resources and cool stuff about DevOps.
An ongoing & curated collection of awesome SRE software and tools, libraries and frameworks, engineering books and blogs, philosophical principles, technical guidelines, practical tools about the field of Site Reliablity Engineering (SRE)
Write Bash executable runbooks in Markdown.
Automated incident detection and remediation using Azure SRE Agent and GitHub Copilot, from failure spike to merged fix PR with minimal human intervention.
Autonomous multi-agent SRE platform built on Azure using Microsoft Agent Framework and Azure OpenAI to detect, diagnose, and self-heal CI/CD incidents.
AI-powered incident triage system that converts raw logs into structured tickets and dashboards using a local LLM with built-in PII redaction and root cause analysis.
🌐 Discover top resources for Site Reliability Engineering, focusing on open-source tools and accessible knowledge to build scalable, reliable systems.
Add a description, image, and links to the sre-automation topic page so that developers can more easily learn about it.
To associate your repository with the sre-automation topic, visit your repo's landing page and select "manage topics."