Linux, Jenkins, AWS, SRE, Prometheus, Docker, Python, Ansible, Git, Kubernetes, Terraform, OpenStack, SQL, NoSQL, Azure, GCP, DNS, Elastic, Network, Virtualization. DevOps Interview Questions
Mar 26, 2025 - Python
Site reliability engineering (SRE) is a set of principles and practices that incorporates aspects of software engineering and applies them to infrastructure and operations problems. The main goals are to create scalable and highly reliable software systems. Site reliability engineering is closely related to DevOps, a set of practices that combine software development and IT operations, and SRE has also been described as a specific implementation of DevOps.
A curated list of amazingly awesome open-source sysadmin resources.
DevOps Roadmap for 2025, with learning resources
A curated list of Site Reliability and Production Engineering resources.
A curated collection of publicly available resources on how technology and tech-savvy organizations around the world practice Site Reliability Engineering (SRE)
Terraform Pull Request Automation
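⭐ [Open-source book] An in-depth explanation of kernel networking, Kubernetes, service mesh, containers, and other cloud-native technologies. A practice-tested guide to DevOps and SRE. If you find an error, please open an issue.
At LinkedIn, we are using this curriculum for onboarding our entry-level talent into the SRE role.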
Site Reliability Engineer Interview Preparation Guide
Compilation of public failure/horror stories related to Kubernetes
StackStorm (aka "IFTTT for Ops") is event-driven automation for auto-remediation, incident responses, troubleshooting, deployments, and more for DevOps and SREs. Includes a rules engine, workflows, 160 integration packs with 6000+ actions, and ChatOps.
Enable Self-Service Operations: Give specific users access to your existing tools, services, and scripts
DevOps Tutorials
Kubernetes prompt info for bash and zsh
CDN Up and Running - Building a CDN from Scratch to Learn about CDN, Nginx, Lua, Prometheus, Grafana, Load balancing, and Containers.
A curated list of awesome DevOps platforms, tools, practices and resources
A checklist for anyone practicing Site Reliability Engineering
Learning Shell, Python, Golang, System, Network
Chaos Engineering Toolkit & Orchestration for Developers