Based on our records, Google Site Reliability Engineering seems to be a lot more popular than Apache Karaf. While we know about 86 links to Google Site Reliability Engineering, we've tracked only 1 mention of Apache Karaf. We track product recommendations and mentions on various public social media platforms and blogs. These mentions can help you identify which product is more popular and what people think of it.
Apache Karaf with OSGi works pretty nicely using annotation-based dependency injection with Declarative Services, removing the need to mess with those hopefully archaic XML blueprints. Too bad it's not as trendy as Spring, so many of the tutorials can be a bit dated and hard to find. Karaf also supports many other frameworks and programming models, and there's even Red Hat supported... Source: about 4 years ago
In 2025, observability is no longer just for SREs or DevOps—it’s a cross-functional necessity. Whether you’re debugging a production outage, tracking performance regressions, or optimizing user experience, your observability tools should provide clarity, not clutter. - Source: dev.to / 24 days ago
Same difference... Read the book https://sre.google/. - Source: Hacker News / 9 months ago
In my view it is having a dedicated team focusing their full mental bandwidth on pro-actively understanding and managing robustness of the system. In Pure DevOps, it seems to me developers often don't have the full picture of the system, and not enough bandwidth to foresee complex interactions from their changes. These are from my experiences spending one year as a developer in somewhat large a greenfield... - Source: Hacker News / over 1 year ago
Site Reliability Engineering, introduced by Google, extends the principles of software engineering to operations. Unlike DevOps, SRE places a stronger emphasis on reliability, availability, and scalability. SRE teams are tasked with maintaining the health and performance of systems by applying engineering practices to operations. The ultimate objective is to achieve a balance between service reliability and... Source: over 1 year ago
Define SLOs for availability and latency. Google's SRE book is good reading for this. Source: almost 2 years ago
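The advice above to define SLOs for availability and latency can be sketched in a few lines of Python. This is only an illustrative sketch, not a method taken from Google's SRE book: the `Slo` class, the helper names, the 99.9% target, and the 300 ms latency threshold are all assumptions chosen for the example.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class Slo:
    """A hypothetical SLO: a name plus a target ratio of good events."""
    name: str
    target: float  # e.g. 0.999 means 99.9% of events must be good


def sli(good: int, total: int) -> float:
    """Service level indicator: the fraction of events that were good."""
    if total == 0:
        return 1.0  # assumption: no traffic counts as meeting the objective
    return good / total


def latency_good_count(latencies_ms: Sequence[float], threshold_ms: float) -> int:
    """Count requests served at or under the latency threshold."""
    return sum(1 for value in latencies_ms if value <= threshold_ms)


def error_budget_remaining(slo: Slo, good: int, total: int) -> float:
    """Fraction of the error budget left; a negative value means it is overspent."""
    allowed_bad = (1.0 - slo.target) * total
    actual_bad = total - good
    if allowed_bad == 0:
        return 1.0 if actual_bad == 0 else 0.0
    return 1.0 - actual_bad / allowed_bad


if __name__ == "__main__":
    availability = Slo(name="availability", target=0.999)
    # Assumed sample window: 100,000 requests, 50 of which failed.
    print(sli(99_950, 100_000))
    print(error_budget_remaining(availability, 99_950, 100_000))

    # Assumed latency samples in milliseconds against a 300 ms threshold.
    samples = [120.0, 250.0, 310.0, 90.0]
    print(sli(latency_good_count(samples, 300.0), len(samples)))
```

Expressing both objectives as a ratio of good events to total events keeps the error-budget arithmetic the same for availability and latency; only the definition of a "good" event changes.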
Docker - Docker is an open platform that enables developers and system administrators to create distributed applications.
OpenTelemetry - An observability framework for cloud-native software.
Google App Engine - A powerful platform to build web and mobile apps that scale automatically.
Ganeti - Ganeti is a cluster management tool built on top of existing virtualization technologies.
Amazon S3 - Amazon S3 is an object storage service where users can store their business data on a safe, cloud-based platform. Amazon S3 operates in 54 availability zones within 18 geographic regions and 1 local region.
Apache Helix - A cluster management framework for partitioned and replicated distributed resources