incident.io is a Slack-native incident response and management tool that scales as your team grows. Hypergrowth companies use incident.io to automate incident processes, focus on fixing the issue, and learn from incident insights to improve site reliability and fix vulnerabilities. Learn more and see how it works on incident.io.
No Thanos.io videos yet. You could help us improve this page by suggesting one.
incident.io might be a bit more popular than Thanos.io. We know about 31 links to it since March 2021 and only 29 links to Thanos.io. We are tracking product recommendations and mentions on various public social media platforms and blogs. They can help you identify which product is more popular and what people think of it.
There are SaaS products out there that can help with data collection like incident.io or firehydrant.io to more quickly construct a timeline. Source: about 1 year ago
My new favourite is https://incident.io. Great UI, great product, especially if you also need an incident management tool. Source: about 1 year ago
We did a pretty detailed write-up about a significant incident we had a few months back at incident.io: https://incident.io/blog/intermittent-downtime. Source: about 1 year ago
Co-founder of incident.io here, so I'll avoid throwing my thoughts around for obvious reasons. Source: about 1 year ago
I work at a company that offers a platform for this called https://incident.io/. Source: over 1 year ago
Monitoring = netdata on each RPi https://www.netdata.cloud/ binded to the vpn interface being scraped into a prometeus thaons https://thanos.io/ setup with grafana to give management the Green all is good screens (very important). Source: 5 months ago
Sounds like you want something like Thanos. Source: 11 months ago
Yes, but also no. The Prometheus ecosystem already has two FOSS time-series databases that are complementary to Prometheus itself. Thanos and Mimir. Not to mention M3db, developed at Uber, and Cortex, then ancestor of Mimir. There's a bunch of others I won't mention as it would take too long. Source: 11 months ago
Long term storage all depends on your needs and sophistication. I use Thanos for our system since it has an extremely flexible scaling system. But there is also Grafana Mimir. They're both similar in that they use Prometheus TSDB format as part of the underlying storage. One nice Thanos advantage is that it does do downsampling in addition to being able to store raw metric data for a long time. It will auto-select... Source: about 1 year ago
You can aggregate all your clusters Prometheus metrics together with a wonderful tool called Thanos. This will allow you to use just a single Grafana instance against Thanos and using a label select which cluster you wish to see metrics from. The downside of this, is that none of the Grafana dashboards from the internet will work as-is. You'll need to customize all of them for Thanos support. The other... Source: about 1 year ago
FireHydrant.io - FireHydrant helps teams organize and remedy incidents quickly when their system experience disruptions.
Prometheus - An open-source systems monitoring and alerting toolkit.
Rootly - Rootly helps build a consistent incident response process by automating manual admin work like creating incident channels, Jira tickets, Zoom rooms, and generating postmortem timelines, all from within Slack.
OpenCensus - Application and Data, Monitoring, and Monitoring Tools
PagerDuty - Cloud based monitoring service
Cortex Project - Horizontally scalable, highly available, multi-tenant, long term Prometheus.