Did you know I have a podcast too? Check it out: Real World Devops
This issue is sponsored by:
A 5-part blog series highlighting how DevOps and IT teams are specifically reducing MTTA and MTTR over time through improved alerting, notifications, incident response, escalation, and post-incident reviews based on your monitoring data.
Latest Articles on monitoring.love
I spoke with Cory Watson on Real World DevOps recently. Cory is previously the lead on Observability at both Twitter and Stripe, and now a Technical Director at SignalFx, and we had a fantastic chat about the transition from being a customer to being a vendor, as well as a whole bunch of monitoring stuff.
From The Community
If you’re new to Prometheus’s query language, PromQL, this is a super handy guide.
The folks at Datadog seem to be on a logging kick lately. Got C# apps laying around? This article does a great job at introducing you to effective C# logging practices.
From the article: “Over the years, Booking.com’s Graphite grew to consist of hundreds of servers.” … “It ingests more than 10 million unique points per second” Holy crap.
It’s exactly as it sounds.
“Marie Cosgrove-Davies covers a user-focused approach to SLOs and some common pitfalls that teams encounter when they’re first trying to adopt SLO methods.”
When you’re running at Netflix scale, questions like, “Where does data come from? Where does it go? What changes are made to it?” is…tricky. This isn’t so much monitoring-related as it is data engineering–which is a much larger topic that many of you working on monitoring systems are probably thinking about too.
The folks at SemaText have written up a full review of the recent AWS Open Distro for Elasticsearch.
I think my favorite part of this is where they named their tool Chaos Panda. :D
Long live statsd. There, I said it. As much as I see statsd haters, it’s still improving monitoring for a lot of people. statsd is a fantastic starting point for improving monitoring, even today.
Seems the folks at Elastic have been busy, as there’s several new products that just dropped and they all look super useful: Elastic Logs, which provides a “a console-like experience” for logs, Elastic Infrastructure, which provides “a turnkey solution for monitoring infrastructure metrics”, and Elastic Uptime Monitoring, which handles basic availability testing a la Pingdom. All of them are available for free inside the Elastic Stack.
Because the security folks among you don’t get enough love in the monitoring world, here you go.
This issue is sponsored by:
The folks at Scalyr recently recorded a video about monitoring containers at scale. The tie between logs, containers, and orchestration is only growing in importance, and [Scalyr is here to help.(https://vimeo.com/scalyrlogmanagement/devscaas) My thanks to Scalyr for their support in making Monitoring Weekly possible.
It’s that time of year again: Sensu Summit’s CFP is open. Perfect excuse to go to Portland in September. ;)
For the LogicMonitor fans among you, the LogicMonitor Lever Up conference is this June. I’ll be speaking, so come hang out/heckle!
See you next week!
– Mike (@mike_julian) Monitoring Weekly Editor