I don’t know about you all, but I love that there’s something different and unique to share with you every week. A great collection of practical advice and aspirational projects from the community, plus an explanation of the logging changes included with Kubernetes 1.24. Enjoy! 😸💾🔎
This issue is sponsored by:
Promscale is an observability backend built on the rock-solid foundation of PostgreSQL and TimescaleDB. With the new support for OpenTelemetry traces, you now have full SQL superpowers to solve even the most complex issues in your distributed systems. Read more about it here.
Articles & News on monitoring.love
Come hang out with all your fellow Monitoring Weekly readers. I mean, I’m also there, but I’m sure everyone else is way cooler.
From The Community
An excellent follow-up to the author’s previous post, this article looks at how to leverage OpenTelemetry to detect and diagnose database performance issues.
A guide for determining the appropriate golden signals for your respective service and workload types.
How to build your own service discovery mechanism for Thanos with Consul and Prometheus’ built-in primitives.
Kubernetes 1.24 includes a number of enhancements to structured and contextual logging. Great to see the scoping improvements and the benefits this will have in scenarios like unit tests.
Some very pragmatic lessons learned from a small team about a customer-impacting outage. It’s easy to pick at their obvious mistakes, but who of us hasn’t gone through some of these in production?
Although this article is focused on continuous delivery, it’s a topic that very relevant to my current interests and has me thinking there should be more collaboration between product, platform, and observability teams in terms of plugging into our existing monitoring infrastructure.
A concise primary on observability, how to approach it for different types of systems, and a list of the most useful open source tools in the space.
Get your incidents under control and focus on fixing 🔥
Being on-call at 2am when everything is on fire can be stressful! incident.io makes responding to incidents quick and easy by automating your entire incident management process in Slack. Sign up for free. (SPONSORED)
How the Domain-Oriented Observability pattern can result in more efficient code and better accountability among software teams.
The title really says it all. Check out the related GitHub project linked in the Tools section below for the related source code.
A deep dive on NGINX logs and how to configure them effectively.
Timescale explains how they monitor their own services (“who watches the watcher”) and how you can apply these techniques to your own Promscale installation.
“A GitOps Way To Manage Grafana Data Sources At Scale”
Monitorama is returning to Portland, OR this summer. It looks like a return to form for one of our favorite events (ok, we might be biased). Hope to see you there!
See you next week!
– Jason (@obfuscurity) Monitoring Weekly Editor