This week takes me to Portland, OR to speak at PDX DevOps (tonight!). If you’re in PDX, be sure to come out. I’d love to also meet up for coffee with some Portland folks while I’m in town (today and tomorrow). If you want to talk app/infra monitoring or developer marketing, just reply and we’ll get together.
This issue is sponsored by:
450PB of Storage Capacity!!!
Want to help us protect over a quarter of a million computers, provide network connectivity via half a million access points, and remotely monitor and manage 2.5 million endpoints? You’ve probably never heard of us, but we’d like to hear from you! Click here to learn about opportunities at Datto!
From The Community
This is a great multi-part series on alert fatigue with tons of awesome suggestions.
- Part 1: avoidance and course correction
- Part 2: alert reduction with Sensu filters & token substitution
- Part 3: automating triage & remediation with check hooks & handlers
- Part 4: alert consolidation
The second worst thing behind “denies there’s an outage” is “admits to an outage but says nothing.” Incident communication is hard, and this article definitely helps.
My talk from DevOpsDays Kansas City is already up in blazingly fast time! I’ll be giving a longer version of this talk at the Open Source Monitoring Conference in Nuremberg, Germany in a couple weeks, too.
As of the recent Grafana 5.3 release, TimescaleDB now has a stronger integration with it, including much better query editor support.
The not-so-secret secret of many website performance apps is that they’re really just curl underneath. (though, Lighthouse is a big thing now, so maybe that’ll change in the future)
It’s really exactly as the title says. Looks like a promising project; I’m super interested to see where this going to head in the future.
This looks like a great dashboard framework from the folks at Netflix, but sadly, it doesn’t appear it’s open-sourced (yet). Still, a great read for those working on their own dashboard systems.
Metrics are useful for more than just for app performance: having such data allows you to understand your own team better, improve cycle time, and so much more. The folks at Indeed understood this and built a neat tool to help them collect and analyze non-app-performance metrics.
For those of you with Java apps and logging woes, fret no more.
Instrumenting C++ is probably the last thing you might think about when waking up in the morning, but this talk is pretty good–and in-depth.
This issue is sponsored by:
Modern cloud application architectures require a modern monitoring and analytics approach. Find out why SaaS leaders like Workday, Intuit, Box, and Reddit chose hosted metric analytics for real-time insights across all their engineering teams.
I’m speaking at PDX DevOps as a last-minute addition to the schedule. Come on out and hear some scary stories about the time I took down half of a 5000-person campus with a single missing keyword, the cutover-that-just-wouldn’t-end, and how my team did some really great engineering on a really dumb problem.
This event is co-located with KubeCon and it’s got one hell of a lineup.
I’ve opened a job board for monitoring and observability jobs. If you’ve got some monitoring/observability roles you’re trying to fill, how about heading on over there? I’ll be including them here in the newsletter as well.
See you next week!
– Mike (@mike_julian) Monitoring Weekly Editor