KQL for Adults: Writing Queries That Don't Lie to You

March 17, 2026 in Azure, Site Reliability Engineering | Reading time: 8 minutes

Most KQL running in production is subtly wrong: wrong operators, unscoped subqueries, and alert rules that silently miss events because of ingestion latency. Here’s how to write queries you can actually defend.

Why Your Application Gateway Logs Don't Tell the Whole Story (Until You Correlate Them)

March 10, 2026 in Azure, Site Reliability Engineering | Reading time: 8 minutes

Access logs, firewall logs, backend health, and metrics each tell a partial truth about what Application Gateway is doing. Here’s how they mislead you in isolation, and the KQL that fixes that.

Your Alerts Are a Product. They're Just a Bad One.

February 24, 2026 in Azure, Systems Design, Site Reliability Engineering | Reading time: 14 minutes

Alert fatigue isn’t a people problem; it’s a product design failure. Your on-call engineers are the users. Here’s why noisy alerts are biologically inevitable under bad design, and what treating alerting as a product actually looks like.

The Hidden Cost of 'Just Turn On Logging' in Azure

January 27, 2026 in Azure, Cloud Engineering, FinOps | Reading time: 23 minutes

Your team enabled logging everywhere, a responsible move. Then the Azure bill arrived. This post traces exactly why Log Analytics costs spiral without warning, what the AzureDiagnostics table is quietly doing to your budget, and how resource-specific tables plus DCR transformations give you back control.

Ebby Peter

Blueprint. Build. Ship. Repeat!

Cloud Architect | Cloud Consultant | Cloud Enthusiast

Wellington, New Zealand