DevOps Operations Performance Platform

PagerDuty Blog

Subscribe to PagerDuty Blog: eMailAlertsEmail Alerts
Get PagerDuty Blog: homepageHomepage mobileMobile rssRSS facebookFacebook twitterTwitter linkedinLinkedIn


Top Stories by PagerDuty Blog

What do you do after you’ve experienced an incident and performed a post-mortem (or, postmortem)? That may seem like a simple question, or even a non-question; after all, it’s easy to think of the post-mortem as the last step in handling an incident. But it’s not. In many ways, what you do with an incident post-mortem can be as important as the post-mortem itself. Below, I explain why and offer tips on what to do after the post-mortem is complete. Why Post-Mortems? Before we take a closer look at that question, however, we need to look at an even more basic question: What is the function of a post-mortem, and what should it contain? An incident post-mortem serves the following basic functions: It provides a record of the incident, its cause and related symptoms, its resolution, and its impact for future reference. This can be important for both a future understandin... (more)

Cutting Alert Fatigue | @DevOpsSummit @PagerDuty #AI #DevOps #Monitoring

This is a guest post by Ilan Rabinovitch, Direct of Product Management at Datadog. The convergence of rapid feature development, automation, continuous delivery, and the shifting makeup of modern tech stacks has pushed monitoring requirements to a potentially overwhelming scale. But while the systems you need to monitor are complex, your monitoring strategy doesn't have to be. At Datadog, we see the demand for monitoring at scale as a product of four changes: Increasing number of infrastructure components (microservices, instances, containers) Frequency of code and configuration... (more)

How We Compute Today | @DevOpsSummit #DevOps #AI #APM #Monitoring

How We Compute Today: What Modern Infrastructure Looks Like By Michael Churchman Today's infrastructure is not your grandparents' IT infrastructure, nor is it the infrastructure from a generation ago. The days of punch cards, vacuum tubes, ferrite core memory, floppies, and dial-up Internet are over. Today's infrastructure is also not the IT infrastructure that it five years ago, or even a year ago for that matter. Modern infrastructure is changing constantly, and all that we can do is provide a snapshot of infrastructure at the moment, along with a general picture of where it's go... (more)

What is PagerCon Anyways?

Inception Our first annual, internal conference, PagerCon, was held on November 18th, 2015 in San Francisco, followed by another event in Toronto on December 1st, 2015. So much of our company’s culture is focused around empathy, and PagerCon is at the cornerstone of how we show that internally. I (a member of the organizing committee) wanted to take the opportunity to reflect back on how far we have come as an organization to make this grassroots technical conference come to life and to provide inspiration for your own organization’s tech conference. The idea of PagerCon came ... (more)

To Build or To Buy?

The typical techie will face every challenge with a simple question: “Can I build the solution myself?” And often, the question is valid enough that it gets some significant consideration. So, should we build or buy? Evaluation of an incident management solution seems to now invite this question as well. But how do you know when you should build or buy your incident management platform? Why Build? Sometimes the desire to build your own solution is based on the simple fact that procurement of a commercial solution is out of your hands. For example, enterprise IT has the budget fo... (more)