DevOps Operations Performance Platform

PagerDuty Blog

Subscribe to PagerDuty Blog: eMailAlertsEmail Alerts
Get PagerDuty Blog: homepageHomepage mobileMobile rssRSS facebookFacebook twitterTwitter linkedinLinkedIn


Top Stories by PagerDuty Blog

How to Learn from Failure in DevOps By Eric Bruno DevOps failure is a touchy subject with some, because DevOps is typically perceived as a way to avoid failure. As a result, when you fail in a DevOps practice, the situation can seem almost hopeless. However, just as a fail-fast business approach, or the "fail and adjust sooner" methodology of Agile often proves, DevOps failures are actually a step in the right direction. They're the first step toward learning from failures and turning your DevOps practice into one that will lead you toward even greater success, sooner rather than later. DevOps has its roots in Agile, where shorter development cycles with frequent feedback loops guide you quickly, over a period of time, toward product delivery that is more aligned with customer needs. The point of the feedback loop is to learn from your actions through customer feedb... (more)

Journey to the Cloud, Better with Incident Management |@CloudExpo #Agile #CloudNative

Journey to the Cloud, Better with Incident Management By Vince Power Many IT organizations have come to learn that leveraging cloud infrastructure is not just unavoidable, it’s one of the most effective paths for IT organizations to become more responsive to business needs. Yet with the cloud comes new challenges, including minimizing downtime, decreasing the cost of operations, and preventing employee burnout to name a few. As companies migrate their processes and procedures to their new reality of a cloud-based infrastructure, an incident management solution can and should be a... (more)

Intern Spotlight: Yiyun Liang

My name is Yiyun and I’m currently a Computer Science student at the University of Waterloo. I’m a Software Engineer intern on the Core team here at PagerDuty. In this post, I would like to share some reflections on my experience over the past four months at PagerDuty. My team maintains and develops several core libraries and services that are used by different teams so that the engineering organization at PagerDuty can move forward rapidly. Internship Highlights A regular day of work involved getting into the office in the morning, grabbing a cup of tea or coffee, attending our... (more)

Modern Banking’s Hurdles in the Technology Race

With the rise in FinTech startups, increasingly stringent regulations, and changing customer behaviors and expectations, financial institutions need to adapt quickly in order to stay relevant and competitive in today’s market. We recently met with executives from over 15 global banks to learn more about the challenges they face when it comes to embracing digital transformation. And the challenges are many—the financial services industry must maintain compliance with regulatory requirements and ensure cybersecurity while being flexible enough to meet the demands of customers. And w... (more)

Using Postmortems to Understand Service Reliability

2017 was a year of many major outages—some took down the Internet for hours while others disrupted business workflows and communication at companies large and small. Any way you slice it, these outages likely resulted in a lot of time devoted to postmortems. I want to reflect a bit on why we write postmortems and suggest some things for authors to think about when writing them. I think there’s room for all of us to improve when it comes to gathering information to better plan pro-active fixes before services catch fire. Why Do We Conduct Postmortems? Our incident response trainin... (more)