Blog

Incident management insights, guides, and product updates from Rootly

Search...
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.
Celebrating Our Nine New G2 Awards

Celebrating Our Nine New G2 Awards

We’re proud to share that we've been recognized as a High Performer and Enterprise Leader in Incident Management for the sixth consecutive quarter in G2 Summer 2023 Report! In total, Rootly received nine G2 awards in the Summer Report.

JJ Tang

JJ Tang

September 5, 2023
3 min read
We Need to Talk About the Hero Pattern Among SREs

We Need to Talk About the Hero Pattern Among SREs

Hans Chung refers to the tendency for SREs to independently zoom in on one task or problem at a time, and the consequences that come with it, as the “solo hero pattern”. In this post, he explores some of the reasons it happens, and what SRE leaders can do about it.

Hans Chung

Hans Chung

August 22, 2023
6 min read
But It’s Not Our Fault! When Third-party Incidents Affect Your Service

But It’s Not Our Fault! When Third-party Incidents Affect Your Service

Between cloud service providers, payment processors, content delivery networks, and more, chances are you rely on external systems to keep your product working. So what do you do when someone else's incident becomes your problem? It’s probably not realistic to completely eliminate third-party dependencies, but there are things you can do to enhance your resilience against third-party failures and maintain trust with your customers when outages out of your control impact them.

Ashley Sawatsky

Ashley Sawatsky

August 14, 2023
5 min read
Rootly Raises $12 Million from Renegade Partners, Google Gradient Ventures, & XYZ Ventures

Rootly Raises $12 Million from Renegade Partners, Google Gradient Ventures, & XYZ Ventures

Rootly has already helped companies manage 60,000+ incidents and we are just getting started! We are on a mission to make reliability every company’s superpower.

JJ Tang

JJ Tang

August 10, 2023
4 min read
Kubernetes Incident Management Best Practices

Kubernetes Incident Management Best Practices

In this post, Rajesh Tilwani (Co-Founder of Humalect) covers a variety of strategies for preventing and managing incidents with Kubernetes.

Rajesh Tilwani

Rajesh Tilwani

August 3, 2023
15 min read
Improve Visibility and Capture More Data with Triage Incidents

Improve Visibility and Capture More Data with Triage Incidents

As new incidents emerge, there are often many unknowns about the size, severity, and cause of the problem. Sometimes it’s not clear if the problem is an incident at all. That’s where introducing a triage stage to your incident management process can help. In this post, we’ll look at the benefits of adding a triage layer to your incident management, and how Rootly’s Triage feature allows you to seamlessly transition from triage to real incident (or false alarm).

Ashley Sawatsky

Ashley Sawatsky

July 12, 2023
5 min read
Lessons from the CircleCI Security Incident

Lessons from the CircleCI Security Incident

What SREs can learn from the CircleCI security incident of January 2023.

Quentin Rousseau

Quentin Rousseau

January 9, 2023
4 min read
How Many SREs Does Your Company Need? Here’s How to Decide

How Many SREs Does Your Company Need? Here’s How to Decide

Tips for deciding how many SREs your company should hire.

JJ Tang

JJ Tang

October 9, 2022
5 min read
The Rogers Outage of 2022: 3 Crucial Takeaways for SREs

The Rogers Outage of 2022: 3 Crucial Takeaways for SREs

Millions of Canadians offline. For SREs, the Rogers outage is a lesson in the importance of testing updates, building redundant infrastructure and having a crisis communications plan.

JP Cheung

JP Cheung

August 5, 2022
5 min read