What is our primary use case?
We mostly use it for our on-call engineers, for schedules, alerting, and critical alerts. And, of course, we use it for the management of an issue, so that people acknowledge the alerts, reassign them, etc.
How has it helped my organization?
PagerDuty enables us to implement well-known techniques for on-call. It helps with managing schedules, being able to escalate issues, and being able to add people to an instance. It enables us to have multiple levels of people on-call, in a simple way. Otherwise, managing those schedules, offline, would be quite hard. Having a tool like PagerDuty to support us from that standpoint is really key, so that we can perform well when we have issues.
The fact that it doesn't have maintenance windows is definitely something that is really key. It was a key element in our choice. Having maintenance windows for a monitoring system is just not acceptable. The impact is that we really trust PagerDuty. There were situations where a specific alert didn't reach PagerDuty. We looked into finding the reason elsewhere, and not in PagerDuty, because we really trust that PagerDuty, when it catches an alert, will actually deliver it.
What is most valuable?
The most valuable feature is definitely the flexibility of the schedule.
The mobile app is quite also good for what we do: for receiving alerts, acknowledging, assigning, adding new responders. It has rich features for our needs.
The alerting functionality is quite good. It is one of the key parts of our on-call process, so that people can react when things are not working as expected. We trust it a lot. It definitely works well. We have never missed alerts or had situations in which we felt that we could not trust the alerting functionality of PagerDuty. Alerts in our environment could be caused by a specific part of the platform not running right. If a database goes down, for example, we will get an alert. There could also be errors in an application or a particular zone of infrastructure is down, or if service-level objectives are being affected for some reason. In all these cases, we will get alerts.
What needs improvement?
The solution's analytics are okay. I don't think the features, at this point, give you a lot of insights. We have actually been trying to get insights from it but it hasn't really given us a lot of extra points to explore. We were looking at the number of alerts to see where many of the alerts were coming from. We never managed to get many insights on this. The solution has not enabled us to go beyond responding to incidents and to start predicting or preventing them.
It would help if they simplified the way you try to get insights or information about instances and to improve your situation regarding the groupings of alerts.
Even if there are a lot of different functionalities, a lot of prediction abilities in place, it's not really clear what the best practices are for using PagerDuty to get the best out of the platform. The concepts are quite complex, at times, for people to understand. They need a more straightforward way to use the product and get the best out of it. With all the concepts of escalation, services, and the schedule, it gets confusing.
For how long have I used the solution?
I've been using PagerDuty for one year. I joined this company a year ago and it was already using it.
What do I think about the stability of the solution?
The stability is very good. If I remember correctly, we have only had one incident with PagerDuty over the whole period that I have been using it. I also used PagerDuty at my previous company. My experience is that it is a very stable platform.
There are also no bugs in the apps, in my experience. It is a very high-quality product.
What do I think about the scalability of the solution?
It's a SaaS.
But in my previous company and also here, I see the cost of the platform as an impediment to scaling this from a small team to the whole company. But for the platform itself, we don't have to manage anything in terms of scaling.
We have about 100 people working with PagerDuty — mostly engineers. It is used across all the different engineering teams. The adoption rate is 100 percent.
How are customer service and technical support?
We have used their technical support once. It was okay. I don't think we managed to get to the bottom of the issue we were having, but it ended up that it was not that critical at that point, so we just let it go.
Which solution did I use previously and why did I switch?
In my previous company we used a home-grown system before PagerDuty.
How was the initial setup?
I was involved in the initial setup of PagerDuty in my previous company. Depending on how big your deployment — how many services you want, how many teams — it can get complex. But in general, it's simple.
What is complex is to go from using PagerDuty as an instance manager and as a pager, to getting to the next level and the advantages that PagerDuty sells a lot: intelligence, analytics; the extra functionality that they add. That functionality comes with a high price. For me, that is the complex part. It's not the start, rather the hard part is to really get the most out of what PagerDuty offers.
What was our ROI?
The return of investment is from how quickly we engage our teams, meaning that we don't waste time and money. It has returned the price that we pay, but the price is high.
What's my experience with pricing, setup cost, and licensing?
A con, a failure, is the cost which is quite high. But if you want to get a full-featured application and you have a big team...
Some important features are closed to a group because of the licensing. For example, one of the features that I always wanted to use but never managed to is the postmortem part of PagerDuty. To me, it is important that everyone in the organization be able to read any postmortem that is produced. PagerDuty only allows you to share it with people who have accounts. It doesn't have different levels of accounts. There is only a complete account and you have to pay for it.
You really need to understand what feature functionality you want from the solution and then see what the cost-benefit is for what you want to achieve. We tried the stakeholder licenses, but we ended up never using them. They don't have a lot of flexibility on that. It's almost like one type of licensing or nothing.
Which other solutions did I evaluate?
We have done some investigation internally, but nothing really serious. We looked at Opsgenie and VictorOps. We didn't see a reason to change. We trust PagerDuty.
The main pro, of course, of PagerDuty is that it is a full-featured application, with high trust. Many companies use it, big companies that trust it. This is definitely a big plus.
One of the reasons we stayed with PagerDuty was, of course, trust, but we also see the value of the added functionality that PagerDuty gives us. But if you don't get to use those — the things that are the differentiator for PagerDuty — then the cost starts to be quite high. You need to be completely in to be able to get the full advantage of PagerDuty. Competitors will give you a little bit less functionality, fewer luxury functions, but the cost is more accessible for the feature set that you really use.
What other advice do I have?
From the get-go, approach it in a way where you can get the most out of it. Really engage with PagerDuty from the beginning to support you on that journey. Otherwise, you will only be able to use the core functionalities that you can also get from the competition. Focus on the full platform and make the decisions that will simplify that. Don't do quick wins at the beginning that will not help you to take full advantage of the platform.
PagerDuty worked with us to find the right solution that fit our needs and budget, but they didn't do so as much as I would expect or as much as I would have liked. At some point we asked, "How can you help? How can we get the most out of PagerDuty?" We had some ideas and we engaged with their Professional Services, but the cost of that was quite high. It ended up that we never did what we wanted to do with PagerDuty. It got stuck at some point. They did help but we would need to pay even more to get the help that we need.
It's really key to manage your operations life cycle. You cannot go without having something in place. And it's important to have something that will support you. Whether it's PagerDuty or something else, that is really key.
If I don't look at the cost of it, I would rate PagerDuty between eight and nine out of 10. If I include the cost, it would be a seven.
Disclosure: I am a real user, and this review is based on my own experience and opinions.