ITSM Consultant at a consultancy with 501-1,000 employees
Consultant
Top 20
2024-02-29T03:58:05Z
Feb 29, 2024
I've integrated it into a tech demo. It's basically in our system, a system tool. I've done some deduplication inside of it. I can also set the force by directional communication from JSM. Opsgenie is really good. It integrates pictures and links directly into Jira Service Management. And that enables it, it enables vision and between getting the problem solved and finding who to find to solve that problem. So, Opsgenie is a really good tool.
Engineering Manager at a tech vendor with 201-500 employees
Real User
2022-10-06T14:09:54Z
Oct 6, 2022
We have several different teams that need to be on-call for several different products. So, we use Opsgenie to manage that, and we also use it for routing our alerts from a different monitoring service. We also use Opsgenie to control some of our schedules for different activities. For example, we have internal support for some of our teams, and even though this does not require the teams to receive any types of alerts or monitoring, we still use it for scheduling so that it can control who will be the next on-call person. We are using it as a service. So, we don't host it in any way. I believe we are using the latest version because we do not control its version. It's a service.
Senior Build And Release Engineer at a tech services company with 1,001-5,000 employees
Real User
2022-10-04T16:28:18Z
Oct 4, 2022
We have CloudWatch metrics like Grafana and AWS metrics that tell us about our registration counts, the services we run, such as microservices running on containers, and container orchestration systems like Kubernetes. We need to know how many resources are being used, how many services are being utilized, and how many registrations are expected and how/are not expected. We are establishing thresholds. Opsgenie receives all of those alerts and simply notifies us if there is a P1 issue, (a priority one issue) for us. My company is a financial services company. Priority one issues are more likely to be related to services that have an SLA of 99.9999% and the service is down for example. Opsgenie sends alarm alerts to the appropriate people at the appropriate time. We have schedules, and we have on-call personnel who will receive Opsgenie alerts in the form of a call or a message. The schedule varies according to the individual. Then he'll get a call from the next guy on his calendar. Opsgenie manages everything.
Sr software engineer at a tech services company with 5,001-10,000 employees
Real User
2022-09-16T14:21:13Z
Sep 16, 2022
We use OpsGenie for alert management and incident management. We have integrated it with three different tools. We have integrated it with Nagios, SolarWinds, and AppDynamics. These three tools generate the alerts that are sent to OpsGenie. A regular alert moves as a Jira ticket. So, SolarWinds is the source, OpsGenie is the mediator, and Jira is the end receiver. People interact more with Jira. It is a part of our alert management. For incident management, we have given access to a few of the service desk engineers. They're using OpsGenie for incident triage and documentation. Incident management, triage, and chronology are captured in Confluence. We have an option to export to Confluence. So, we are able to capture various types of information such as what happened, who gave an update, when the incident got closed, and how they got the solution. The root cause analysis process is being managed by OpsGenie. It is completely on the cloud.
Monitoring and interim management are the two main features for which we are using OpsGenie. We have an infrastructure with around 400 plus critical nodes that we are monitoring. They are scattered across various geographies and include on-premise switches, servers, other devices, and a few cloud services. There are 10 to 20 engineers who monitor it. We also do total remediation of our own. If something is broken, we'll try to fix it through our software. The use cases also vary depending upon the client's requirements while designing a solution. We are a kind of service provider. We have our own in-house cloud management software. Based on a client's needs, we evaluate and provide the inputs.
AWS Developer at a computer software company with 1,001-5,000 employees
Real User
2020-08-12T07:01:00Z
Aug 12, 2020
We have a monitoring tool called Nagios, and we integrate Nagios and OpsGenie. The tickets flow from Nagios to OpsGenie. We have automation scripts. OpsGenie has the capability to resolve the tickets if it is predefined. If an instance goes down, then to restart the instance, we have inbuilt scripts. When the ticket comes, like instance down or instance failed, we will configure it back to restart the instance. So, that will happen with OpsGenie. It's an incident management tool.
Users are empowered to manage their contact details and notification preferences in a single location, hence the administrators don't have to maintain this information at each application. OpsGenie is a cloud based service with reliable, distributed architecture that is replicated in multiple data centers, and monitored around the clock. The lifecycle for each alert, notifications, actions taken by users, are recorded and reported to enable admins to easily analyze what happened. No more...
I've integrated it into a tech demo. It's basically in our system, a system tool. I've done some deduplication inside of it. I can also set the force by directional communication from JSM. Opsgenie is really good. It integrates pictures and links directly into Jira Service Management. And that enables it, it enables vision and between getting the problem solved and finding who to find to solve that problem. So, Opsgenie is a really good tool.
We use the product for email notifications from the automatic application.
The primary use case of the solution is to schedule on-call personnel and notify them about any alerts that require their attention.
We have several different teams that need to be on-call for several different products. So, we use Opsgenie to manage that, and we also use it for routing our alerts from a different monitoring service. We also use Opsgenie to control some of our schedules for different activities. For example, we have internal support for some of our teams, and even though this does not require the teams to receive any types of alerts or monitoring, we still use it for scheduling so that it can control who will be the next on-call person. We are using it as a service. So, we don't host it in any way. I believe we are using the latest version because we do not control its version. It's a service.
We have CloudWatch metrics like Grafana and AWS metrics that tell us about our registration counts, the services we run, such as microservices running on containers, and container orchestration systems like Kubernetes. We need to know how many resources are being used, how many services are being utilized, and how many registrations are expected and how/are not expected. We are establishing thresholds. Opsgenie receives all of those alerts and simply notifies us if there is a P1 issue, (a priority one issue) for us. My company is a financial services company. Priority one issues are more likely to be related to services that have an SLA of 99.9999% and the service is down for example. Opsgenie sends alarm alerts to the appropriate people at the appropriate time. We have schedules, and we have on-call personnel who will receive Opsgenie alerts in the form of a call or a message. The schedule varies according to the individual. Then he'll get a call from the next guy on his calendar. Opsgenie manages everything.
We use OpsGenie for alert management and incident management. We have integrated it with three different tools. We have integrated it with Nagios, SolarWinds, and AppDynamics. These three tools generate the alerts that are sent to OpsGenie. A regular alert moves as a Jira ticket. So, SolarWinds is the source, OpsGenie is the mediator, and Jira is the end receiver. People interact more with Jira. It is a part of our alert management. For incident management, we have given access to a few of the service desk engineers. They're using OpsGenie for incident triage and documentation. Incident management, triage, and chronology are captured in Confluence. We have an option to export to Confluence. So, we are able to capture various types of information such as what happened, who gave an update, when the incident got closed, and how they got the solution. The root cause analysis process is being managed by OpsGenie. It is completely on the cloud.
Monitoring and interim management are the two main features for which we are using OpsGenie. We have an infrastructure with around 400 plus critical nodes that we are monitoring. They are scattered across various geographies and include on-premise switches, servers, other devices, and a few cloud services. There are 10 to 20 engineers who monitor it. We also do total remediation of our own. If something is broken, we'll try to fix it through our software. The use cases also vary depending upon the client's requirements while designing a solution. We are a kind of service provider. We have our own in-house cloud management software. Based on a client's needs, we evaluate and provide the inputs.
We have a monitoring tool called Nagios, and we integrate Nagios and OpsGenie. The tickets flow from Nagios to OpsGenie. We have automation scripts. OpsGenie has the capability to resolve the tickets if it is predefined. If an instance goes down, then to restart the instance, we have inbuilt scripts. When the ticket comes, like instance down or instance failed, we will configure it back to restart the instance. So, that will happen with OpsGenie. It's an incident management tool.