Monitoring and interim management are the two main features for which we are using OpsGenie. We have an infrastructure with around 400 plus critical nodes that we are monitoring. They are scattered across various geographies and include on-premise switches, servers, other devices, and a few cloud services. There are 10 to 20 engineers who monitor it. We also do total remediation of our own. If something is broken, we'll try to fix it through our software. The use cases also vary depending upon the client's requirements while designing a solution. We are a kind of service provider. We have our own in-house cloud management software. Based on a client's needs, we evaluate and provide the inputs.
AWS Developer at a computer software company with 1,001-5,000 employees
Real User
2020-08-12T07:01:00Z
Aug 12, 2020
We have a monitoring tool called Nagios, and we integrate Nagios and OpsGenie. The tickets flow from Nagios to OpsGenie. We have automation scripts. OpsGenie has the capability to resolve the tickets if it is predefined. If an instance goes down, then to restart the instance, we have inbuilt scripts. When the ticket comes, like instance down or instance failed, we will configure it back to restart the instance. So, that will happen with OpsGenie. It's an incident management tool.
Users are empowered to manage their contact details and notification preferences in a single location, hence the administrators don't have to maintain this information at each application. OpsGenie is a cloud based service with reliable, distributed architecture that is replicated in multiple data centers, and monitored around the clock. The lifecycle for each alert, notifications, actions taken by users, are recorded and reported to enable admins to easily analyze what happened. No more...
Monitoring and interim management are the two main features for which we are using OpsGenie. We have an infrastructure with around 400 plus critical nodes that we are monitoring. They are scattered across various geographies and include on-premise switches, servers, other devices, and a few cloud services. There are 10 to 20 engineers who monitor it. We also do total remediation of our own. If something is broken, we'll try to fix it through our software. The use cases also vary depending upon the client's requirements while designing a solution. We are a kind of service provider. We have our own in-house cloud management software. Based on a client's needs, we evaluate and provide the inputs.
We have a monitoring tool called Nagios, and we integrate Nagios and OpsGenie. The tickets flow from Nagios to OpsGenie. We have automation scripts. OpsGenie has the capability to resolve the tickets if it is predefined. If an instance goes down, then to restart the instance, we have inbuilt scripts. When the ticket comes, like instance down or instance failed, we will configure it back to restart the instance. So, that will happen with OpsGenie. It's an incident management tool.