For the last few decades, application and operational teams have been preaching loudly about monitoring and alerting capabilities to improve system resiliency. It is only with COVID-19, however, that these capabilities have shifted from “luxury” items to “absolutely necessary” capabilities to ensure business continuity.
As businesses take the next steps to refine their monitoring and alerting capabilities, it is critical that all stakeholders understand and utilize the same frameworks and metrics.
MMC Customer stories series: SNMP alerts
We are evaluating using Nagios or a similar Big Brother Enterprise solution for displaying information about our many Enterprise Mule instances, applications and the status of our production and staging environments. The only sure thing is that we presented a budget for buying a couple of 55” monitors and were approved. Pure awesomeness!
First solution: Do it yourself
This is complicated.
I’m proud to announce that we have released the new Mule ESB Management Console (MMC) — this is an important step forward for Mule ESB.
We built MMC based on significant feedback from our customers, and we put the product through two early access pre-releases to incorporate feedback from real users. I must say that I’m pretty pleased with the end result —
The promise of a monitoring solution that will pinpoint application problems and give you exact steps to fix the problem has remained a dream. In addition, monitoring systems have become notorious for being expensive and difficult to maintain. Diagnosing application performance problems requires application-specific diagnostic information that general-purpose monitoring tools often do not provide.
While system monitoring products are useful in triaging a problem and assigning responsibility to a particular team (for ex: Application Server team),