Use case Help me from tons of alarms – #OpenStack

Fujitsu Cloud had a performance issue of OpenStack API. Average of API response time was usually good(less than a few seconds), however, once the trouble happened, a large amount of time out errors occurred. We tried to detect the trouble with metrics monitoring(CPU, memory…), but could not configure the threshold for each metric properly. We just got a ton of alarms after the trouble happened. It’s very hard for operators to check whether the alarm is necessary or not for all alarms.

via OpenStack

About The Author
- The OpenStack Foundation promotes the development, distribution and adoption of the OpenStack cloud operating system. As the independent home for OpenStack, the Foundation has already attracted more than 9,500 individual members from 100 countries and 850 different organizations, secured more than $10 million in funding and is ready to fulfill the OpenStack mission of becoming the ubiquitous cloud computing platform.

Tell us what you think...