Company Profile
Follow Us:

Service Monitoring: GCP SLOs And SLIs

Table of Contents

You are a business owner running multiple businesses and handling infrastructure. Everything is going well until one of your business users faces downtime and unavailability, and you are unable to pinpoint the cause.
There come the Service Monitoring 


What is Service Monitoring:

A measurable goal for service performance over a period of time. Monitoring the health of Google Cloud microservices by providing the tools to set up alerting policies on the performance of service level objectives (SLOs).


What are SLOs:

SLOs define a time period for compliance and set a goal for good service. They are also considered a statement of desired performance. Moreover, they encapsulate your performance goals for the service.


What are SLIs:

SLIs collect metrics that measure the performance of the service infrastructure. The performance metrics are the basis of the SLIs for your service. It describes the performance of a particular aspect of your service.

If your service has request-count or response-latency metrics, then SLIs can be derived from those by creating a ratio. i-e

Ratio of number of successful responses to the number of all responses.
Determining how available your service was to users.

Ratio of number of calls below a latency threshold to the number of all calls.
Measures how quickly your service responded to users. You’ll get a metric related to how many responses were faster than a threshold that you define.

Some service-specific SLIs used for the evaluation method of how compliant the service is are :

Counts individual events, letting you know how well your service performed over the entire compliance period, regardless of load distribution.

Counts good minutes versus bad minutes according to the criteria you define. 
This lets you measure performance in terms of time, regardless of how the load is distributed.


Creating and Implementing SLO:





Using SLO-based alerts:

You create one or more alerting policies to monitor the error budget for your SLO.
The policy job is to warn you when your service is consuming the error budget quickly.

You can create the alerting policy either from the Console or the API.

A specific type of alert that warns you if a large, sudden change in consumption is noticed, which utilizes the error-budget quickly.

A specific type of alert that warns you if the rate of consumption is not changed, and it will consume the error budget before the compliance period ends.


Conclusion:

In conclusion, the integrated approach of SLIs, SLOs empowers organizations to:

SLIs provide the essential quantitative metrics (latency, availability) that serve as the foundation for measuring a service’s actual performance from the user’s perspective.

SLOs define specific, internal targets for these metrics, acting as a shared understanding across engineering and product teams for what constitutes an acceptable level of service quality.

While SLOs are internal goals, they support formal, external SLAs that set clear expectations with customers and define the consequences of not meeting agreed-upon service levels.

Service Monitoring- GCP SLOs And SLIs - cta - blog - eurus technologies
Loved❤️Reading? Share this blog
// We Carry more Than Just Good Coding Skills

Let's Evolve Your Business!