Ep. 5: Using metrics and graphs in Checkmk 2.0
Read Video Transcript
|[0:00:01]||Hi, today we're discussing metrics and graphs in Checkmk.|
|[0:00:14]||Welcome back to the Checkmk channel. You've already seen them in previous episodes, but today I'm going to take a closer look at metrics and graphs.|
|[0:00:22]||When Checkmk monitors the service it not only determines if the state is OK, WARN or CRIT. But it also stores metrics, for example, the CPU utilization or the transfer bytes per second.|
|[0:00:34]||And this is quite interesting and useful on one hand to recognize and diagnose failures, and on the other hand for planning. For example, when a certain threshold is reached, you can decide it's time to scale up to your resources. And I'll quickly show you how this looks in Checkmk.|
|[0:00:51]||In order to show you more realistic data, we'll be doing this episode in our own internal monitoring system at tribe29. So let me just quickly pick a host.|
|[0:01:05]||Now here in the service overview, you sometimes see this graph icon. This indicates that there is that there are metrics available for this service. And if you hover over one of these icons, you'll see a preview of all the graphs.|
|[0:01:20]||And when I click on one, you'll get to an overview where all the values of all the metrics are depicted in time-series graphs. Checkmk stores the values of the metrics for up to 4 years.|
|[0:01:34]||With a measurement interval of 1-minute, this can quickly add up to a lot of data.|
|[0:01:38]||In order for you to be able to save this data on a normal hard drive, Checkmk compresses the data. For 2 days you will have data on a 1-minute interval, for 10 days the interval will be 5 minutes, then for the first 90 days there will be data available per 30 minutes and for 4 years you will have data points each 6 hours.|
|[0:02:01]||Each measurement interval Checkmk stores the minimum, maximum and average values.|
|[0:02:06]||This means that for 4 years you have access to all 3 of these aggregated values. Here in the top graph, you see the values of 2 metrics. The green part is the input bandwidth and the blue part is the output bandwidth.|
|[0:02:20]||When you scroll with your mouse wheel when hovering over the graph you can zoom in on the x-axis this allows you to investigate the smaller time period. Here on the right side of the graph you have five predefined time ranges each with a preview.|
|[0:02:37]||If you click on them for example last 8 days, you'll see the data for the last 8 days. You can also move over the x-axis by clicking and holding down the left mouse button and then moving from left to right while on the graph so this allows you to move through time.|
|[0:02:55]||When you move up and down with your mouse also while holding down the left mouse button, you can zoom in on the y-axis. So if you want to investigate a small peak you can zoom in and then zoom in on the y-axis as well.|
|[0:03:13]||You can resize the graph by clicking and holding down this bottom right corner. Let's collapse the sidebar. So when you drag it to the size that you want, you'll notice that the rest of the graphs are still the same size and you can fix that simply by reloading the page.|
|[0:03:32]||So now the graphs are all the same size. Many services come with more than one metric, so if you scroll down you see all the metrics that are part of this service. Network cards and switchboards have this metric called "number of packages per second". If you look at the top graph you'll see that the input data is a lot higher or a lot more than the output data.|
|[0:03:58]||But if you compare that to the number of packages per second you'll see that the input and output are roughly the same. There's also another important metric called, "the number of errors".|
|[0:04:10]||You'll see that right now it's empty like it's supposed to be. But if you look a bit further back in time you'll see that there were some errors. Whether or not these were important it's hard to determine from this graph.|
|[0:04:25]||But it can be an indication that you should do some more investigation to discover the underlying problem here.|
|[0:04:33]||You can do a whole lot more with these graphs than what I just showed you. There are graph collections in which you can add multiple single graphs in order to more easily compare them. You can also add graphs to dashboards and to reports.|
|[0:04:48]||You can even create your own, where you pick specific metrics from multiple hosts and services and add them into one graph.|
|[0:04:55]||There's also a Grafana connector which allows you to use data directly from Checkmk and Grafana and then display the data in a Grafana dashboard.|
|[0:05:05]||All the information we discussed in this video about metrics and graphs can also be found in our documentation.|
|[0:05:11]||That's it for today, if this video was helpful to you please like and subscribe. See you next time.|
Ep. 1: Installing Checkmk 2.0 and monitoring your first host
In this video, Baris explains how to take get started with Checkmk and start monitoring your first host within a few minutes.
Ep. 2: The Checkmk 2.0 user interface
In this video, Baris take you through the new user interface in Checkmk 2.0. He explains the various components of the User interface such as the new navigation menus, the Sidebar, main dashboard, tactical overview, how to switch between the Checkmk interface themes and much more
Ep. 3: Using SNMP to monitor network devices in Checkmk 2.0
In this episode, Baris explains how to monitor network devices with Checkmk. SNMP is a protocol that many switches, routers, printers, UPSs, hardware sensors and other devices have implemented with the purpose of being able to monitor them easily.
Ep. 4: Monitoring Windows in Checkmk
In this video of our Getting started with Checkmk series, Baris explains how to install a Checkmk agent on a Windows host system and add that into your monitoring environment.
Ep. 6: Updating Checkmk 2.0 and using multiple instances
In this video, Baris explains how to update your Checkmk instance. It is very easy and can be done within minutes. You can run multiple Checkmk instances with different versions on the same system. This gives you the flexibility to test the new version before using it in production.
Ep. 7 (part 1): Working with rules and setting thresholds in Checkmk
In the following three-part videos series, Baris explains rule-based monitoring with Checkmk. In the first part, he shows you how you can work with rules and set threshold values. Rule-based configuration is one of the key features for Checkmk which helps you to scale your monitoring easily within minutes.
Ep. 7 (part 2): Smart rules with Host Tags in Checkmk
In the second part of this video, Baris explains using Smart rules with host tags in Checkmk. In the first part, he shows you how you can work with rules and set threshold values. These are features that you can use to build your rules even more intelligently and to better organize your monitoring.
Ep. 7 (part 3): Managing Hosts in Folder in Checkmk
In this final part of our episode on Rule-based monitoring in Checkmk, Baris demonstrates how to manage hosts in folders in Checkmk. This helps you to apply your monitoring configurations at scale and organize your hosts according to your needs.
Ep. 8: Working with Host and Service Groups in Checkmk
In this Baris demonstrates how to create host and service groups in Checkmk, so you can perform actions on an entire group instead of configuring each of them individually.
Ep. 9: Using the Quicksearch function in Checkmk
In this episode of the Checkmk tutorials, Baris shows how you can use the Quicksearch function in Checkmk. You can use it to easily find and manage certain hosts or services. He also explains some examples of filters to you. In Checkmk 2.0 you can use the same syntax in the Seach function found in the monitor menu to get identical results.
Ep. 10: Detecting configuration errors with the Analyze Configuration feature
With the Analyze Configuration feature, you can check if there are any configuration errors in your installation. Checkmk controls a number of possible security risks or potential performance restrictions and indicates if there are any problems.
Ep. 11: View creation and customization in Checkmk
In this video, Baris demonstrates how to customize headers, columns, and more in Views in Checkmk for yourself or other users. He also explains how to create custom views and add desired information to these views.
Ep. 12: Acknowledging problems in Checkmk
In this video, Baris explains how you can acknowledge problems in Checkmk. This function helps you to qualify the states of hosts and services. This allows you to keep track of messages in the main dashboard and, for example, you can add comments to problems.
Ep. 13: Scheduling downtimes in Checkmk
In the episode of our Getting started with Checkmk series, Baris explains how you can manage the maintenance times of your systems in Checkmk. Such scheduled downtimes prevent your monitoring from sending false alarms when a host or service goes to WARN or CRIT during maintenance work. You can also inform the users concerned about the maintenance via Checkmk.
Ep. 14: Distributed monitoring with Checkmk
In this video, Baris explains how you can connect several Checkmk instances to a monitoring system and then manage it.
Ep. 15: MKPs and Plugins in Checkmk
In the 15th episode of our Getting started with Checkmk tutorial series, Baris explains what are Checkmk Extension Packages (MKPs) and how easy it is to integrate them into your Checkmk monitoring environment. MKPs are the preferred format when you make your own extensions as it makes it easy to share with other users or deploy in distributed environments.
Ep. 16: Working with 'Bulk Actions' in Checkmk
In this episode of our Checkmk tutorials series, Baris explains how you can save a lot of time with bulk actions. With this feature you can perform various tasks such as deleting, renaming, service discovery etc. on a large number of hosts simultaneously.
Ep. 17: Working with network topologies in Checkmk
In this video of our gettign startted with Checkmk series, Baris explains how to map network topologies in Checkmk. This feature is quite helpful to manage your network and prevent any unnecessary notifications from the devices in your network.
Ep. 18: Creating and customizing dashboards in Checkmk
In this video of our Getting started with Checkmk series, Mathias explains how you can create and customize dashboards in Checkmk 2.0, so you can get insights into your monitoring according to your requirements. Find out more in this video.
Ep. 19: Monitoring websites and their certificates with Checkmk
In this episode, Bastian demonstrates how to monitor a website and its certificate with Checkmk. You can also monitor specific web pages with Checkmk by using the several options that will suit your use case. Learn more in this video.
Ep. 20: Configuring dashboard elements in Checkmk
Learn how to add data visualization elements of the various metrics into your Checkmk Dashboard. In this video, Mathias explains how you can configure these elements and create a dashboard as per your requirements.
Ep. 21: Setting up notifications in Checkmk
Learn how to set up notifications in Checkmk and assign relevant contacts and contact groups to be notified for various events. Later in this video, our presenter Bastian also demonstrates how you can set up rule-based notifications according to different conditions for hosts and services.
Ep. 22: Monitoring logfiles with Checkmk
Monitor your logfiles with Checkmk using its Logwatch plugin. It is very useful when you want to monitor your logfiles regardless of whether you are using a UNIX/Linux or a windows based system. Learn more in this video.
Ep. 24: 3 Rules for efficient network monitoring
In this video, Bastian demonstrates 3 rules that will help you to efficiently monitor your network interfaces. With Checkmk 2.0, with just three rules, you can set up an efficient network monitoring that will not only monitor all of your network interfaces but also simultaneously provide a detailed overview of all of your ports.
Ep. 25: New UX and security improvements in Checkmk 2.1
Checkmk 2.1 come with many UX improvements such as pre-built dashboards for Linux and Windows, faster core performance and much more. Security features such as two-factor authentication etc. were also added in this new version. Watch this video to learn how to use these new features and enhancements in Checkmk.
Ep. 28: Working with InfluxDB integration in Checkmk
Learn how to send data to InfluxDB from Checkmk. As InfluxDB introduced a new protocol to send data to it, a new connector was developed with Checkmk to talk natively with it. Learn more about it in this video.
Ep. 29: New agent architecture in Checkmk 2.1
With Checkmk 2.1, the agent architecture was modified to enable performance improvements and add new features such as TLS encryption, data compression, and the reversal of direction of communication from the agent. This will enable push mode and pull mode.
Ep. 30: Clustering the Checkmk appliance
In this video, Robin demonstrates how you can cluster your Checkmk appliance to make it resilient against hardware failures. If you are using the Checkmk hardware appliance, it may be helpful to cluster your appliance to maintain high availability.
Ep. 32: Working with the Agent bakery in Checkmk
In this video, Robin demonstrates how to roll out agent packages with the required configuration for different monitored systems using the agent bakery in Checkmk. The "Automatic agent update" is quite a helpful feature as it pulls the latest configurations for an agent automatically and you don't need to manually update all of your agents deployed on different systems.
Ep 33: Monitoring Docker containers with Checkmk
Learn how to monitor Docker containers with Checkmk.In this video, Robin demonstrates the process of setting up a rule to configure the docker plugin and bake an agent with the desired settings for the Docker host.
Ep 34: Introduction to Checkmk Ansible collection
Last year the Checkmk Ansible collection was created to interact with the Checkmk REST API. In this video, Robin demonstrates how you can use this Ansible collection to automate your monitoring with Checkmk.
Ep 35: Monitoring SQL databases with Checkmk
In this video, Robin demonstrates how you can configure your Checkmk site to monitor your SQL databases. As there are many flavours of SQL databases, the process is mostly the same.