|[0:00:00]||Welcome back to the Checkmk channel and in this episode, we're going to take a look at acknowledging problems in Checkmk.|
|[0:00:16]||So what is a problem, simply said whenever a host or service does not have the status OK. So when a host has the status down or a service has the state is CRIT, WARN or Unknown. Checkmk distinguishes between handled and unhandled problems, whenever a problem is handled that means that the user knows about it and that someone has taken care of it and marking these problems as handled is what we call acknowledging the problem.|
|[0:00:45]||Whenever you acknowledge a problem it will have three effects: you will see an icon next to the service name indicating that the problem is acknowledged, you will stop receiving any notifications and it will be removed from certain views. For example, it will disappear from the unhandled problem section in the sidebar.|
|[0:01:03]||Now let's take a look how all of this works. Acknowledging problems works via commands to execute a command you first have to choose the service on which you want to execute the command. So let's now go to the service problems view and we can get there by using the sidebar and clicking on the unhandled problems here.|
|Here you see all the unhandled problems for this system. Let's choose the root file system of our Checkmk server for example. Now to acknowledge a problem we can simply click on the acknowledge problems button or we can go to the commands menu and click on acknowledge problems here.|
|[0:01:48]||And you see that there is a required field called Comment, you could use this for example to let your colleagues know how you are going to fix the problem. So let's type for example: ordered new disk and Acknowledge. Yes, confirm.|
|[0:02:10]||And let's go back to the view and now you see that there are two new icons here, you got this one (acknowledging) indicating that the problem has been acknowledged and you got the speech bubble and if you hover over it you'll see the message that we just typed in, 'ordered new disk'.|
|[0:02:30]||Now if we refresh the page or wait a little, we'll also see that here the unhandled problems is one less than we just had. So it's four now instead of five. So let's go back to that view again.|
|[0:02:47]||Now you see that there is just four unhandled problem. You can also acknowledge multiple problems at the same time and you could do that by simply clicking acknowledge problems here in this view.|
|[0:03:02]||This would acknowledge the problem for all of the services in this view. So we could do here and we type for example 'Load is OK' and now acknowledge and now you see here in this message 'Do we really want to acknowledge the problems for the following 4 services?' So let's confirm.|
|[0:03:30]||And now let's go back to the view and you see that all the problems have disappeared here. And once again in the sidebar there's no unhandled problems anymore. You might run into a situation where you want to remove an acknowledgement. To do that let's go to monitor and then 'Service problems'.|
|[0:03:54]||Here in this view you will see all handled and unhandled or unacknowledged problems. Once again you can remove the acknowledgement for a single problem or for all at once. To remove it for all at once, simply click on acknowledge problems and I click the 'Remove acknowledgement' button.|
|[0:04:17]||Once again you see the message (if) 'Do you really want to remove the acknowledgement for all 5 services?' Yes, we want to so press 'Confirm'. Let's head back to the service problems view and now you'll see that the two icons have disappeared.So now also we have five unhandled problems again. Like I said you can do that this for one service at a time or for all at once but you can also make a sub-selection and perform the command on a few problems at a time.|
|[0:04:48]||To do that you could use a filter to filter out some of the problems and then apply the acknowledgement on that filtered view or you can use the checkboxes like this. And you could, for example, pick these three, press 'Acknowledge problems', type in your comment and press Confirm.|
|[0:05:23]||Now you have two links here instead of one. If you want to return to the previous view with the select boxes still checked then you can press 'Back to view', if you want to reset the checkboxes press the second link. So if you want to perform another command on the same services, you could go back to view with the checkboxes still selected.|
|[0:05:52]||You might have noticed that there were some additional options when acknowledging problems which I haven't explained yet. So now let's take a look at those because they can be quite useful. So let's go back to the unhandled problems again.|
|[0:06:07]||And click on Acknowledge problems, now you see that there are three checkboxes here and the possibility to add a date in days hours and minutes. Let's start with this. So when you set a time here you basically set an expiration date for the acknowledgement.|
|[0:06:27]||For example, when you expect a problem to be resolved in one day you can configure that here. Then after that day if the problem still exists it will reappear under unhandled problems.|
|[0:06:38]||And you will once again start receiving notifications. However, this feature is only available in the enterprise edition and in the free version of the enterprise edition.|
|[0:06:48]||Then there is also this checkbox called 'sticky', this is a bit more subtle. It can happen that a service first goes to WARN and later to a CRIT state, the question then is what happens with the acknowledgement? For example, let's say that the file system server should go to warning when it's 80% full, but then later on it goes to the CRIT state.|
|Should it be a new unhandled problem? or should it still be acknowledged? When you want it to appear as a new problem then you need to make sure that this checkbox is not selected.|
|[0:07:26]||If it is selected then the problem will be acknowledged until the service goes back to OK. The send notifications checkbox simply means that a notification will be sent (to the responsive people) to the responsible people notifying them that the problem has been acknowledged.|
|[0:07:46]||We'll make a separate video about notifications sometime in the future. Then lastly we have the persistent comment feature. If this is selected that means that the comment will not disappear when the acknowledgement does.|
|[0:08:03]||Normally they are connected in such a way that when a service goes back to OK both the acknowledgement and the comment disappear but with this you can basically pin the comment to the service. This also means that you would have to remove it manually when you don't need it anymore.|
|[0:08:24]||So that was it for acknowledging problems in Checkmk. I hope this episode was helpful to you. If so, please subscribe to the channel and like the video. See you in the next episode.|
Ep. 1: Installing Checkmk 2.0 and monitoring your first host
In this video, Baris explains how to take get started with Checkmk and start monitoring your first host within a few minutes.
Ep. 2: The Checkmk 2.0 user interface
In this video, Baris take you through the new user interface in Checkmk 2.0. He explains the various components of the User interface such as the new navigation menus, the Sidebar, main dashboard, tactical overview, how to switch between the Checkmk interface themes and much more
Ep. 3: Using SNMP to monitor network devices in Checkmk 2.0
In this episode, Baris explains how to monitor network devices with Checkmk. SNMP is a protocol that many switches, routers, printers, UPSs, hardware sensors and other devices have implemented with the purpose of being able to monitor them easily.
Ep. 4: Monitoring Windows in Checkmk
In this video of our Getting started with Checkmk series, Baris explains how to install a Checkmk agent on a Windows host system and add that into your monitoring environment.
Ep. 5: Using metrics and graphs in Checkmk 2.0
In the 5th episode of the Getting started with Checkmk series, Baris explains using various metrics that you can monitor in Checkmk such as CPU utilization, CPU load etc. You can also see graph visualizations for these metrics or create and customize your own as per your requirements.
Ep. 6: Updating Checkmk 2.0 and using multiple instances
In this video, Baris explains how to update your Checkmk instance. It is very easy and can be done within minutes. You can run multiple Checkmk instances with different versions on the same system. This gives you the flexibility to test the new version before using it in production.
Ep. 7 (part 1): Working with rules and setting thresholds in Checkmk
In the following three-part videos series, Baris explains rule-based monitoring with Checkmk. In the first part, he shows you how you can work with rules and set threshold values. Rule-based configuration is one of the key features for Checkmk which helps you to scale your monitoring easily within minutes.
Ep. 7 (part 2): Smart rules with Host Tags in Checkmk
In the second part of this video, Baris explains using Smart rules with host tags in Checkmk. In the first part, he shows you how you can work with rules and set threshold values. These are features that you can use to build your rules even more intelligently and to better organize your monitoring.
Ep. 7 (part 3): Managing Hosts in Folder in Checkmk
In this final part of our episode on Rule-based monitoring in Checkmk, Baris demonstrates how to manage hosts in folders in Checkmk. This helps you to apply your monitoring configurations at scale and organize your hosts according to your needs.
Ep. 8: Working with Host and Service Groups in Checkmk
In this Baris demonstrates how to create host and service groups in Checkmk, so you can perform actions on an entire group instead of configuring each of them individually.
Ep. 9: Using the Quicksearch function in Checkmk
In this episode of the Checkmk tutorials, Baris shows how you can use the Quicksearch function in Checkmk. You can use it to easily find and manage certain hosts or services. He also explains some examples of filters to you. In Checkmk 2.0 you can use the same syntax in the Seach function found in the monitor menu to get identical results.
Ep. 10: Detecting configuration errors with the Analyze Configuration feature
With the Analyze Configuration feature, you can check if there are any configuration errors in your installation. Checkmk controls a number of possible security risks or potential performance restrictions and indicates if there are any problems.
Ep. 11: View creation and customization in Checkmk
In this video, Baris demonstrates how to customize headers, columns, and more in Views in Checkmk for yourself or other users. He also explains how to create custom views and add desired information to these views.
Ep. 13: Scheduling downtimes in Checkmk
In the episode of our Getting started with Checkmk series, Baris explains how you can manage the maintenance times of your systems in Checkmk. Such scheduled downtimes prevent your monitoring from sending false alarms when a host or service goes to WARN or CRIT during maintenance work. You can also inform the users concerned about the maintenance via Checkmk.
Ep. 14: Distributed monitoring with Checkmk
In this video, Baris explains how you can connect several Checkmk instances to a monitoring system and then manage it.
Ep. 15: MKPs and Plugins in Checkmk
In the 15th episode of our Getting started with Checkmk tutorial series, Baris explains what are Checkmk Extension Packages (MKPs) and how easy it is to integrate them into your Checkmk monitoring environment. MKPs are the preferred format when you make your own extensions as it makes it easy to share with other users or deploy in distributed environments.
Ep. 16: Working with 'Bulk Actions' in Checkmk
In this episode of our Checkmk tutorials series, Baris explains how you can save a lot of time with bulk actions. With this feature you can perform various tasks such as deleting, renaming, service discovery etc. on a large number of hosts simultaneously.
Ep. 17: Working with network topologies in Checkmk
In this video of our gettign startted with Checkmk series, Baris explains how to map network topologies in Checkmk. This feature is quite helpful to manage your network and prevent any unnecessary notifications from the devices in your network.
Ep. 18: Creating and customizing dashboards in Checkmk
In this video of our Getting started with Checkmk series, Mathias explains how you can create and customize dashboards in Checkmk 2.0, so you can get insights into your monitoring according to your requirements. Find out more in this video.
Ep. 19: Monitoring websites and their certificates with Checkmk
In this episode, Bastian demonstrates how to monitor a website and its certificate with Checkmk. You can also monitor specific web pages with Checkmk by using the several options that will suit your use case. Learn more in this video.
Ep. 20: Configuring dashboard elements in Checkmk
Learn how to add data visualization elements of the various metrics into your Checkmk Dashboard. In this video, Mathias explains how you can configure these elements and create a dashboard as per your requirements.
Ep. 21: Setting up notifications in Checkmk
Learn how to set up notifications in Checkmk and assign relevant contacts and contact groups to be notified for various events. Later in this video, our presenter Bastian also demonstrates how you can set up rule-based notifications according to different conditions for hosts and services.
Ep. 22: Monitoring logfiles with Checkmk
Monitor your logfiles with Checkmk using its Logwatch plugin. It is very useful when you want to monitor your logfiles regardless of whether you are using a UNIX/Linux or a windows based system. Learn more in this video.
Ep. 24: 3 Rules for efficient network monitoring
In this video, Bastian demonstrates 3 rules that will help you to efficiently monitor your network interfaces. With Checkmk 2.0, with just three rules, you can set up an efficient network monitoring that will not only monitor all of your network interfaces but also simultaneously provide a detailed overview of all of your ports.
Ep. 25: New UX and security improvements in Checkmk 2.1
Checkmk 2.1 come with many UX improvements such as pre-built dashboards for Linux and Windows, faster core performance and much more. Security features such as two-factor authentication etc. were also added in this new version. Watch this video to learn how to use these new features and enhancements in Checkmk.
Ep. 28: Working with InfluxDB integration in Checkmk
Learn how to send data to InfluxDB from Checkmk. As InfluxDB introduced a new protocol to send data to it, a new connector was developed with Checkmk to talk natively with it. Learn more about it in this video.
Ep. 29: New agent architecture in Checkmk 2.1
With Checkmk 2.1, the agent architecture was modified to enable performance improvements and add new features such as TLS encryption, data compression, and the reversal of direction of communication from the agent. This will enable push mode and pull mode.
Ep. 30: Clustering the Checkmk appliance
In this video, Robin demonstrates how you can cluster your Checkmk appliance to make it resilient against hardware failures. If you are using the Checkmk hardware appliance, it may be helpful to cluster your appliance to maintain high availability.
Ep. 32: Working with the Agent bakery in Checkmk
In this video, Robin demonstrates how to roll out agent packages with the required configuration for different monitored systems using the agent bakery in Checkmk. The "Automatic agent update" is quite a helpful feature as it pulls the latest configurations for an agent automatically and you don't need to manually update all of your agents deployed on different systems.
Ep 33: Monitoring Docker containers with Checkmk
Learn how to monitor Docker containers with Checkmk.In this video, Robin demonstrates the process of setting up a rule to configure the docker plugin and bake an agent with the desired settings for the Docker host.
Ep 34: Introduction to Checkmk Ansible collection
Last year the Checkmk Ansible collection was created to interact with the Checkmk REST API. In this video, Robin demonstrates how you can use this Ansible collection to automate your monitoring with Checkmk.
Ep 35: Monitoring SQL databases with Checkmk
In this video, Robin demonstrates how you can configure your Checkmk site to monitor your SQL databases. As there are many flavours of SQL databases, the process is mostly the same.
Ep. 36: Introduction to different types of checks for monitoring with Checkmk
Learn about the different types of "checks" and services in Checkmk. In this video, Robin demonstrates how you can expand the information collected by your Checkmk agent using these different "Checks".
Ep. 43: Working with Hardware/Software inventory in Checkmk
In this video, Robin demonstrates the hardware/software inventory feature in Checkmk. With this feature one can get an overview of various pieces of hardware present in their servers, switches etc. and also the software packages installed on their operating system. Watch this video to learn more.