Checkmk provides features to solve every problem

Checkmk is a comprehensive solution for IT Monitoring of applications, servers, and networks. It comes with a vast breadth of features built-up jointly with our customers over many years. You can easily accomplish simple tasks, and monitor the most complex IT environments as well.

Automation

Automate your monitoring to save operator time

  • Monitor with less effort thanks to automatic detection and configuration of the components. You don't need to configure 'this is a firewall' — all relevant components, their metrics and thresholds will be suggested

  • Automated monitoring for dynamic ephemeral infrastructures: containers, pods, VMs and more can be automatically and dynamically added or removed from monitoring.

  • Use the modern rule-based 1-to-N configuration — which remains intuitive even in complex environments — resulting in lower configuration effort than comparable products

  • Centrally manage your agents and automate agent updating (with the 'Agent Bakery')

  • Powerful APIs to automate almost every action and integrate with other systems

  • Integrate data from a wide range of data sources and formats for metrics (JSON, XML, SNMP data, etc.)

Display with speedometer

CPU with gear wheel

Configuration

From zero-to-monitoring in minutes to maximize time to value

  • Fast installation from a single integrated package, available for many platforms and as a Docker container. No need to separately install and maintain databases and web servers.

  • Don't waste time thinking through which metrics you need — Checkmk auto-discovers the relevant metrics for you buidling on years of expert know-how

  • Use both a powerful agent-based monitoring and an agentless monitoring via HTTP or SNMP, or by connecting directly to the APIs of many applications

  • Quickly identify problems in your IT environment through a differentiated 'state' (OK, WARN, CRIT) of each monitored component or system — drill down with one click

  • Configure everything in a web interface. Fast, easy and less error-prone.

  • Use your existing role-based access controls (LDAP, AD) with a fine-grained permission model for users and groups


Plug-ins

Monitor everything with the broadest coverage of plug-ins

  • Leverage the broadest coverage of vendor-maintained smart plug-ins (>1,900) to collect relevant metrics from the most heterogeneous infrastructures

  • You will have a hard time finding any vendor maintaining such a vast base of plug-ins for you. Additionally, find even more plug-ins on the Checkmk Exchange, shared by our monitoring community.

  • Benefit from regularly updated plug-ins. Your software and hardware are constantly changing, so you also need plug-ins which can be maintained

Triangle ruler with pencil and cube

Display with an increasing chart

Performance

Scale your monitoring through a performance-optimized distributed architecture

  • Benefit from Checkmk's own high-performance core: the 'CMC' (the 'core' is the heart of every monitoring system — this queries the plug-ins, collects their results, provides information on states to the GUI, etc.)

  • Monitor thousands of services on one single monitoring server. No need to maintain and synchronize several monitoring instances in a single data center

  • Scale across hundreds of sites and millions of devices. Checkmk can support a world-wide distributed setup — hardly any other monitoring system can scale as well

  • Leverage highly efficient monitoring agents with minimal CPU, RAM and storage utilization. No need for DLLs, libraries etc. Runs on even the smallest servers


Hybrid Infrastructure Monitoring

Use modern monitoring concepts for cloud-native and on-premises architectures

  • Ingest data at high granularity to handle both traditional and emerging ephemeral container architectures
  • Sample in real-time, with measurement intervals as short as 1 second
  • Tag your data or auto-discover its tags or labels to provide relevant context on which to filter — labels for full flexibility and specifically set tags to ensure consistency
  • Store metrics in disk-space-efficient long-term storage
Server rack with cloud in background

Reporting icon

Visualization

Vizualize your data in Checkmk's modern customizable dashboards or in Grafana

  • Leverage graphic maps and diagrams with live-monitoring data to get a better view on the health of your IT
  • Analyze time-series metrics over long time horizons with interactive HTML5 graphs
  • Customize dashboards and views to your specific needs
  • Compare metrics across multiple graphs at a glance
  • Differentiate dashboards and views among users or user groups, e.g. vSphere specific views for VMware admins
  • Customize the side menus — e.g. monitoring admins need various statistics while network admins might only need reporting
  • Alternatively visualize your data in Grafana using the Grafana Checkmk datasource plugin or using Checkmk's Graphite exporter for InfluxDB

Customization

Use Checkmk's high flexibility to customize or extend it to your needs

  • Most parts of Checkmk are written in Python and can be easily customized and extended by yourself
  • Broad network of partners and tribe29 to adapt Checkmk or its plugins, where you desire
  • With version 2.0, Checkmk will come with a new "Check-API" – a set of common functions that make plug-in development much easier – as well as a developer documentation
Geometric shapes on a display

business intelligence icon

Log Monitoring

Combine metrics and log data for fast problem identification and root cause analysis

  • Monitor logs for the analysis of error messages via syslog, SNMP-traps and other arbitrary log files
  • Forward events to trigger scripts or generate notifications
  • Summarize equal entries to one single event (e.g. several failed logons) to stay on top of your events
  • Filter incoming messages to only see important events. No more manual filtering and information overload

Alerting

Alert with a smart and granular system to avoid notification overload

  • Notify the responsible team quickly — e.g. notify only the storage admins for a failing disk, but not the email admins
  • Notify via email, SMS and 3rd party tools such as ServiceNow, Jira, Slack, PagerDuty and VictorOps — use your established tools for handling incidents
  • Leverage comprehensive, rule-based notifications to fulfill complex custom requirements regarding time periods, service levels, etc.
  • Configure additional alerts or cancel an alert in specific situations. Escalate problems if they are not handled in time
  • Handle alerts centrally — also in distributed environments
  • Automatically trigger actions as reactions to detected problems (alert handler) — e.g. trigger remediation actions via scripts
Alert sign

alerthandling icon

Prometheus Integration

Bridge the gap between Dev and Ops teams via Checkmk-Prometheus integration (Beta)

  • Use Checkmk's powerful context monitoring to complement Prometheus' flexibility
  • Automatic processing of Prometheus metrics for specific exporters
  • Generation of structured information and context monitoring of Prometheus metrics
  • Execute PromQL queries directly from within Checkmk, e.g. to get data from code instrumentation and monitor this

Advanced Analytics

Predict trends and resource utilization with advanced analytics

  • Benefit from sophisticated algorithms and adapt your monitoring to the changing reality by using predictive monitoring
  • Use advanced analytics to analyze historic data, predict trends and forecast resource utilization
configuration icon

Microchip icon

Integrations

Integrate with major ITOM/ITSM tools to provide integrated workflows

  • Use powerful APIs to build deep integrations
  • Interface to standard off-the-shelf Configuration Management Databases (CMDB)
  • Feed the monitoring directly with already existing information from Configurable Management Databases (CMDB) via Checkmk's APIs.

Business Process Monitoring

Support business leaders by monitoring business process health

  • Monitor business processes by mapping dependencies of your application into a single overview
  • See the availability and performance of complex systems at a glance
  • Aggregate various services and hosts into a single state
  • Review historical states of an aggregation to understand root causes for degraded performances in the past
  • Deliver more reliable services to customers through a completely transparent view which is easy to understand
  • Configure all possible setups, such as HA with two or more nodes, HPC etc. with maximum and unique freedom
  • Simulate worst case scenarios in real time. See the impact of failing components before they become real
Screwdriver with gear wheel

Reporting dashboard

Hard- and software inventory

Identify all assets in and within your IT

  • Identify the inventory of all hardware and software installed on devices and systems
  • Monitor changes in hardware and software — use this unique feature to control the access to your hardware and servers and never miss installing a security update again
  • Benefit from integration of regularly updated dynamic parameters such as disk space utilization that are updated via monitoring data
  • Benefit from integrating regularly updated dynamic parameters, such as disk space utlization, with your CMDB to combine a "static" CMDB view with a "dynamic" monitoring view of your asset's state


Reporting

Automatically generate reports to easily share the state of the IT with management

  • Review the history of states over any desired timeframe with a single click, with computation of availability in real time
  • Deaverage availability data. Exclude non-monitored times, change the resolution, ignore short intervals
  • Get notified before you break your SLA contracts by monitoring the compliance of your complex SLAs. Even if the SLA definition contains only working hours
  • Generate PDF reports with your corporate identity containing all of the views you build — ad-hoc or automated at regular intervals
Tree chart

nagios alternative icon

Nagios alternative

Nagios compatibility for easy migration

  • Continue using your Nagios Checks for the (rare) cases that no Checkmk alternatives exist yet — the Checkmk Microcore supports existing Nagios checks in addition to the Checkmk checks.
  • No need to train your entire team, if it is accustomed to Nagios — Checkmk uses better concepts, where sensible. Other concepts have been adopted, simplifying the switch from the old Nagios world to the new Checkmk world
  • BUT: no need to live with any of the Nagios deficiencies — the Checkmk Enterprise Edition operates 100% stand-alone

Note: Some of the aforementioned features are available in the Checkmk Enterprise Editions only


What do you like about Checkmk?

See what our users have to say