MinMon - an opinionated minimal monitoring and alarming tool (for Linux)
This tool is just a single binary and a config file. No database, no GUI, no graphs. Just monitoring and alarms. I wrote this because the existing alternatives I could find were too heavy, mainly focused on nice GUIs with graphs (not on alarming), too complex to set up, or targeted at cloud/multi-instance setups.
Checks
The following checks are implemented:
See Roadmap for further ideas.
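As a quick preview, this is how the `FilesystemUsage` check from the example further down in this README is declared (excerpt only; its alarms are omitted here):

```toml
# Excerpt of the example config below - a check is usually combined
# with at least one alarm (see the "Example" section).
[[checks]]
interval = 60
name = "Filesystem usage"
type = "FilesystemUsage"
mountpoints = ["/home"]
```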
Actions
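Actions are declared in `[[actions]]` blocks and referenced by name from alarms (and from the report, see below). As a preview, the `Log` action from the example further down looks like this:

```toml
# Excerpt of the example config below.
[[actions]]
name = "Log error"
type = "Log"
level = "Error"
template = """{{check_name}} check didn't have valid data for alarm '{{alarm_name}}' and id '{{alarm_id}}'."""
```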
Report
The absence of alarms can mean two things: everything is okay or the monitoring/alarming failed altogether. That's why MinMon can trigger regular report events to let you know that it's up and running.
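The exact report options are part of the config file documentation; as a rough, hypothetical sketch (the `[report]` section name and its fields below are assumptions for illustration, not taken from this README), a weekly report could be wired to an action like this:

```toml
# Hypothetical sketch: the section and field names here are assumptions
# for illustration only - consult the config documentation for the
# real report schema.
[report]
interval = 604800            # once a week, in seconds (assumed unit)

[[report.events]]
name = "Weekly webhook"
action = "Webhook 1"         # an action defined under [[actions]]
```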
Design decisions
- No complex scripting language.
- No fancy config directory structure - just a single TOML file.
- No users, groups or roles.
- No cryptic abbreviations. The few extra letters in the config file won't hurt anyone.
- There are no predefined threshold names like "Warning" or "Critical". You might want more than just two, or only one, so that's up to you to define in the config.
- The same check plugin can be used multiple times. You might want different levels to trigger different actions for different filesystems at different intervals.
- Alarms are timed in "cycles" (i.e. multiples of the `interval` of the check) instead of seconds. This is not very user-friendly, but it keeps the internal processing and the code simple and efficient (e.g. with an `interval` of 60 seconds, `cycles = 3` corresponds to 3 minutes).
- Alarms stand for themselves - they are not related. This means that, depending on your configuration, two (or more) events may be triggered at the same time for the same check. There are cases where this could be undesirable.
- Simple, clean, bloat-free code with good test coverage.
- Depending on your configuration, there may be similar or identical blocks in the config file. This is a consequence of the flexibility and simplicity of the config file format.
- All times and dates are UTC. No fiddling with local times and time zones.
- No internal state is stored between restarts.
- As of now it's only for Linux but it should be easy to adapt to other *NIXes or maybe even Windows.
- Some of the things mentioned above may change in the future (see Roadmap).
Config file
The config file uses the TOML format and has the following sections:
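At minimum, the example later in this README uses checks (with nested alarms) and actions, each written as a TOML array of tables; other sections (e.g. for the report) are omitted in this skeleton:

```toml
# Skeleton only - see the full example in the "Example" section below.
[[checks]]
# check options ...

[[checks.alarms]]
# alarm options ...

[[actions]]
# action options ...
```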
Architecture
System overview
```mermaid
graph TD
    A(Config file) --> B(Main loop)
    B -->|interval| C(Check 1)
    B -.-> D(Check 2..n)
    C -->|data| E(Alarm 1)
    C -.-> F(Alarm 2..m)
    E -->|cycles, repeat_cycles| G(Action)
    E -->|recover_cycles| H(Recover action)
    E -->|error_repeat_cycles| I(Error action)
    style C fill:green;
    style D fill:green;
    style E fill:red;
    style F fill:red;
    style G fill:blue;
    style H fill:blue;
    style I fill:blue;
```
Alarm state machine
Each alarm has three possible states: "Good", "Bad" and "Error".
It takes `cycles` consecutive bad data points to trigger the transition from "Good" to "Bad" and `recover_cycles` good ones to go back. These transitions trigger the `action` and `recover_action` actions. During the "Bad" state, `action` will be triggered again every `repeat_cycles` cycles (if `repeat_cycles` is not 0).
The "Error" state is a bit special as it only "shadows" the other states. An error means that there is no data available at all, e.g. the filesystem usage for `/home` could not be determined. Since this should rarely ever happen, the transition to the error state always triggers the `error_action` on the first cycle. If there is valid data on the next cycle, the state machine continues as if the error state did not exist.
```mermaid
stateDiagram-v2
    direction LR
    [*] --> Good
    Good --> Good
    Good --> Bad: action/cycles
    Good --> Error: error_action
    Bad --> Good: recover_action/recover_cycles
    Bad --> Bad: repeat_action/repeat_cycles
    Bad --> Error: error_action
    Error --> Good
    Error --> Bad
    Error --> Error: error_repeat_action/error_repeat_cycles
```
Example
Check the mountpoint at `/home` every minute. If the usage level exceeds 70% for 3 consecutive cycles (i.e. 3 minutes), the "Warning" alarm triggers the "Webhook 1" action. The action repeats every 100 cycles until the "Warning" alarm recovers. This happens after 5 consecutive cycles below 70%, which also triggers the "Webhook 1" action. If there is an error while checking the filesystem usage, the "Log error" action is triggered. This is repeated every 200 cycles.
Config
```toml
[[checks]]
interval = 60
name = "Filesystem usage"
type = "FilesystemUsage"
mountpoints = ["/home"]

[[checks.alarms]]
name = "Warning"
level = 70
cycles = 3
repeat_cycles = 100
action = "Webhook 1"
recover_cycles = 5
recover_action = "Webhook 1"
error_repeat_cycles = 200
error_action = "Log error"

[[actions]]
name = "Webhook 1"
type = "Webhook"
url = "https://example.com/hook1"
body = """{"text": "{{alarm_name}}: {{check_name}} on mountpoint '{{check_id}}' reached {{level}}%."}"""
headers = {"Content-Type" = "application/json"}

[[actions]]
name = "Log error"
type = "Log"
level = "Error"
template = """{{check_name}} check didn't have valid data for alarm '{{alarm_name}}' and id '{{alarm_id}}'."""
```
The webhook text will be rendered into something like "Warning: Filesystem usage on mountpoint '/home' reached 70%."
Diagram
```mermaid
graph TD
    A(example.toml) --> B(Main loop)
    B -->|every 60 seconds| C(FilesystemUsage 1: '/home')
    C -->|level '/home': 60%| D(LevelAlarm 1: 70%)
    D -->|cycles: 3, repeat_cycles: 100| E(Action: Webhook 1)
    D -->|recover_cycles: 5| F(Recover action: Webhook 1)
    D -->|error_repeat_cycles: 200| G(Error action: Log error)
    style C fill:green;
    style D fill:red;
    style E fill:blue;
    style F fill:blue;
    style G fill:blue;
```
Some (more exotic) ideas
Just to give some ideas of what's possible:
- Run it locally on your workstation and let it send you notifications to your desktop environment using the Process action and `notify-send` when the filesystem fills up (see the sketch after this list).
- Use the report in combination with the Webhook action and telepush to have it send you "I'm still alive, since {{minmon_uptime}} seconds!" to your Telegram messenger once a week, for peace of mind.
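As a hedged sketch of the first idea, a `Process` action invoking `notify-send` could look roughly like this; note that the `path` and `arguments` field names are assumptions for illustration and may not match the real Process action schema:

```toml
# Hypothetical sketch: "path" and "arguments" are assumed field names
# for the Process action - check the action documentation before use.
[[actions]]
name = "Desktop notification"
type = "Process"
path = "/usr/bin/notify-send"
arguments = ["MinMon", "{{check_name}}: '{{check_id}}' needs attention ({{level}}%)."]
```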
Placeholders
To improve the reusability of actions, it's possible to define custom placeholders for the report, events, checks, alarms and actions. When an action is triggered, the placeholders (generic and custom) are merged into the final placeholder map. Inside the action (depending on its type), the placeholders can be used in one or more config fields using the `{{placeholder_name}}` syntax. There are also some generic placeholders that are always available and some that are specific to the check that triggered the action. Placeholders that don't have a value available when the action is triggered are replaced by an empty string.
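As a sketch of the idea (the `placeholders` key below is an assumption for illustration; the real key name may differ), a custom placeholder defined on a check could be reused in a webhook body like this:

```toml
# Sketch only: "placeholders" is an assumed key name for custom
# placeholders; {{check_name}} is one of the generic placeholders
# mentioned above.
[[checks]]
interval = 60
name = "Filesystem usage"
type = "FilesystemUsage"
mountpoints = ["/home"]
placeholders = {"hostname" = "server-01"}

[[actions]]
name = "Webhook 1"
type = "Webhook"
url = "https://example.com/hook1"
body = """{"text": "{{hostname}}: {{check_name}} alarm '{{alarm_name}}' triggered."}"""
```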
Installation
Docker image
To pull the Docker image, use `docker pull ghcr.io/flo-at/minmon:latest` or the example docker-compose.yml file. In both cases, read-only mount your config file to `/etc/minmon.toml`.
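For example, with plain `docker run` the read-only mount could look like this (the container name and the host path of the config file are arbitrary):

```sh
docker run -d --name minmon \
  -v /path/to/minmon.toml:/etc/minmon.toml:ro \
  ghcr.io/flo-at/minmon:latest
```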
Build and install using cargo
Make sure cargo and OpenSSL are correctly installed on your local machine.
You can either install MinMon from crates.io using `cargo install --all-features minmon` or, if you already checked out the repository, build and install your local copy with `cargo install --all-features --path .`.
Copy the `systemd.minmon.service` file to `/etc/systemd/system/minmon.service` and place your config file at `/etc/minmon.toml`. You can enable and start the service with `systemctl daemon-reload && systemctl enable --now minmon.service`.
If you don't want to include the systemd integration, leave out the `--all-features` option.
Install from the AUR (Arch Linux)
Use your package manager of choice to install the minmon package from the AUR.
Place your config file at `/etc/minmon.toml`. You can enable and start the service with `systemctl daemon-reload && systemctl enable --now minmon.service`.
systemd integration (optional)
- Logging to journal.
- Notify systemd about start-up completion (`Type=notify`).
- Periodically reset systemd watchdog (`WatchdogSec=x`).
lm_sensors integration (optional)
Build with `--features sensors` to enable support for lm_sensors.
For the docker image, optionally mount your lm_sensors config file(s) to `/etc/sensors.d/`.
Note: libsensors is not cooperative and might theoretically block the event loop.
Roadmap
Check ideas
- Filesystem inode usage
- Folder size
- S.M.A.R.T.
- Load
- Ping
- HTTP response, keyword, ..
- HTTPS certificate expiration date
- Docker/Podman container status
General ideas
- Store data/status in time-based database (e.g. rrdtool) and visualize on web interface or ncurses UI. This should be optional and separated from the existing code.
Contributions
Contributions are very welcome! Right now MinMon is pretty basic but it's also super easy to extend. Even if it's just a typo in the documentation, I'll be happy to merge your PR. If you're looking for a new check or action type, just open a new issue (if it doesn't exist yet) and tag it with the "enhancement" label.
If you have an idea for a new check or action, please use the discussions page instead of opening a new issue.