Prometheus monitoring server. This role deploys Prometheus, Alertmanager and Blackbox exporter as Docker containers. Docker must already be running; this role does not install or start it.
This role is still in development.
- prometheus_targets: Optional list of dictionaries of targets:
  - group: A list of Ansible groups (optional)
  - hosts: A list of hosts (optional)
  - port: The port to be monitored
  - jobname: The prometheus job-name
  - scrape_interval: Prometheus scrape interval (optional)
  - metrics_path: Path to metrics (optional)
- prometheus_sd_targets: Optional list of dictionaries of additional targets:
  - groupname: The name of the configuration target. It is used to name a configuration file, so it must be unique
  - group: A list of Ansible groups (optional)
  - hosts: A list of hosts (optional)
  - port: The port to be monitored
  - jobname: The prometheus job-name

  This is intended to be an example. In practice these configuration files could be generated dynamically outside this role; Prometheus will automatically reload them.
- prometheus_custom_targets: Optional list of dictionaries of additional targets with custom arguments. Use this if you need to pass custom arguments that aren't supported by the previous two parameters. Each item is a <scrape_config> that will be copied unchanged into the configuration.
- prometheus_port: External Prometheus port, set to 0 to disable, default 9090
- prometheus_alertmanager_port: External Alertmanager port, set to 0 to disable, default 9093
- prometheus_blackboxexporter_port: External Blackbox-exporter port, set to 0 to disable, default 9115
- prometheus_additional_command_args: Additional command line arguments for prometheus
- prometheus_alertmanager_additional_command_args: Additional command line arguments for alertmanager
- prometheus_additional_rules_template: Template with additional alert rules. See https://awesome-prometheus-alerts.grep.to/rules for some ideas but note the labels may be different.
- prometheus_docker_network: Docker network for prometheus Docker applications, default prometheus
- prometheus_docker_user: User ID that prometheus should run as, default is the container default
- prometheus_docker_data_volume: Docker volume or host path for Prometheus data, default is a docker volume called prometheus-data. If this is a host path it must be writeable by prometheus_docker_user.
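As a rough illustration of how the target variables above fit together, here is a minimal sketch of a variables block. All group names, hostnames, job names and port numbers are hypothetical placeholders. The duration format for scrape_interval is assumed to follow Prometheus conventions, and the keys inside the prometheus_custom_targets item are ordinary Prometheus <scrape_config> fields rather than anything defined by this role.

```yaml
# Hypothetical values throughout: replace groups, hosts, jobs and ports with your own.
prometheus_targets:
  - group:
      - webservers              # hypothetical Ansible group
    port: 9100
    jobname: node
    scrape_interval: 30s        # optional; format assumed to be a Prometheus duration
    metrics_path: /metrics      # optional

prometheus_sd_targets:
  - groupname: database-nodes   # names the target file, so it must be unique
    hosts:
      - db1.example.org         # hypothetical host
    port: 9187
    jobname: postgres

prometheus_custom_targets:
  # Each item is a Prometheus <scrape_config>, copied unchanged into the configuration
  - job_name: custom-app
    scheme: https
    static_configs:
      - targets:
          - app.example.org:8443

# Do not publish Alertmanager on the host
prometheus_alertmanager_port: 0
```

The first two variables cover the common case of scraping a port on a set of hosts. prometheus_custom_targets is the escape hatch for anything else, such as the https scheme in this sketch.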
This role sets the following variables which can be used in other tasks:
- prometheus_internal_ip: Internal IP of the Prometheus container
- prometheus_blackboxexporter_internal_ip: Internal IP of the Blackbox exporter container
- prometheus_alertmanager_internal_ip: Internal IP of the AlertManager container
These are intended for use when you don't want to expose the container ports using standard Docker port-forwarding (set the corresponding prometheus_*port variable to 0).
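For instance, a later task on the Docker host could reach Prometheus through its internal address. The sketch below is an assumption-laden illustration, not part of this role: it assumes Prometheus listens on its usual port 9090 inside the container and uses the standard Prometheus /-/healthy endpoint. The task name and the registered variable name are hypothetical.

```yaml
- name: Check that Prometheus answers on its internal address
  ansible.builtin.uri:
    # Port 9090 inside the container is an assumption (the Prometheus default)
    url: "http://{{ prometheus_internal_ip }}:9090/-/healthy"
    status_code: 200
  register: prometheus_health     # hypothetical variable name
  retries: 5
  delay: 5
  until: prometheus_health.status == 200
```

This task must run on the host where the containers live, after the role has set prometheus_internal_ip.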
- hosts: localhost
  roles:
    - role: ome.docker
    - role: ome.prometheus
Change prometheus_alertmanager_slack_webhook in playbook.yml to a real Slack webhook. Run:
molecule test --destroy never
molecule login
docker stop fake-metrics
Wait a few minutes; a Slack alert should be generated.
If this fails, try creating a manual alert:
curl -H "Content-Type: application/json" -d '[{"labels":{"alertname":"TestAlert1"}}]' localhost:9093/api/v1/alerts
The molecule test also includes a disk space alert configuration.
To test this, fill up at least 90% of the /run partition:
molecule login
dd if=/dev/zero of=/run/fill.space bs=1M count=...
Wait a few minutes and you should see a disk space warning.
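For orientation, a disk space rule of this kind might look roughly like the sketch below. This is not the rule shipped with the molecule test, which may differ. It assumes the filesystem metrics come from node_exporter (node_filesystem_avail_bytes and node_filesystem_size_bytes), and the group name, alert name and severity label are hypothetical.

```yaml
groups:
  - name: example-disk-space            # hypothetical group name
    rules:
      - alert: RunPartitionAlmostFull   # hypothetical alert name
        # Percentage of /run in use, computed from assumed node_exporter metrics
        expr: (1 - node_filesystem_avail_bytes{mountpoint="/run"} / node_filesystem_size_bytes{mountpoint="/run"}) * 100 > 90
        for: 2m                         # condition must hold for 2 minutes before firing
        labels:
          severity: warning
        annotations:
          summary: "/run is more than 90% full"
```

A rule file in this shape could be supplied through prometheus_additional_rules_template. If the template is rendered by Ansible, any Prometheus annotation templating would need escaping so it is not evaluated as Jinja2.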