-
assigned issue to
Monitoring
Issue #25
new
- Every service should respond on /__ping
- The services should export some graphed values and possibly alert the orgas
- Weathermap of the network?
- Machine monitoring (https://github.com/prometheus/node_exporter), web interface (https://github.com/prometheus/promdash) and aggregation of data (https://github.com/prometheus/prometheus).
Comments (6)
-
reporter -
- edited description
-
assigned issue to
-
-
assigned issue to
- edited description
-
assigned issue to
-
For django services: https://github.com/korfuri/django-prometheus
-
Work started at:
-
Work done:
- Services export metrics to be collected by prometheus.
python-lib/prologin/concours/stechec/monitoring.py python-lib/prologin/hfs/monitoring.py python-lib/prologin/masternode/monitoring.py python-lib/prologin/presencesync/monitoring.py python-lib/prologin/rpc/monitoring.py python-lib/prologin/workernode/monitoring.py
- The target url are documented in
etc/prometheus/prometheus.conf
. - Documented in
docs/source/monitoring.rst
Work remaining:
- Update
docs/source/monitoring.rst
to use grafana instead of promdash. promdash is deprecated. - Document alertmanager
- Add alerts
- Log in to comment