mastodontech.de is one of many independent Mastodon servers you can use to take part in the Fediverse.
Open to everyone (over 16) and provided by Markus'Blog

#prometheus

Questions for #fediverse #SysAdmin folk who run #prometheus and #grafana on multiple #aws accounts:

Do you use a single Prometheus server or do you have one for each account?

How do you handle auto scaling EC2 servers which could change IPs at any time?

Is it possible for servers to push rather than get pulled from?

Please share any setups that could be relevant and boost far & wide!

Thanks in advance!! ❤️❤️❤️

Got the okay to GPL upsite, the #prometheus exporter I'm currently writing at work. :3

It watches a set of domains and reports time till certificate expiration, HTTP status and request time as well as some errors like certificate name mismatches and DNS resolution failures.

It supports IPv4 and v6 and does a bunch of multiprocessing to bring execution time down.

It will also (soon) support loading a list of domains to watch from an external URL.

More on this probably later this week. :)
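The kind of check described above can be sketched in a few lines of Python. This is a minimal illustration, not the upsite code: the function names (`probe`, `probe_all`, `render_metrics`), the metric names (`tls_cert_expiry_seconds`, `probe_success`) and the error labels are all hypothetical, and a thread pool stands in for the multiprocessing the post mentions. HTTP status and request time are left out to keep it short.

```python
import socket
import ssl
import time
from concurrent.futures import ThreadPoolExecutor


def seconds_until_expiry(not_after: str, now: float) -> float:
    """Convert a certificate 'notAfter' string into seconds remaining."""
    return ssl.cert_time_to_seconds(not_after) - now


def probe(domain: str, timeout: float = 5.0) -> dict:
    """Open a TLS connection and record expiry time or a coarse error label."""
    result = {"domain": domain, "expiry_seconds": None, "error": ""}
    try:
        ctx = ssl.create_default_context()
        # create_connection tries every address returned by DNS (IPv4 and IPv6).
        with socket.create_connection((domain, 443), timeout=timeout) as sock:
            with ctx.wrap_socket(sock, server_hostname=domain) as tls:
                cert = tls.getpeercert()
        result["expiry_seconds"] = seconds_until_expiry(cert["notAfter"], time.time())
    except socket.gaierror:
        result["error"] = "dns_resolution_failed"
    except ssl.SSLCertVerificationError:
        # Covers name mismatches as well as expired or untrusted certificates.
        result["error"] = "certificate_invalid"
    except OSError:
        result["error"] = "connection_failed"
    return result


def probe_all(domains: list, workers: int = 8) -> list:
    """Probe domains concurrently (threads here; an assumption, not upsite's design)."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(probe, domains))


def render_metrics(results: list) -> str:
    """Render results in the Prometheus text exposition format."""
    lines = []
    for r in results:
        if r["expiry_seconds"] is not None:
            lines.append('tls_cert_expiry_seconds{domain="%s"} %d'
                         % (r["domain"], r["expiry_seconds"]))
        lines.append('probe_success{domain="%s"} %d'
                     % (r["domain"], 0 if r["error"] else 1))
    return "\n".join(lines) + "\n"
```

Calling `print(render_metrics(probe_all(["example.org"])))` would emit one or two sample lines per domain; serving that text over HTTP is what would make it scrapeable by Prometheus.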

"We’re writing to remind you that as of October 22, 2024, Monitoring Query Language (MQL) is no longer a recommended query language for Cloud Monitoring. We made this change to focus on PromQL, the open-source standard for time series querying, which offers similar functionality, a wider user base, and more community resources."

Modern #technoBureaucracy, heir to the Promethean myth of endless growth, demands a triple decolonization: 1) a post-Marxist decolonization of the promises of the #technicalSystem (#Arendt, #Mignolo, #Escobar, #Mumford), 2) a material and immaterial decolonization of #center/#periphery relationships (#Amin, #Cusicanqui, #Ferdinand), 3) a decolonization of nature itself, listening to #subaltern voices and other epistemologies (#Spivak, #Harding). Decolonization also means de-anthropocentrizing the world, beyond #Prometheus.

Is your company looking for a keen self-hoster with plenty of #Linux experience? I grew up with #RaspberryPi and have picked up many skills along the way including #React, backend JavaScript (#NodeJS) and #Docker. My current obsession is monitoring all the things with #Grafana, #PRTG and #Prometheus. I’m based in the UK but open to primarily English-speaking roles in Germany, too. Currently wrapping up my Advanced Software Development degree but eager to continue learning! Boosts appreciated :D

In Greek mythology, when the gods were deciding which parts of an animal would be the divine sacrifice, Prometheus tricked Zeus into choosing the bones wrapped in fat rather than the meat. Thus, humans were able to eat the best parts of the meat themselves instead of offering them up. This enraged Zeus.
🎨 Heinrich Friedrich Füger

I've been disappointed about this for at least the last decade, but if you feel that the polling-based designs of Kubernetes and Prometheus are "wrong", here's some science:
arxiv.org/abs/2507.02158

arXiv.org
Signalling Health for Improved Kubernetes Microservice Availability
Microservices are often deployed and managed by a container orchestrator that can detect and fix failures to maintain the service availability critical in many applications. In Poll-based Container Monitoring (PCM), the orchestrator periodically checks container health. While a common approach, PCM requires careful tuning, may degrade service availability, and can be slow to detect container health changes. An alternative is Signal-based Container Monitoring (SCM), where the container signals the orchestrator when its status changes. We present the design, implementation, and evaluation of an SCM approach for Kubernetes and empirically show that it has benefits over PCM, as predicted by a new mathematical model. We compare the service availability of SCM and PCM over six experiments using the SockShop benchmark. SCM does not require that polling intervals are tuned, and yet detects container failure 86% faster than PCM and container readiness in a comparable time with limited resource overheads. We find PCM can erroneously detect failures, and this reduces service availability by 4%. We propose that orchestrators offer SCM features for faster failure detection than PCM without erroneous detections or careful tuning.
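
To see why polling intervals need tuning, here is a toy sketch of poll-based detection delay. It is not the paper's mathematical model: it assumes probes fire at fixed multiples of `period`, that a failure is declared after `failure_threshold` consecutive failed probes (loosely analogous to Kubernetes' `periodSeconds` and `failureThreshold` probe settings), and that probes themselves take no time. All function names are hypothetical.

```python
import math
import random


def poll_detection_delay(failure_time: float, period: float,
                         failure_threshold: int = 1) -> float:
    """Delay from failure to detection when probes fire at 0, period, 2*period, ..."""
    # First probe at or after the failure is the first one that can observe it.
    first_probe = math.ceil(failure_time / period) * period
    # Each additional required failed probe adds one full period.
    return first_probe - failure_time + (failure_threshold - 1) * period


def mean_poll_delay(period: float, failure_threshold: int = 1,
                    trials: int = 100_000, seed: int = 0) -> float:
    """Monte Carlo estimate with failures uniformly distributed within one period."""
    rng = random.Random(seed)
    total = sum(
        poll_detection_delay(rng.uniform(0, period), period, failure_threshold)
        for _ in range(trials)
    )
    return total / trials
```

Under these assumptions the mean delay comes out near `period / 2 + (failure_threshold - 1) * period`, so shortening the period speeds up detection at the cost of more probes, while a signal-based design would only pay the latency of delivering the signal.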