mastodontech.de ist einer von vielen unabhängigen Mastodon-Servern, mit dem du dich im Fediverse beteiligen kannst.
Offen für alle (über 16) und bereitgestellt von Markus'Blog

Serverstatistik:

1,4 Tsd.
aktive Profile

#crawler

1 Beitrag1 Beteiligte*r0 Beiträge heute
Tomas Norre :verified:<p>Today 14th of Juli, It's 10 Years since I did my first <a href="https://phpc.social/tags/TYPO3" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TYPO3</span></a> <a href="https://phpc.social/tags/Crawler" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Crawler</span></a> contribution, and now I'm the maintainer of it.</p><p><a href="https://blog.tomasnorre.dk/blog/typo3-crawler-10years/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">blog.tomasnorre.dk/blog/typo3-</span><span class="invisible">crawler-10years/</span></a></p><p><a href="https://phpc.social/tags/HappyCrawling" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>HappyCrawling</span></a> <a href="https://phpc.social/tags/OpenSource" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>OpenSource</span></a></p>
MisterOpenData<p>Ein Diagramm zu den Zugriffen auf das Open-Data-Portal Schleswig-Holstein. Es wird versucht, Bots zu erkennen. Ab etwa Oktober 2023 nimmt die Belastung durch aggressives KI-Training zu. </p><p><a href="https://norden.social/tags/opendata" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>opendata</span></a> <a href="https://norden.social/tags/schleswigholstein" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>schleswigholstein</span></a> <a href="https://norden.social/tags/web" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>web</span></a> <a href="https://norden.social/tags/crawler" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>crawler</span></a></p>
@francks<p>Our small team vs millions of bots</p><p><a href="https://www.fsf.org/blogs/sysadmin/our-small-team-vs-millions-of-bots" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">fsf.org/blogs/sysadmin/our-sma</span><span class="invisible">ll-team-vs-millions-of-bots</span></a></p><p><a href="https://mstdn.fr/tags/fsf" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>fsf</span></a> <a href="https://mstdn.fr/tags/freesoftware" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>freesoftware</span></a> <a href="https://mstdn.fr/tags/ddos" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ddos</span></a> <a href="https://mstdn.fr/tags/javascrip" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>javascrip</span></a> <a href="https://mstdn.fr/tags/anubis" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>anubis</span></a> <a href="https://mstdn.fr/tags/botnet" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>botnet</span></a> <a href="https://mstdn.fr/tags/llm" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>llm</span></a> <a href="https://mstdn.fr/tags/scraper" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>scraper</span></a> <a href="https://mstdn.fr/tags/crawler" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>crawler</span></a> <a href="https://mstdn.fr/tags/proprietarysoftware" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>proprietarysoftware</span></a> <a href="https://mstdn.fr/tags/malware" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>malware</span></a></p>
Arie van Deursen<p>“The problem is whether you create content to sell ads, sell subscriptions, or just to know that people value what you've created, an AI-driven web doesn't reward content creators the way that the old search-driven web did. And that means the deal that Google made to take content in exchange for sending you traffic just doesn't make sense anymore.”</p><p>— Matthew Prince, Cloudflare</p><p><a href="https://blog.cloudflare.com/content-independence-day-no-ai-crawl-without-compensation/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">blog.cloudflare.com/content-in</span><span class="invisible">dependence-day-no-ai-crawl-without-compensation/</span></a></p><p><a href="https://mastodon.acm.org/tags/cloudflare" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>cloudflare</span></a> <a href="https://mastodon.acm.org/tags/contentcreators" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>contentcreators</span></a> <a href="https://mastodon.acm.org/tags/crawler" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>crawler</span></a> <a href="https://mastodon.acm.org/tags/ai" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ai</span></a> <a href="https://mastodon.acm.org/tags/search" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>search</span></a></p>
Erik C. Thauvin<p>Cloudflare Introduces pay per crawl: enabling content owners to charge AI crawlers for access</p><p><a href="https://mastodon.social/tags/ai" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ai</span></a> <a href="https://mastodon.social/tags/cloudflare" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>cloudflare</span></a> <a href="https://mastodon.social/tags/crawler" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>crawler</span></a></p><p><a href="https://blog.cloudflare.com/introducing-pay-per-crawl/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">blog.cloudflare.com/introducin</span><span class="invisible">g-pay-per-crawl/</span></a></p>
𝕂𝚞𝚋𝚒𝚔ℙ𝚒𝚡𝚎𝚕<p>»Pay up or stop scraping – Cloudflare program charges bots for each crawl:<br>Cloudflare now beta testing pay-per-crawl feature to stop endless AI scraping.<br>Cloudflare is now experimenting with tools that will allow content creators to charge a fee to AI crawlers to scrape their websites.«</p><p>This is certainly a good idea, but on the other hand, the competition is trying to eliminate each other. I'm curious… 🍿😎</p><p><a href="https://arstechnica.com/tech-policy/2025/07/pay-up-or-stop-scraping-cloudflare-program-charges-bots-for-each-crawl/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">arstechnica.com/tech-policy/20</span><span class="invisible">25/07/pay-up-or-stop-scraping-cloudflare-program-charges-bots-for-each-crawl/</span></a></p><p><a href="https://chaos.social/tags/cloudflare" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>cloudflare</span></a> <a href="https://chaos.social/tags/ai" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ai</span></a> <a href="https://chaos.social/tags/payorstop" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>payorstop</span></a> <a href="https://chaos.social/tags/crawler" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>crawler</span></a> <a href="https://chaos.social/tags/web" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>web</span></a> <a href="https://chaos.social/tags/website" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>website</span></a> <a href="https://chaos.social/tags/bots" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>bots</span></a> <a href="https://chaos.social/tags/aibots" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>aibots</span></a></p>
DJM (freelance for hire)<p>Ah aha ah... sorry... Ah ah ah ah</p><p>A proposal to standardise on using an /llms.txt file to provide information to help LLMs use a website at inference time. <br><a href="https://llmstxt.org/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">llmstxt.org/</span><span class="invisible"></span></a></p><p>Remember that you can block their robots from crawling your site/blog:<br><a href="https://www.didiermary.fr/bloquer-ai-bots-chatgpt-openai/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">didiermary.fr/bloquer-ai-bots-</span><span class="invisible">chatgpt-openai/</span></a></p><p><a href="https://masto.ai/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://masto.ai/tags/IA" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>IA</span></a> <a href="https://masto.ai/tags/AGI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AGI</span></a> <a href="https://masto.ai/tags/LLM" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>LLM</span></a> <a href="https://masto.ai/tags/Robot" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Robot</span></a> <a href="https://masto.ai/tags/Crawler" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Crawler</span></a></p>
teufelswerk<p>Der aus Deutschland (Karlsruhe) stammende B2B Databroker "dealfront.com" brüstet sich mit Aussagen wie "Die Daten, die wir in Dealfront bereitstellen, sind DSGVO-konform und entsprechen den höchsten Datenschutz- und Privatsphäre-Standards." Da dieser Anbieter ohne mein Wissen und meine Einwilligung u.a. meine personenbezogen, aber auch unsere Unternehmensdaten an Spammer verkauft, gecrawlte Daten anreichert und aus diversen Quellen KI-gestützt zusammenführt, stehe ich diesem Anbieter mehr als sehr skeptisch 🖕 gegenüber. Ich werde ihm jetzt auf den Zahn fühlen und euch darüber berichten.</p><p><a href="https://social.tchncs.de/tags/dealfront" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>dealfront</span></a> <a href="https://social.tchncs.de/tags/databroker" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>databroker</span></a> <a href="https://social.tchncs.de/tags/b2b" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>b2b</span></a> <a href="https://social.tchncs.de/tags/ki" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ki</span></a> <a href="https://social.tchncs.de/tags/crawler" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>crawler</span></a> <a href="https://social.tchncs.de/tags/profiling" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>profiling</span></a> <a href="https://social.tchncs.de/tags/dsgvo" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>dsgvo</span></a> <a href="https://social.tchncs.de/tags/datenschutz" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>datenschutz</span></a> <a href="https://social.tchncs.de/tags/spam" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>spam</span></a></p>
Kevin Karhan :verified:<p><span class="h-card" translate="no"><a href="https://mastodon.pnpde.social/@spielleitung" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>spielleitung</span></a></span> also ich mach's mir professionell einfacher und blockiere einfach alle bekannten <a href="https://infosec.space/tags/Scraper" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Scraper</span></a>...</p><ul><li>Wenn ihr weitere Ranges an <a href="https://infosec.space/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://infosec.space/tags/Crawler" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Crawler</span></a>|n kennt: Immer her damit! </li></ul><p><a href="https://github.com/greyhat-academy/lists.d/blob/main/scrapers.ipv4.block.list.tsv" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">github.com/greyhat-academy/lis</span><span class="invisible">ts.d/blob/main/scrapers.ipv4.block.list.tsv</span></a></p>
Hacker News<p>Java Virtual Threads Ate My Memory: A Web Crawler's Tale of Speed vs. Memory</p><p><a href="https://dariobalinzo.medium.com/virtual-threads-ate-my-memory-a-web-crawlers-tale-of-speed-vs-memory-a92fc75085f6" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">dariobalinzo.medium.com/virtua</span><span class="invisible">l-threads-ate-my-memory-a-web-crawlers-tale-of-speed-vs-memory-a92fc75085f6</span></a></p><p><a href="https://mastodon.social/tags/HackerNews" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>HackerNews</span></a> <a href="https://mastodon.social/tags/Java" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Java</span></a> <a href="https://mastodon.social/tags/Virtual" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Virtual</span></a> <a href="https://mastodon.social/tags/Threads" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Threads</span></a> <a href="https://mastodon.social/tags/Web" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Web</span></a> <a href="https://mastodon.social/tags/Crawler" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Crawler</span></a> <a href="https://mastodon.social/tags/Memory" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Memory</span></a> <a href="https://mastodon.social/tags/Management" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Management</span></a> <a href="https://mastodon.social/tags/Speed" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Speed</span></a> <a href="https://mastodon.social/tags/Optimization" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Optimization</span></a></p>
Camelia :tranarchy_a_nonbinary: 🇵🇸<p>Well, that's it. I've officially started using <a href="https://tech.lgbt/tags/Anubis" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Anubis</span></a> to protect my self-hosted <a href="https://tech.lgbt/tags/Forgejo" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Forgejo</span></a> instance.</p><p>I didn't want to do it at first, but my nginx and fail2ban configurations weren't efficient enough.</p><p>Down with LLMs!</p><p><a href="https://tech.lgbt/tags/LLM" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>LLM</span></a> <a href="https://tech.lgbt/tags/crawler" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>crawler</span></a></p>
Tomas Norre :verified:<p>Just released version 12.0.8 of the <a href="https://phpc.social/tags/TYPO3" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TYPO3</span></a> <a href="https://phpc.social/tags/Crawler" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Crawler</span></a>, with a number of fixes.</p><p>Thanks for all contributions. <a href="https://phpc.social/tags/HappyCrawling" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>HappyCrawling</span></a></p><p><a href="https://github.com/tomasnorre/crawler/releases/tag/12.0.8" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">github.com/tomasnorre/crawler/</span><span class="invisible">releases/tag/12.0.8</span></a></p>
noisio<p>How do you deal with web crawlers and bots on your personal websites?<br>There is a lot of automated traffic I can detect at my own. Some of yours have this short living captchas, that don't even need to be filled out and disappear in just a second.<br>While starting investigating in this topic, I can find as many info about anti-webcrawlers as anti-anti webcrawlers and get lost very soon.</p><p>What are solutions you have found for this?</p><p><a href="https://chaos.social/tags/website" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>website</span></a> <a href="https://chaos.social/tags/crawler" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>crawler</span></a> <a href="https://chaos.social/tags/bot" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>bot</span></a> <a href="https://chaos.social/tags/defense" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>defense</span></a></p>
Leszek<p><span class="h-card" translate="no"><a href="https://mastodon.social/@tdp_org" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>tdp_org</span></a></span> unbelievable! I've set up <a href="https://genomic.social/tags/nepenthes" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>nepenthes</span></a> tarpit in my personal blog and reached over 1 million requests from a <a href="https://genomic.social/tags/amazon" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>amazon</span></a> <a href="https://genomic.social/tags/crawler" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>crawler</span></a> alone in less than 3 months! Other bots typically gave up after crawling tens of thousands of pages of bulshit. Naturally, my robots.txt informs not to crawl the tarpit ...</p>
Larvitz :fedora: :redhat:<p>Anubis - Weigh the soul of incoming HTTP requests using proof-of-work to stop AI crawlers (<a href="https://anubis.techaro.lol" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">anubis.techaro.lol</span><span class="invisible"></span></a>)</p><p>I give that a try. Maybe it can reduce the AI crawler mess a little bit on my servers.</p><p><a href="https://burningboard.net/tags/ai" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ai</span></a> <a href="https://burningboard.net/tags/crawler" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>crawler</span></a> <a href="https://burningboard.net/tags/aicrawler" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>aicrawler</span></a> <a href="https://burningboard.net/tags/fckai" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>fckai</span></a> <a href="https://burningboard.net/tags/anibus" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>anibus</span></a></p>
Orhun Parmaksız 👾<p>Is this the future of terminal gaming? 🤯</p><p>⚔️ **ratthew** — A 3D dungeon crawler in the terminal.</p><p>🦀 Written in Rust!</p><p>🏗️ Built with <span class="h-card" translate="no"><a href="https://fosstodon.org/@ratatui_rs" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>ratatui_rs</span></a></span> + <span class="h-card" translate="no"><a href="https://mastodon.social/@bevy" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>bevy</span></a></span> </p><p>⭐ GitHub: <a href="https://github.com/cxreiff/ratthew" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">github.com/cxreiff/ratthew</span><span class="invisible"></span></a> (WIP)</p><p><a href="https://fosstodon.org/tags/rustlang" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>rustlang</span></a> <a href="https://fosstodon.org/tags/ratatui" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ratatui</span></a> <a href="https://fosstodon.org/tags/tui" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>tui</span></a> <a href="https://fosstodon.org/tags/terminal" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>terminal</span></a> <a href="https://fosstodon.org/tags/gaming" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>gaming</span></a> <a href="https://fosstodon.org/tags/3d" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>3d</span></a> <a href="https://fosstodon.org/tags/bevy" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>bevy</span></a> <a href="https://fosstodon.org/tags/commandline" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>commandline</span></a> <a href="https://fosstodon.org/tags/dungeon" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>dungeon</span></a> <a href="https://fosstodon.org/tags/crawler" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>crawler</span></a></p>
Yannick Paulsen<p>Der Druck auf öffentliche und gemeinnützige Infrastrukturen steigt. Ob nun Open-Access-Repositorien oder wie hier im Text <a href="https://openbiblio.social/tags/Wikimedia" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Wikimedia</span></a>: <a href="https://diff.wikimedia.org/2025/04/01/how-crawlers-impact-the-operations-of-the-wikimedia-projects/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">diff.wikimedia.org/2025/04/01/</span><span class="invisible">how-crawlers-impact-the-operations-of-the-wikimedia-projects/</span></a></p><p>Es ist gut, dass die Inhalte in das Training von Künstlicher Intelligenz einbezogen werden, aber bei der unverhältnismäßigen Belastung als Ergebnis von kommerziellen Interessen überlege ich, ob es nicht eigentlich einen Ausgleich braucht.</p><p><a href="https://openbiblio.social/tags/Crawler" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Crawler</span></a> <a href="https://openbiblio.social/tags/KI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>KI</span></a> <a href="https://openbiblio.social/tags/OpenAccess" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>OpenAccess</span></a></p>
Michiel Scholten<p>Why I am sort of afraid to share my projects now, and link to my own Git server <a href="https://dammit.nl/afraid-to-git.html" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">dammit.nl/afraid-to-git.html</span><span class="invisible"></span></a> <a href="https://mastodon.social/tags/git" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>git</span></a> <a href="https://mastodon.social/tags/github" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>github</span></a> <a href="https://mastodon.social/tags/llm" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>llm</span></a> <a href="https://mastodon.social/tags/ai" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ai</span></a> <a href="https://mastodon.social/tags/crawler" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>crawler</span></a></p>
LinuxNews.de<p>Nachdem diverse <a href="https://social.anoxinon.de/tags/ki" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ki</span></a> <a href="https://social.anoxinon.de/tags/ai" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ai</span></a> <a href="https://social.anoxinon.de/tags/crawler" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>crawler</span></a> besonders respektvoll mit den öffentlichen Ressourcen von Open Source Projekten umgehen, habe ich mich dazu entschlossen eben diese auszusperren. Wir hatten in der Vergangenheit crawls, die im <a href="https://social.anoxinon.de/tags/monitoring" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>monitoring</span></a> als <a href="https://social.anoxinon.de/tags/ddos" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ddos</span></a> gewertet wurden. </p><p>Diverse AS erfreuen sich nun einem dauerhaften 429, einige wenige die es für alle kaputt machen…</p>
Alex<p>Sad to see <a href="https://mastodon.org.uk/tags/lore" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>lore</span></a> had to check I wasn't a bot to get through to the archive. I guess these are the <a href="https://mastodon.org.uk/tags/crawler" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>crawler</span></a> ridden times we live in now.</p>