Quotidien - January 14, 2022 - grolimur // shaarli

Quotidien Shaarli

Tous les liens d'un jour sur une page.

January 14, 2022

Portail de l'Union européenne agrégeant des contributions scientifiques de diverses natures (publications, données, logiciels, autres) et de diverses sources

science publication recherche

A note on unsupported rules in robots.txt | Google Search Central Blog | Google Developers

La justification de Google pour ne pas respecter les robots.txt.

Ce qu'il faut faire maintenant:

noindex in robots meta tags: Supported both in the HTTP response headers and in HTML, the noindex directive is the most effective way to remove URLs from the index when crawling is allowed.

Ce qui donne ça:
<meta name="robots" content="noindex"

À tester pour voir si ça fonctionne vraiment.

web moteur recherche