I don't know what the current situation is, but the amount of crapware back in the day was just staggering; every installer offered to "helpfully" install an extra toolbar or virus scanner or whatnot, which is how people ended up with 3 virus scanners and 8 toolbars in IE.
Not in its entirety, but these are some of the more helpful "global" non-default rules I employ on some of my "more restrictive" / "less advertising" / "less tracking" PiHoles DNS resolvers:
easy-rsa is a CLI utility to build and manage a PKI CA. In laymen's terms, this means to create a root certificate authority, and request and sign certificates, including intermediate CAs and certificate revocation lists (CRL).
robots.txt is a standard and has been for many years. It seems that https://www.robotstxt.org is the main repository for its definition.
Any robots.txt I might suggest wouldn't necessarily be appropriate for anyone else. It's a policy that site stakeholders need to decide for themselves I'd say.