Disallow Internet Archive caching
-
There is a service on the internet called The Wayback Machine (http://www.archive.org/web/web.php) that shows how pages used to look. I would like to know if it is possible to block it from a WordPress.com blog, since no robots.txt or custom meta tags are allowed.
-
My guess would be “no”. I’d think that would require FTP access to the .html files to edit the spider options (nofollow etc.) in the metadata part of the file. WordPress.com is a multiuser host, and users have restrictions on them. One of those is that we can’t edit the .html files; another is that it strips code out when you try to add JavaScript etc., for security purposes.
I could be wrong, but that’s my guess. You may have to download the WordPress program from http://www.wordpress.org and install it on a paid host to get FTP access to the .html files.
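For what it’s worth, here is a rough sketch of what a robots.txt rule could look like on a self-hosted install, checked with Python’s standard urllib.robotparser. The assumptions are mine, not from the archive’s docs quoted in this thread: that the archive’s crawler identifies itself as "ia_archiver" and honors robots.txt, and the example.com URL and the is_allowed helper are made up for illustration.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt for a self-hosted blog.
# Assumption: the Internet Archive's crawler identifies itself as
# "ia_archiver" and respects these rules.
ROBOTS_TXT = """\
User-agent: ia_archiver
Disallow: /

User-agent: *
Disallow:
"""


def is_allowed(user_agent: str, url: str) -> bool:
    """Return True if the robots.txt above lets user_agent fetch url."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    url = "http://example.com/2007/01/some-post/"  # placeholder URL
    print(is_allowed("ia_archiver", url))  # False: blocked by "Disallow: /"
    print(is_allowed("Googlebot", url))    # True: falls under "User-agent: *"
```

This only shows how the rules are read. Whether a given crawler actually obeys them is up to that crawler.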
-
Dashboard -> Options -> Privacy -> Disallow search engines.
You can also request that sites that you control be removed from their servers. Email them at info ( at ) archive ( dot ) org. You can read details about this here:
http://www.archive.org/about/terms.php
Hope this helps,
-drmike -
-
-
Dr Mike’s “email them” workaround only covers the Wayback Machine; it doesn’t block your blog from other search engines. So do be warned that each of those search engines will keep a cache of your blog. Why are you particularly adamant that the Wayback Machine alone not have a copy?
-
-
All search engines will cache your blog if you allow them to look at your site. My last blog went down and I am still finding old entries on Yahoo over a year later.
-
I don’t like the idea that my site is cached into some kind of history database.
Then you’re going to have to block the search engines as all of them do caching.