How do you keep your blogs from being cached by search engines?
-
I don’t want any of my blogs cached by Google, Yahoo, or any other search engine. I keep hearing about crawling, robots, and meta something or other. I don’t understand any of this.
-
Before we can provide a relevant answer we need to know if your blog is self-hosted or hosted on WordPress.com. Kindly provide a link starting with http://
-
http://longshotsblues.wordpress.com/
And please give detail by detail. Like I stated I’ve read about meta tags, nofollow, crawling, robots. They say put something in Header. What the Header?
I’m no computer wiz with all this technology stuff so please provide a sample of what I need to do step by step.
Even what is under here.
Allowed markup: a blockquote code em strong ul ol li.
You can also put code in between backtick ( ` ) characters.
I have no idea what this means or how it’s done.
Thanks
-
And I meant to say WHAT IS THE HEADER.
Is it my blog’s picture, or would it be the individual post being written?
-
If you do not want your blog/site cataloged by search engines, and you are asking about a wp.com blog, then:
from your dashboard, go to
Settings —> Privacy Settings —> on the site visibility lines,
tick: “I would like my blog to be private, visible only to users I choose”
If you do that, then an issue with your header is irrelevant.
-
Or if you would like people who know about your site to be able to find it (your blog would not be private, so you would not have to invite your readers):
from your dashboard, go to
Settings —> Privacy Settings —> on the site visibility lines,
tick: “I would like to block search engines, but allow normal visitors”
(To tell the truth, I think this is the option you want?)
-
I did this, but I could no longer click on the tags I put on my blogs to bring up other people’s blogs under the same tags. So there has to be another solution.
-
Well, if you want to be a part of the global tags on wordpress, then you must allow search engines from the web, net, interweb, whatever it’s called now…
You can not choose to have your blog available to only wp.com users.
Or am I misunderstanding what you are asking?
-
This is strictly about preventing search engines from CACHING the blogs.
I have seen titles and links on Google, etc. WITHOUT the CACHE link underneath them to click on. I don’t want snapshots taken of my pages. If I delete them for any reason, I’m doing so for a purpose. I do not want these search engines to be able to snapshot the page as it was written and keep it behind a CACHE link that can bring up content I already deleted because I no longer wanted it there to be seen. I realize with the Internet there is no privacy anymore. But this is getting ridiculous. If you delete something, the title and ALL contents should be removed.
But these search engines take snapshots of the page and add a “cache” link so it can be brought up anyway, even if you’ve already deleted the blog and its content.
This should be preventable. But by going to Privacy Settings and setting your blog to block search engines, you also disable your own tags from bringing up other blogs on similar subjects.
Like I said, there has to be another solution to stop this cache process.
-
You can do this in Google Webmaster Tools:
http://googlewebmastercentral.blogspot.com/2007/04/requesting-removal-of-content-from-our.html
-
That will, of course, only work for Google. There’s no solution I know of for Yahoo, Bing, etc. I don’t think Yahoo China ever deletes anything.
It’s not really a WordPress issue though; it’s a search engine issue.
-
Tellyworth. I know about Google Webmaster Tools and the request removal.
That is NOT what I’m asking. I am not talking about how to get a BLOG already posted REMOVED.
I am asking how you can PREVENT one from being cached to BEGIN with.
HOW TO PREVENT ANY SEARCH ENGINE FROM TAKING A SNAPSHOT OF YOUR BLOG OR ANY PAGE YOU WRITE, THEN CACHING IT.
I thank you all, but there seems to be a lack of COMPREHENSION here.
I don’t care if one of my blog posts shows up on a search engine. I just don’t want them to CACHE it. That way, if I want to delete it, IT’S GONE COMPLETELY. No title or contents. Or at least 100% no contents, even if the title remains.
I want to know HOW DO YOU PREVENT A BLOG FROM BEING CACHED, in the first place. I can’t make it any clearer.
-
Your best solution, if you don’t want something cached, is to not post it in the first place.
-
Really not much of an answer, slikbonez.
I think we should have a right to write and own what we write. When a search engine snaps a picture of something they don’t own, then puts it in a cache so it can be brought up as if they wrote and owned it, I think that is bull.
Everyone should have the freedom to write and post, and if at a later time and date they want to delete it, it should then be deleted. Not cached by a search engine so it can be brought up anyway.
So obviously you didn’t know the answer to the question which is fine. But don’t insult me and others like me that feel we should have total control over our own blogs. I asked a very legitimate question. How to prevent something from being cached to begin with. I’ve read lots of things about meta this and that.
I came across this:
http://googlewebmastercentral.blogspot.com/2007/04/requesting-removal-of-content-from-our.html
As a site owner, you control what content of your site is indexed in search engines. The easiest way to let search engines know what content you don’t want indexed is to use a robots.txt file or robots meta tag.
But I don’t understand how to use a robots.txt file or robots meta tag. No matter how many times I’ve read this page, I just don’t understand how to apply these things to a blog.
-
http://googlewebmastercentral.blogspot.com/2007/04/requesting-removal-of-content-from-our.html
As a site owner, you control what content of your site is indexed in search engines. The easiest way to let search engines know what content you don’t want indexed is to use a robots.txt file or robots meta tag.
But I don’t understand how to use robots.txt file or robots meta tag. No matter how many times I’ve read this page or others on this subject, I just don’t understand how to apply these things ( robots.txt file or robots meta tag ) to a blog.
In fact, I would need step-by-step instructions with a sample blog.
Or illustrations, which I guess is the same thing. I’m no computer Tech whiz.
Wish I was.
-
You are aware that you have the following option, “I would like to block search engines, but allow normal visitors”, in your privacy settings?
-
If you read all the comments I think you’ll discover that this was already discussed. And why it is of no use. Please don’t answer back with anything else unless you can literally assist me.
-
@longshotsblues This is not a WP.com issue and should never have been brought up here in the forums in the first place. If you are going to continue to be rude to people, then you will end up finding this thread closed.
As for the robots.txt file, this cannot be done here at WP.com because we as members don’t have access to the underlying back-end files.
-
The HEADER refers to the HEAD section of the underlying HTML code, which we bloggers do not have access to here on WordPress.com blogs. That is relevant for people who have self-hosted blogs. (An HTML page at its most basic consists of a HEAD and BODY.)
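For anyone reading along on a self-hosted blog, here is a rough sketch of what the "put something in the header" advice usually amounts to. The sample page below is made up for illustration, and `RobotsMetaParser` and `robots_directives` are hypothetical helper names, not part of WordPress. The sketch assumes the `noarchive` value of the robots meta tag, which is the directive search engines have documented for asking them not to show a cached copy. It only reads a page and reports which robots directives are in it; it does not change anything on a blog.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collect directives from <meta name="robots" content="..."> tags."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        # Attribute values can be None when written without "=...".
        attr = {key: (value or "") for key, value in attrs}
        if attr.get("name", "").lower() != "robots":
            return
        # The content attribute is a comma-separated list of directives.
        for part in attr.get("content", "").split(","):
            if part.strip():
                self.directives.add(part.strip().lower())


# Made-up page: the robots meta tag sits inside the HEAD, not the BODY.
SAMPLE_PAGE = """<html>
<head>
<title>Sample post</title>
<meta name="robots" content="noarchive">
</head>
<body><p>Post text goes here.</p></body>
</html>"""


def robots_directives(html_text):
    """Return the set of robots meta directives found in html_text."""
    parser = RobotsMetaParser()
    parser.feed(html_text)
    return parser.directives


if __name__ == "__main__":
    print(robots_directives(SAMPLE_PAGE))
```

A page whose HEAD carries that tag is asking search engines not to keep a cached copy, while still allowing the page to be listed. Whether a given search engine honors the request is up to that search engine.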
Tess and slikbonez already provided you with the relevant answer for those of us here on WordPress.com.
from your dashboard, go to
Settings —> Privacy Settings —> on the site visibility lines,
tick: “I would like to block search engines, but allow normal visitors”
This will stop well-behaved search engines from crawling your blog. No crawl means no cache. The fact that this also removes your blog from the WordPress.com global tag pages is the way it works here presently.
And, as you stated at the very beginning, if you don’t understand any of this, then you owe it to yourself to take the time to educate yourself about it.
- http://en.wikipedia.org/wiki/Markup_language
- http://en.wikipedia.org/wiki/HTML
- http://www.w3schools.com/
And I think you haven’t noticed that several people have already answered your question as it relates to WordPress.com blogs. It may not be the answer you want to get, but that seems to be the answer. Cheers!
-
Oh, just to tie up one loose point: the minute you change your PRIVACY SETTINGS, as mentioned several times above, this will automatically change the ROBOTS TEXT (robots.txt) for your WordPress.com blog to tell search engines to leave your blog alone.
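To give a feel for what a robots.txt file says to a crawler, here is a minimal sketch using Python's standard `urllib.robotparser`. The two sample files are assumptions for illustration only; the exact file WordPress.com serves is not shown anywhere in this thread. `may_crawl` is a hypothetical helper name, and the example.com address is a placeholder.

```python
from urllib.robotparser import RobotFileParser

# Assumed sample: asks every crawler to stay away from the whole site.
BLOCK_ALL = """User-agent: *
Disallow: /
"""

# Assumed sample: an empty Disallow line blocks nothing.
ALLOW_ALL = """User-agent: *
Disallow:
"""


def may_crawl(robots_txt, url, user_agent="Googlebot"):
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    page = "http://example.com/2010/01/some-post/"  # placeholder address
    print(may_crawl(BLOCK_ALL, page))
    print(may_crawl(ALLOW_ALL, page))
```

Keep in mind that robots.txt is only a request. Well-behaved search engines follow it, and a crawler that ignores it is not stopped by it.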
- The topic ‘How do you keep your blogs from being cached by search engines?’ is closed to new replies.