• Plans & Pricing
  • Log in
  • Get started
  • WordPress Hosting
  • WordPress for Agencies
  • Become an Affiliate
  • Domain Names
  • AI Website Builder
  • Website Builder
  • Create a Blog
  • Newsletter
  • Professional Email
  • Website Design Services
  • Commerce
  • WordPress Studio
  • Enterprise WordPress 
  • Overview
  • WordPress Themes
  • WordPress Plugins
  • WordPress Patterns
  • Google Apps
  • Support Center
  • WordPress News
  • Business Name Generator
  • Logo Maker
  • Discover New Posts
  • Popular Tags
  • Blog Search
Get started
  • Sign up
  • Log in
About
  • Plans & Pricing
Products
  • WordPress Hosting
  • WordPress for Agencies
  • Become an Affiliate
  • Domain Names
  • AI Website Builder
  • Website Builder
  • Create a Blog
  • Newsletter
  • Professional Email
  • Website Design Services
  • Commerce
  • WordPress Studio
  • Enterprise WordPress  
Features
  • Overview
  • WordPress Themes
  • WordPress Plugins
  • WordPress Patterns
  • Google Apps
Resources
  • Support Center
  • WordPress News
  • Business Name Generator
  • Logo Maker
  • Discover New Posts
  • Popular Tags
  • Blog Search
Jetpack App
  • Learn more
  • Support Center
  • Guides
  • Courses
  • Forums
  • Contact
Search
  • Support Center
  • Guides
  • Courses
  • Forums
  • Contact
Forums / Search engine related features

Search engine related features

  • Unknown's avatar
    derekblog · Member · Jun 4, 2007 at 12:21 pm
    • Copy link Copy link
    • Add topic to favorites Add topic to favorites

    Some ideas for controlling search engines:

    Deleted blogs’ robots.txt
    All deleted blogs should have their robots.txt set to Disallow: /

    Wp-login index
    Stop wp-login.php from being indexed by adding <meta name=”robots” content=”noindex” />

    Disable archiving
    An option to disable archiving by search engines by adding <meta name=”robots” content=”noarchive” /> and Disallow the ia_archiver robot.

    Disable pings
    An option to disable pings to ping servers (such as weblogs.com.)

    Disable feeds
    An option to disable feeds should help to keep out some bots, (although it’s quite a sacrifice.)

  • Unknown's avatar
    nosysnoop · Member · Jun 4, 2007 at 1:48 pm
    • Copy link Copy link

    Controlling search engines for a better pagerank?

  • Unknown's avatar
    derekblog · Member · Jun 4, 2007 at 2:57 pm
    • Copy link Copy link

    No, it won’t boost your Google PageRank. It’s about protecting privacy.

  • Unknown's avatar
    timethief · Member · Jun 4, 2007 at 4:33 pm
    • Copy link Copy link

    @derekblog
    Please don’t forget to send staff a feedback including these ideas in it.

  • Unknown's avatar
    drmike · Member · Jun 4, 2007 at 6:06 pm
    • Copy link Copy link

    Um, we already have this feature. Dashboard -> Options -> privacy -> Don’t allow search engines. Turns off all of that.

  • Unknown's avatar
    timethief · Member · Jun 4, 2007 at 6:50 pm
    • Copy link Copy link

    Thanks drmike :)

  • Unknown's avatar
    derekblog · Member · Jun 4, 2007 at 7:00 pm
    • Copy link Copy link

    @ drmike, people forget to disallow search engines before they delete a blog.
    @ timethief, ok, I’ve send a feedback, thanks.

  • Unknown's avatar
    drmike · Member · Jun 4, 2007 at 7:09 pm
    • Copy link Copy link

    Actually I believe the header for a deleted blog page is a 404 report that the search engines should be picking up on.

    Would it really matter though? Once the blog’s content is gone and the same message is repeated over and over again to teh spiders, the site would be dropped fairly quickly.

  • Unknown's avatar
    derekblog · Member · Jun 4, 2007 at 7:36 pm
    • Copy link Copy link

    The internet archive doesn’t drop it, unless the robots.txt is set to disallow.

  • Unknown's avatar
    drmike · Member · Jun 4, 2007 at 7:47 pm
    • Copy link Copy link

    If IA isn’t obeying ‘noindex,nofollow,’ that’s an issue you may want to bring up with them. When you choose the “Do not let search engines in” option, the following is placed within the header:

    <meta name='robots' content='noindex,nofollow' />

    You can also opt out via email at info at archive dot org.

    You can also set your blog to be private with the third option on that page. Gotta admit that even with the privacy setting set to 2 or 3, some search engines will index your site. That’ll happen even with a robots.txt file. If you want privacy, that’s probably going to be your bestoption. Either that or finding a host, installing the software yourself, and password protecting the directory teh blog sits in.

  • Unknown's avatar
    derekblog · Member · Jun 4, 2007 at 7:52 pm
    • Copy link Copy link

    IA obeys those tags. The problem is when you delete a blog, IA doesn’t remove it.

  • Unknown's avatar
    derekblog · Member · Jun 24, 2007 at 1:50 pm
    • Copy link Copy link

    An easy way to solve problem 1 and 3 is to disallow ia_archiver from wordpress.com.

  • Unknown's avatar
    timethief · Member · Jun 24, 2007 at 3:05 pm
    • Copy link Copy link

    Thanks for contributing.

  • Unknown's avatar
    drmike · Member · Jun 24, 2007 at 3:57 pm
    • Copy link Copy link

    There’s not contributing if they’re posting here in the forums. They need to send this in via feedback on Monday.

    Again, if IA isn’t obeying ‘noindex,nofollow’ then teh issue is with them. If they’re not obeying internet standards, then they are the cause of their own issue.

  • Unknown's avatar
    timethief · Member · Jun 24, 2007 at 4:00 pm
    • Copy link Copy link

    Sorry — I assumed that the blogger would follow the instructions I gave him in the third post above and send in a feedback to staff.

  • Unknown's avatar
    derekblog · Member · Jun 24, 2007 at 5:24 pm
    • Copy link Copy link

    @ drmike, IA copies everything from everywhere and keeps every copy forever and publicly. They do obey meta-tags, but the meta-tags are not retroactive in IA. So the robots.txt is the only option here.
    @ timethief, I’ll send in an extra feedback to staff tomorrow.

  • Unknown's avatar
    drmike · Member · Jun 24, 2007 at 6:09 pm
    • Copy link Copy link

    You can also opt out as noted up above. Says that on their website.

  • Unknown's avatar
    timethief · Member · Jun 24, 2007 at 6:09 pm
    • Copy link Copy link

    Thanks for replying and letting us know you have sent in a feedback and will send in another one.

  • Unknown's avatar
    derekblog · Member · Jul 14, 2007 at 9:09 am
    • Copy link Copy link

    The Internet Archive also uses archive.org_bot and ia_archiver-web.archive.org

  • Unknown's avatar
    sohamdas · Member · Oct 5, 2007 at 7:55 am
    • Copy link Copy link

    Hi is it possible to get a list of all search engine terms used to come to my blog? If yes then how? I would like to know about it…

1 2
  • The topic ‘Search engine related features’ is closed to new replies.

Tags

  • feeds
  • idea
  • ideas
  • noarchive
  • noindex
  • ping
  • robots
  • wp-login.php

About this topic

  • In: Ideas
  • 6 participants
  • 20 replies
  • Last activity 18 years
  • Latest reply from derekblog

Couldn't find what you needed?

Contact us

Contact us

Get answers from our AI assistant, with access to 24/7 expert human support on paid plans.

Browse our guides

Browse our guides

Find step-by-step solutions to common questions in our comprehensive guides.

WordPress.com

Products
  • WordPress Hosting
  • WordPress for Agencies
  • Become an Affiliate
  • Domain Names
  • AI Website Builder
  • Website Builder
  • Create a Blog
  • Professional Email
  • Website Design Services
  • WordPress Studio
  • Enterprise WordPress
Features
  • Overview
  • WordPress Themes
  • WordPress Plugins
  • WordPress Patterns
  • Google Apps
Resources
  • WordPress.com Blog
  • Business Name Generator
  • Logo Maker
  • WordPress.com Reader
  • Accessibility
  • Remove Subscriptions
Help
  • Support Center
  • Guides
  • Courses
  • Forums
  • Contact
  • Developer Resources
Company
  • About
  • Press
  • Terms of Service
  • Privacy Policy
  • Do Not Sell or Share My Personal Information
  • Privacy Notice for California Users
DeutschEspañolFrançaisBahasa IndonesiaItalianoNederlandsPortuguês do BrasilSvenskaTürkçeРусскийالعربيةעִבְרִית日本語한국어简体中文繁體中文English

Mobile Apps

  • Download on the App Store
  • Get it on Google Play

Social Media

  • WordPress.com on Facebook
  • WordPress.com on X (Twitter)
  • WordPress.com on Instagram
  • WordPress.com on YouTube

Automattic

Automattic
Work With Us
    • WordPress.com Forums
    • Sign up
    • Log in
    • Copy shortlink
    • Report this content
    • Manage subscriptions