Robots.txt

I have created a robots.txt in the root directory:

 

User-agent: *
Disallow: /

Isn't this supposed to tell any bot not to index anything from the site?

Google is still indexing the site, even after a year. Am I entitled to file a lawsuit against Google simply because they ignored robots.txt? Maybe robots.txt isn't enough? Maybe Dolphin was made to be "public" and ignores robots.txt? Whatever the case may be, it needs to be fixed. WordPress has an option to choose whether your site is searchable or not. I think Dolphin should include this.

Comments

Nathan Paton
You should be able to have it removed through a number of options. I recommend you read this page: http://www.google.com/support/webmasters/bin/answer.py?answer=164734
Audanbal
Thank you, Nathan.

Obviously, robots.txt is a poor choice. I just created a password-protected directory in the home folder. Do I need to add anything to the .htaccess in the Dolphin root as well?
Nathan Paton
If you want to disallow all bots from indexing your site, you'd have to add the following to your installation's root directory:

User-agent: *
Disallow: /

I think the reason why you're having problems is because you used "User-Agent" instead of "User-agent." I don't know how bots interpret it, though.
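On the capitalization question, one way to check is to feed both spellings to a parser and compare. The sketch below uses Python's standard-library urllib.robotparser, which is only one parser and not necessarily how Googlebot behaves; the helper name blocks_everything and the sample path are made up for illustration.

```python
from urllib.robotparser import RobotFileParser


def blocks_everything(robots_txt: str, agent: str = "Googlebot") -> bool:
    """Return True if this robots.txt text forbids `agent` from fetching a sample page."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network access is needed.
    parser.parse(robots_txt.splitlines())
    # "/any/page.html" is an arbitrary, hypothetical path used only for the check.
    return not parser.can_fetch(agent, "/any/page.html")


lower = "User-agent: *\nDisallow: /\n"
upper = "User-Agent: *\nDisallow: /\n"

print("User-agent spelling blocks all:", blocks_everything(lower))
print("User-Agent spelling blocks all:", blocks_everything(upper))
```

Python's parser lowercases the field name before matching, so it treats both spellings the same way. Other crawlers may differ, but this suggests capitalization alone is an unlikely cause.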

If you want to make your site password-protected through your web server, you'll have to edit the root .htaccess file. I should warn you that some parts of Dolphin will stop working under these conditions.
Audanbal
Actually, it's User-agent: * in the robots.txt in the Dolphin root directory, and it has been untouched for a year. I must have gone dyslexic whilst typing in here. :oþ I have edited the post accordingly. Anyway, I am still puzzled as to why Googlebot ignores this.
dolphin_jay
Place <meta name="robots" content="noindex"> in your <head> section. From what I read, it's because your URL is linked to from another website, so it gets indexed from there.
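To confirm the tag actually made it into a rendered page, the HTML can be checked with a few lines of Python. This is a minimal sketch using the standard-library html.parser; RobotsMetaFinder and has_noindex are hypothetical names. Keep in mind that a crawler has to be allowed to fetch the page before it can read the tag, so a noindex tag on a page blocked by robots.txt may never be seen.

```python
from html.parser import HTMLParser


class RobotsMetaFinder(HTMLParser):
    """Collects the directives found in <meta name="robots" content="..."> tags."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        # Attribute values can be None for bare attributes, so default to "".
        attr = {name: (value or "") for name, value in attrs}
        if attr.get("name", "").lower() == "robots":
            for part in attr.get("content", "").split(","):
                self.directives.add(part.strip().lower())


def has_noindex(html: str) -> bool:
    """Return True if the HTML carries a robots meta tag that includes noindex."""
    finder = RobotsMetaFinder()
    finder.feed(html)
    return "noindex" in finder.directives


page = '<html><head><meta name="robots" content="noindex"></head><body></body></html>'
print("noindex present:", has_noindex(page))
```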

You can also use Google Webmaster Tools to remove your domain name manually, found here: https://www.google.com/webmasters/tools/removals?pli=1

This seems more like a forum post than a blog to me, but whatever.
gameutopia
A while back I read a post by Matt Cutts from Google on his blog. I can't locate the exact link at the moment, but he claimed that technically, if a site links to yours, they can still include it in the index without actually crawling your site. Apparently they can gather enough data from the content, description, and your link on another site that links to yours, and use that info as the search result without touching your site.

Adding noindex is an option, as is preventing anyone from viewing it with password protection. You might be able to do some .htaccess blocking to prevent crawling of your site too.

You can check this post by Matt:
http://www.mattcutts.com/blog/overdoing-url-removals/

It isn't the one I was looking for, but if you search his blog you might find the one with more details. He is sort of Google's go-to guy for the webmaster world.
Audanbal
By default, without robots.txt. Click the Dashboard and choose Privacy Settings. Here you can customise who has access to all of your data. I recommend changing your default privacy settings to, at least, members so that non-members cannot access your information. On your profile page, click the Edit button to customise the privacy settings on your profile.

What's the difference between Default and Public found in Edit Profile Privacy? So, does choosing Members/Friends/Only Me also prevent crawling/indexing by Google (and the like)?
 
 
 