homepage Welcome to WebmasterWorld Guest from 54.234.0.85
register, free tools, login, search, pro membership, help, library, announcements, recent posts, open posts,
Become a Pro Member
Home / Forums Index / WebmasterWorld / Content, Writing and Copyright
Forum Library, Charter, Moderators: not2easy

Content, Writing and Copyright Forum

    
How can I block ALL those whois sites
from having a large copy of our text from our site
zeus




msg:4612237
 8:37 am on Sep 24, 2013 (gmt 0)

ALL those whois sites are really getting on my nerves, some even copy your full html from you frontpage. My site has been in trouble for years, slowly I think its be cause of all those whois sites, be cause they have sometimes a full copy of my site text, sometimes 200-500 words.

How can I block those or is there a list of most of those sites ips which could be blocked.

 

Alikris




msg:4613824
 6:14 pm on Sep 30, 2013 (gmt 0)

Hia Zeus. I've recently posted a similar thread [webmasterworld.com...]

As I say there, I strongly object to these companies profiling my website. I've blocked them in my htaccess file, but they've already profiled me, so blocking them is useless IMO.

Hopefully someone will be able to answer our questions . . .

Ali.

ken_b




msg:4613825
 6:58 pm on Sep 30, 2013 (gmt 0)

That would be useful info for anyone setting up new sites too.

.

blackpixel




msg:4613826
 7:02 pm on Sep 30, 2013 (gmt 0)

If they outrank you, it means your site is very weak in googles eyes or have some kind of penalty.

not2easy




msg:4613837
 7:57 pm on Sep 30, 2013 (gmt 0)

You can block them the same way you block all unwanted robots. As Alikiris says, until they decide to remove it, it will stay there. Blocking only the IP of the site won't help much, they likely are scraping each other for the data, and other bots are collecting stuff to sell to them. If you don't plan to spend a few hours a day checking your access logs it becomes a wasted effort.

Hoople




msg:4613847
 8:35 pm on Sep 30, 2013 (gmt 0)

My site is only in a few. I put up a blank page late at night, clicked refresh in their settings and poof! duplicate gone.

One could also server them different fodder if their bot's UA or IP is known too <BEG>

Global Options:
 top home search open messages active posts  
 

Home / Forums Index / WebmasterWorld / Content, Writing and Copyright
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved