duplicate content

anyone know how I fix this?

         

onionrep

8:45 pm on Jan 30, 2003 (gmt 0)



After the last update I noticed that a site I run has been pr greyed (previously pr6). At first I thought it was related to the sql worm, then I put it down to the toolbar being a little unpredictable during updates, so I decided to wait it out and see what happened.

After reading another post on here I noted that a fellow user had been pr greyed, and later found out that someone had ripped his content off and duplicated it.

I thought that this may have been done to me, so I decided to test the theory. I cut a bunch of text from my homepage, did a Google search for "the paragraph of words from my home page that are unique to me" and was surprised to see two results, both from my home page. The content was identical; the only difference was the address.

The addresses shown in the serps were in this form:

Site Title ...
..the paragraph of words from my home page that are unique to me..
www.domain.com/ - 26k - Cached - Similar pages

Site Title ...
..the paragraph of words from my home page that are unique to me
domain.com/ - 26k - Cached - Similar pages

www.domain.com/ is the address for which I have built all my links, and the page that now shows a grey pr, down from pr6.

The address domain.com/ has a pr2! I don't want it to have pr2, I don't even want it listed; I never put it there.

I've never promoted the domain as domain.com, so I am perplexed as to how or why Google has picked it up like this. No one links to me as domain.com, and even if they did, it would be pretty harsh if Google decided to penalise a domain for dupe content on that basis (it would be an open door for getting competitors penalised).

I haven't spammed, cloaked or done anything remotely iffy; in fact I've been very cautious and followed the advice given at this and many other forums with regard to creating good, user-friendly content.

I'd email Google and explain things, but the consensus seems to be that they get thousands of these "why am I banned?" emails and rarely answer them.

Can anyone offer any suggestions as to how I can fix things?

Possibly by using robots.txt, .htaccess or some other server-side configuration?

Thanks for all suggestions.

onionrep

10:34 am on Jan 31, 2003 (gmt 0)



[webmasterworld.com...]

This thread discussed something similar to the problem I have.

It focused on an NT box; can someone help me with a Unix alternative?

My Apache <VirtualHost> settings are as follows:

ServerAdmin webmaster@***.co.uk
DocumentRoot /www/***.co.uk/www/
ServerName ***.co.uk
ServerAlias ***.co.uk www.***.co.uk
ScriptAlias /cgi-bin/ /www/***.co.uk/cgi-bin/
ErrorLog /www/***.co.uk/logs/error_log
CustomLog /www/***.co.uk/logs/access_log combined

I really would be much obliged if someone can tell me how to add to/edit the above so that when googlebot visits [***.co.uk...] it no longer sees it, or is redirected to [***.co.uk....]

I have tried removing ***.co.uk from the ServerAlias line so that it reads:

ServerAlias www.***.co.uk

But that just makes the URL redirect to another domain hosted on the server. :(

lazerzubb

4:01 pm on Feb 4, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Are you looking for a 302 server redirect?

onionrep

4:21 pm on Feb 4, 2003 (gmt 0)



Problem now resolved thanks.

Not by me, but a friend changed things and used a 301.

RedirectPermanent I believe.

lazerzubb

4:24 pm on Feb 4, 2003 (gmt 0)




Yeah, that will do fine too. I mixed up my numbers (a 301 would be better in your case).
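
For anyone landing here with the same problem: the thread doesn't say exactly what was changed, so the following is only a rough sketch of one way a RedirectPermanent fix could look on a Unix Apache box. It assumes name-based virtual hosts on port 80, uses example.co.uk as a placeholder for the masked domain, and reuses the paths from the settings quoted earlier. The idea is to give the bare domain its own virtual host that does nothing but send a 301 to the www address, and to drop the bare domain from the ServerAlias of the main host.

```apache
# Sketch only: example.co.uk is a placeholder for the masked domain,
# and <VirtualHost *:80> assumes name-based virtual hosting on port 80.

# Bare domain: serves no content, just a 301 to the www host.
<VirtualHost *:80>
    ServerName example.co.uk
    RedirectPermanent / http://www.example.co.uk/
</VirtualHost>

# Canonical www host: the original settings, minus the bare-domain alias.
<VirtualHost *:80>
    ServerAdmin webmaster@example.co.uk
    DocumentRoot /www/example.co.uk/www/
    ServerName www.example.co.uk
    ScriptAlias /cgi-bin/ /www/example.co.uk/cgi-bin/
    ErrorLog /www/example.co.uk/logs/error_log
    CustomLog /www/example.co.uk/logs/access_log combined
</VirtualHost>
```

Because the bare domain now matches a virtual host of its own, requests for it should no longer fall through to whichever other site happens to be the server's default, which is probably what happened when the alias was simply removed.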