Forum Moderators: open
PR3 - [exampledomain.com...]
PR5 - [exampledomain.com...]
what can I do to fix this? I didn't even know this page was/could be a PR5 before I stumbled across this.
That's assuming that- the toolbar caches PR values (no reason why it shouldn't) and that it differentiates between lower and upper case in the URL (again, no reason why not).
PR3 - [exampledomain.com...]
PR5 - [exampledomain.com...]If at all possible, try and track down the site that is linking to you using the capitalised version (or the un-capitalised, whichever you want rid of) and get them to change it. I can't say I've ever heard of Google indexing a capitalised version of page unless it was linked to using that.
As InternetHeaven suggested, find out who is linking to you with the wrong case. Make sure it isn't you -- quite likely if you are used to the sloppy Microsoft naming paradigm.
One reason I prefer being hosted by Unix/Linux machines is to avoid this very problem. If someone (e.g., you) links to a page of yours, and uses the wrong case, they find out really quick. The link won't work.
-- Rich
[exampledomain.com...]
wehn you remove www from above url the PR might change again
[exampledomain.com...]
This is strage but this is how it happens. So its essential that all the link building is done on one identical url.
If so could this be a duplicate content penalty with the older page getting the higher PR?
Are your statements regarding "penalties" and "older pages" simply in relation to duplicate pages within a site or are you suggesting that those are factors between sites aswell?
There are huge lists of reasons why Google could not implement such on opposing sites but I conceed that it may be possible for Google to make such a bold attack on duplications within a site itself.
From the way the original message is worded though, I would assume that he is talking about the same page and that the concern is that Google has "created" two pages whereas he believes there is only one.
Google stores its index on Linux servers, not Windows. On Linux servers, "Widgets.htm" and "widgets.htm" are two different files, whereas on Windows, they are the same. The fact that your site is hosted on Windows is not as important as the fact that the index is parsed and categorized as case-sensitive file names.
That would git you a duplicate content problem within the cached Google index, even if it doesn't occur on your server - because the retrieved files co-exist in the cache, whereas they don't (and can't) on your server. In your example, as you're talking about directory names father than file names, then it could be that the entire directory is duplicated.
PR3 - [exampledomain.com...]
PR5 - [exampledomain.com...]
Craig,If the server is Linux, then do both forms actually exist on the site? If you can access them both, then there are either two directories, or something at your server is allowing sloppy file paths through.
-- Rich
<add>
Just thought of something else. It could be a Linux server, but it might be using the Microsoft filesystem FAT32 or NTFS. In either case, the filesystem only pretends to differentiate "Wiget-Store" from "wiget-store", but in fact treats them as the same entity. In other words, it is not offering you the full benefits of Linux/Unix.
</add>[edited by: RFranzen at 4:40 pm (utc) on Nov. 3, 2004]
Any chance this could be due to some faulty mod rewrite work?
It's probably not faulty, just not dealing with the precise situation you have here. If you are using mod_rewrite to make the directory name from a variable, and both the capitalized and non-capitalized word calls the same page on your server, then you've created a situation where your URLs have become case-insensitive (because you're not using "real" file or directory names) even though your server is running Linux.
However, Google is caching plain HTML files, so it can't see the difference between a mod_rewrite URL and a static one (which is the whole idea of using mod_rewrite in the first place).
You may need to adjust your rules to account for this situation - but the complexity depends on what kind of URL rewriting you are doing. Also, as I said, my "theory" is pure guesswork, so it might not be that at all.
then do both forms actually exist on the site?.
Not sure what you mean. There is only one page that either version of that URL points to.
might be using the Microsoft filesystem FAT32
Not sure about that one, I don't think so, but I'll check.
However, Google is caching plain HTML files, so it can't see the difference between a mod_rewrite URL and a static one (which is the whole idea of using mod_rewrite in the first place).You may need to adjust your rules to account for this situation - but the complexity depends on what kind of URL rewriting you are doing. Also, as I said, my "theory" is pure guesswork, so it might not be that at all.
I think you might be onto something there, but how can I know for sure?