Forum Moderators: Robert Charlton & goodroi
Here's why I ask... I have a site where over the years some sections have spaces between words in the URL encoded with the %20 and other sections where URL's have been encoded with a + for the spaces.
Today, I was doing a site: search for one of the folders that uses %20 and noticed that pages were listed in this order... First, all pages where the url had no %20 in the url, then all pages with a %20. There are 50 pages for this section (one for each state) and the pages for each single word state were first, followed by all of the 2 word states.
But, when I did the exact same site: search on another section of my site that also has 50 state pages but uses the + for encoding, the result set was mixed. i.e not all single words at the top and two words at the bottom.
I haven't been one to assume in all cases that google necessarily ranks pages with a site: search by their relative authority but there is evidence of that because usually your strongest pages are near the top of the list. If so, is google giving less authority to my pages that encode a url with the %20?
My observation was that all url's where a rawurlencode was done (%20) ranked lower on a site search than url's where there was no encoding (Illinois). Yet, if the encoding was with the + sign, rather than %20, then it appeared not to affect the rankings with the site command.
Again, I know site: rankings aren't real rankings, but why would this happen?
Note that the search allinurl:%20 returns many results that do not contain a "%" character. It seems to be treated like any generic seperator. The only separator that is indexed on its own seems to be "_", the underscore character.