Forum Moderators: Robert Charlton & goodroi
When copied to the browser it appears (of course it should) with '+' symbol in between the words.
I find this strange. I believe this URL would appear as:
http://subdomain.example.com/folder/quest/002/This%20is%20a%20question
because %20 represents space.
Does the http://subdomain.example.com/folder/quest/002/ return the same page content? If so, Google may index these instead of the longer URL, in which case you would have a duplicate content.
The CMS generates pages like this:
http://subdomain.example.com/folder/quest/002/This is a question
When copied to the browser it appears (of course it should) with '+' symbol in between the words.
The space character is unsafe because significant spaces may disappear and insignificant spaces may be introduced when URLs are transcribed or typeset or subjected to the treatment of word-processing programs.
...
All unsafe characters must always be encoded within a URL.
Within the query string, the plus sign is reserved as shorthand notation for a space.[my emphasis]
http://subdomain.example.com/folder/quest/002/This is a question=page=1
No, it gets redirected to this longer version. Sometimes Google webmaster tool reports traffic for both...
Well, no, it shouldn't. When you say "copied to browser" do you mean actual copying-and-pasting into your local address bar? Or something else?
Which form of the URL appears in your logs? Space, plus or percent?I don't have access to it as yet, have requested access to it. So the URL appear in the log is considered the ideal form of this 3 variants? Space or + or %20 in my case?
Within the query string, the plus sign is reserved as shorthand notation for a space.
So what it appears is happening here is that during the URL rewrite, the query string that contained + has been appended as the last part of the rewritten URL.Can you please explain it further for me?
This looks strange. Are you sure this is how your URL looks like and that there isn't a question mark in URL, e.g. does it in fact end with .... question?page=1I'm sorry there is a question mark in the URL for the paginated content URLs.
Are you sure that there is no error in redirect? I have seen a cases where redirects on occasions do not work - especially in .net (IIS). So normally URLs would be redirecting, then for a few requests they would not for various reasons. So when you see this in analytics, I would check the logs for that day to ensure that all request for that URL have in fact responded with 301.
I'm wondering whether these query strings might have been generated at some point as user-generated questions, which then got saved in a Q&A section on your site
and that Google might be seeing internal server files, along with rewritten files.
Did you at some point say which URL actually reaches the page? It's either + or %20 but not both. If it's + then use that in links. If it's %20 you should probably use " " (literal spaces) because it looks better and will be auto-converted, so no Duplicate Content issues.