I have two duplicate-content issues in Webmaster Tools (WMT). One is caused by spaces, the other by a single apostrophe.
Space, %20, %25%20, +
The site displays info on sportspersons. The URL for each sportsperson page looks like this:
/sportsperson.php?sport=hky&name=Joe+Dude
There are duplicates reported for
/sportsperson.php?sport=hky&name=Joe%20Dude
and
/sportsperson.php?sport=hky&name=Joe%25%20Dude
Which form should I be using, and how can I rewrite the other variants to it?
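To make the three variants concrete, here is a minimal Python sketch (not the site's actual PHP code) of the normalisation I have in mind. As far as I can tell, `+` and `%20` both decode to a space, while `%25%20` decodes to a literal `%` followed by a space. The function name `canonical_name` is hypothetical, and the sketch assumes that the `+` form is the one kept as canonical and that a literal `%` never legitimately appears in a name.

```python
from urllib.parse import quote_plus, unquote_plus


def canonical_name(raw_value: str) -> str:
    """Re-encode a raw `name` query value in the '+' form (assumed canonical)."""
    # '+' and %20 both decode to a space; %25 decodes to a literal '%'.
    decoded = unquote_plus(raw_value)
    # Assumption: a literal '%' is never part of a real name, so drop it.
    decoded = decoded.replace("%", " ")
    # Collapse any run of whitespace into a single space.
    collapsed = " ".join(decoded.split())
    # quote_plus encodes spaces as '+'.
    return quote_plus(collapsed)


if __name__ == "__main__":
    for variant in ("Joe+Dude", "Joe%20Dude", "Joe%25%20Dude"):
        print(variant, "->", canonical_name(variant))
```

If something like this is right, any request whose raw `name` value differs from its canonical form could be answered with a 301 to the canonical URL.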
Single Apostrophe
SQL injection attempts often target the FAQ section:
/faq/index.php?id=200' and ' union .....
This is blocked, but the generated sitemap records this URL:
/faq/index.php?id=200'
which duplicates the genuine request:
/faq/index.php?id=200
How do I remove the single apostrophe from the end? Note that this only needs to happen in the /faq/ directory.
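The clean-up I am after could be sketched like this in Python (for illustration only; the real fix would presumably live in a rewrite rule or in the PHP script). The function name `strip_faq_apostrophe` is hypothetical, and I am assuming the apostrophe may arrive either raw or percent-encoded as `%27`.

```python
from urllib.parse import urlsplit, urlunsplit


def strip_faq_apostrophe(url: str) -> str:
    """Remove trailing apostrophes from the query string, for /faq/ URLs only."""
    parts = urlsplit(url)
    # Leave everything outside the /faq/ directory untouched.
    if not parts.path.startswith("/faq/"):
        return url
    query = parts.query
    # Assumption: the apostrophe may be raw (') or percent-encoded (%27).
    while query.endswith("'") or query.endswith("%27"):
        query = query[:-1] if query.endswith("'") else query[:-3]
    return urlunsplit(parts._replace(query=query))


if __name__ == "__main__":
    print(strip_faq_apostrophe("/faq/index.php?id=200'"))
```

A URL that changes under this function would be redirected to the cleaned version. URLs that are already clean, or that sit outside `/faq/`, are returned as they came in.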
Total Rewrite
I appreciate that when setting up the site I could have made it more SEO-friendly by not using GET variables in the URL. Would it now be too late to rewrite these:
/sportsperson.php?sport=hky&name=Joe+Dude
/sportsperson.php?sport=fball&name=Harry+Lime
to these:
/sportsperson.php/Joe-Dude-hockey-player-profile
/sportsperson.php/Harry-Lime-football-player-profile
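The mapping from old to new URLs would be mechanical. A rough Python sketch of it follows, again only as an illustration rather than the site's code. The `SPORT_NAMES` table and the function name `friendly_url` are hypothetical, and the table contains only the two sport codes shown in the examples above.

```python
from urllib.parse import parse_qs, urlsplit

# Hypothetical lookup table; only the two codes from the examples are listed.
SPORT_NAMES = {"hky": "hockey", "fball": "football"}


def friendly_url(old_url: str) -> str:
    """Build the proposed path-style URL from an old query-string URL."""
    # parse_qs decodes '+' and %20 to spaces.
    params = parse_qs(urlsplit(old_url).query)
    sport = SPORT_NAMES[params["sport"][0]]
    # Join the parts of the name with hyphens.
    name_slug = "-".join(params["name"][0].split())
    return f"/sportsperson.php/{name_slug}-{sport}-player-profile"


if __name__ == "__main__":
    print(friendly_url("/sportsperson.php?sport=hky&name=Joe+Dude"))
    print(friendly_url("/sportsperson.php?sport=fball&name=Harry+Lime"))
```

Since each new URL can be derived from the old one, I assume the 301s could be generated by one rule or script rather than 100,000 individual entries.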
In total there are over 100,000 names. Would implementing this change be too much for
a) the search engines - would 100,000 301s cause a penalty?
b) the server - what is the impact on performance?