lucy24 - 7:04 am on May 11, 2012 (gmt 0)
Redirect the files that need to be redirected. Serve 410s for the files that are gone and have no close replacement. Serve 404s for files that don't exist, never did exist, never will exist...
Paradox. Even though 404s are listed as Errors, g### wants you to have them. That is, they want evidence that your site is able to generate a 404 response.
Unfortunately they've given the job to the same googlebot that will continue crawling pages for years-- literally-- after they've ceased to exist. Gee, maybe that 404 was a mistake. Better try another one.
And then there's the whole issue of g### trying to read URLs out of things that were never intended as links in the first place. Lots of recent threads about that.
The percent encodings really aren't anything special. They're simply non-ASCII characters. Correction: non-alphanumerics plus a very short list of others.
This one's fun:
Notice the stray 28 and 29? Something went severely goofy there, because %28 and %29 are (parentheses).
Now, let me know if you ever get a request containing percent-escaped characters in the exact form