Yahoo MM Crawler Lobotomy?

MM crawler requesting lots of incomplete URLs

         

Constantin

12:15 pm on Jun 19, 2004 (gmt 0)


Over the last week I have been watching the new and improved Yahoo multimedia crawler work its way through my site. The requesting IP addresses fall within Yahoo's own range, so I'm not dealing with a faked UA here.
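For anyone who wants to repeat that check, here is a minimal sketch of a forward-confirmed reverse DNS lookup: resolve the IP to a hostname, confirm the hostname sits under a suffix you trust, then resolve the hostname back and make sure it returns the same IP. The function name, the injectable resolver parameters, and the suffix list are all my own assumptions for illustration; use whatever hostname suffix the crawler's operator actually documents.

```python
import socket


def is_forward_confirmed(ip, allowed_suffixes,
                         reverse=socket.gethostbyaddr,
                         forward=socket.gethostbyname_ex):
    """Return True if `ip` reverse-resolves to a host under one of
    `allowed_suffixes` AND that host resolves forward to the same IP.

    `allowed_suffixes` is supplied by the caller (hypothetical values here);
    `reverse` and `forward` default to the standard resolvers and can be
    swapped out for offline testing.
    """
    try:
        hostname = reverse(ip)[0]
    except OSError:  # covers socket.herror and socket.gaierror
        return False

    host = hostname.rstrip(".").lower()
    suffixes = [s.strip(".").lower() for s in allowed_suffixes]
    if not any(host == s or host.endswith("." + s) for s in suffixes):
        return False

    try:
        addresses = forward(hostname)[2]
    except OSError:
        return False
    return ip in addresses
```

Calling it as `is_forward_confirmed(client_ip, ["example.net"])` uses live DNS; the suffix in that call is a placeholder, not a real crawler domain. A matching user-agent string alone proves nothing, which is why both directions of the lookup are checked.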

Every day, Yahoo-MMCrawler/3.x causes something on the order of fifty 404 errors on my site. The errors come from the crawler dropping intermediate directory names from URLs, e.g. requesting /foo/index.html instead of /foo/bar/index.html.
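A rough sketch of how one might confirm that pattern from the access log: collect the 404s for the crawler's user agent, then test whether each missing path equals a real path with exactly one intermediate directory removed. This assumes the Apache/NCSA combined log format; the function names and the list of known paths are hypothetical and would come from your own site.

```python
import re
from collections import Counter

# Assumption: Apache/NCSA "combined" log format. Adjust for other formats.
LOG_LINE = re.compile(
    r'^(?P<ip>\S+) \S+ \S+ \[[^\]]*\] '
    r'"(?P<method>\S+) (?P<path>\S+)[^"]*" (?P<status>\d{3}) \S+ '
    r'"[^"]*" "(?P<agent>[^"]*)"'
)


def crawler_404s(log_lines, agent_substring="Yahoo-MMCrawler"):
    """Count 404 responses per requested path for one user agent."""
    hits = Counter()
    for line in log_lines:
        m = LOG_LINE.match(line)
        if (m and m.group("status") == "404"
                and agent_substring in m.group("agent")):
            hits[m.group("path")] += 1
    return hits


def dropped_directory_matches(requested, known_paths):
    """Return the known paths that turn into `requested` when exactly one
    intermediate directory (never the final file name) is removed."""
    req = requested.strip("/").split("/")
    matches = []
    for known in known_paths:
        parts = known.strip("/").split("/")
        if len(parts) != len(req) + 1:
            continue
        for i in range(len(parts) - 1):
            if parts[:i] + parts[i + 1:] == req:
                matches.append(known)
                break
    return matches
```

Feeding the keys of `crawler_404s(open("access.log"))` into `dropped_directory_matches` along with a list of the site's real paths shows which 404s fit the dropped-directory pattern and which are something else entirely.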

Is this a widespread issue, or is my site unique in this regard? Usually, the only 404s generated on my site come from external links gone bad. Internally, the link structure is quite clean and consistent.