Weird indexing... Google indexing pages that do not exist

Jigit

8:24 pm on Jun 9, 2008 (gmt 0)

I am getting weird indexing from Google. MSN and Yahoo do not have this problem. The URL below returns the main index.htm content, but with broken pictures and a broken CSS file reference.

/index.htm/country1/country1/widgets/city/widgetname/country3/jp/index.htm
Http Code: 200 Date: Jun 09 15:53:06 Http Version: HTTP/1.1 Size in Bytes: 3452
Referer: -
Agent: Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)

Has anyone had similar Googlebot results? What could be causing this?

Here is the structure of the site:
index.htm
country1/widgets/city/widgetname/index.htm
country2/widgets/city/widgetname/index.htm
country3/widgets/city/widgetname/index.htm

[edited by: Robert_Charlton at 9:25 pm (utc) on June 9, 2008]
[edit reason] removed specifics [/edit]

tedster

3:13 am on Jun 10, 2008 (gmt 0)

Are you talking about spidering or actual indexing? Since you show a server log snippet, I'm thinking you mean spidering.

Notice that your server responded with a 200 HTTP status. That is, your server told Googlebot that the URL it requested actually DOES exist, and gave it content in response. You'll want to address that issue as soon as you can.
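
To get a feel for how often this happens, you could scan your access log for requests that were answered with a 200 even though the path is not a real page. Here is a rough Python sketch of that idea. The function name `find_bogus_200s`, the (path, status) tuple format, and the `KNOWN_PATHS` set are all made up for illustration; the paths are just the generic ones from the first post, and the log parsing is left out because log formats differ.

```python
# Hypothetical list of pages that really exist, based on the site
# structure described in the first post. Adjust to your own site.
KNOWN_PATHS = {
    "/",
    "/index.htm",
    "/country1/widgets/city/widgetname/index.htm",
    "/country2/widgets/city/widgetname/index.htm",
    "/country3/widgets/city/widgetname/index.htm",
}


def find_bogus_200s(entries, known_paths=KNOWN_PATHS):
    """Return the requested paths that got a 200 but are not real pages.

    `entries` is an iterable of (path, status) pairs that have already
    been parsed out of the access log.
    """
    bogus = []
    for path, status in entries:
        # Ignore any query string when comparing against the known pages.
        clean = path.split("?", 1)[0]
        if status == 200 and clean not in known_paths:
            bogus.append(path)
    return bogus


if __name__ == "__main__":
    sample = [
        ("/index.htm", 200),
        ("/index.htm/country1/country1/widgets/city/widgetname/country3/jp/index.htm", 200),
        ("/no-such-page.htm", 404),
    ]
    for path in find_bogus_200s(sample):
        print("200 for a page that does not exist:", path)
```

Anything this reports is a URL where the server said "yes" to a page you never published.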

Check out this related thread - [webmasterworld.com...]

As I mentioned there, it can be very hard to know where Google gets a URL. It might come from toolbar data, direct submission, someone else's page that is not now indexed but once was, a cloaked page, or a Google test of how your domain handles this "invented" URL. There are probably more sources, too!

This can cause trouble in the SERPs, especially if it happens a lot, so make sure you fix the technical problem that lets your server say "yes" (200) when you want it to say "no" (404).
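
How you make the server say "no" depends on your setup, which the thread does not describe. Purely as an illustration of the behavior you want, here is a minimal Python WSGI sketch that returns 200 only for an exact match against a list of real pages and 404 for everything else, including extra path segments tacked on after index.htm. The `REAL_PAGES` set and the `app` function are hypothetical and are not a description of the original poster's server.

```python
# Hypothetical set of pages that actually exist on the site.
REAL_PAGES = {
    "/",
    "/index.htm",
    "/country1/widgets/city/widgetname/index.htm",
    "/country2/widgets/city/widgetname/index.htm",
    "/country3/widgets/city/widgetname/index.htm",
}


def app(environ, start_response):
    """WSGI app: 200 for exact matches in REAL_PAGES, 404 otherwise."""
    path = environ.get("PATH_INFO", "")
    if path in REAL_PAGES:
        status = "200 OK"
        body = b"page content would go here\n"
    else:
        # Anything else, such as /index.htm/country1/..., is rejected
        # instead of being answered with the home page content.
        status = "404 Not Found"
        body = b"Not Found\n"
    headers = [
        ("Content-Type", "text/plain"),
        ("Content-Length", str(len(body))),
    ]
    start_response(status, headers)
    return [body]
```

The point is the exact-match check: a request for a real page with trailing junk after it should not fall through to the real page's content.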