My index.html consists only of the <head> (including meta info) and the following script:
<script>location.href = "index.asp"</script>
Now, when I search in Google, I get strange results. The index.html gets indexed all right, but index.asp does not, even though at this stage it has no query string. Still, the spider seems to reach the dynamically generated pages anyway, because pages that are linked only from there have been indexed.
AltaVista, on the other hand, lists only the index.html.
Anyone got a clue what the problem is? A solution would be nice too :-)
So if index.html links to sub.html with an <a href>, sub.html links to subb.html with an <a href>, and subb.html points to subc.html only through JavaScript:
Your visitors will see:
index.html -(a)-> sub.html -(a)-> subb.html -(JavaScript)-> subc.html
The search engines will see:
index.html -(a)-> sub.html -(a)-> subb.html (contains JavaScript, which is ignored)
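To make the two views above concrete, here is a rough sketch of a spider that follows only <a href> links and never runs scripts. The function names (extractCrawlableLinks, crawl) and the page contents are made up for illustration, and the regular expressions are a simplification, not how any real search engine parses HTML.

```javascript
// Hypothetical helper: returns the href targets that a spider which does
// not execute JavaScript could discover in a page.
function extractCrawlableLinks(html) {
  // Drop <script> blocks entirely, since such a spider never runs them.
  const withoutScripts = html.replace(
    /<script\b[^>]*>[\s\S]*?<\/script\s*>/gi,
    ""
  );
  // Collect quoted href values from <a> tags (simplified matching).
  const anchorPattern = /<a\b[^>]*?\bhref\s*=\s*(["'])(.*?)\1/gi;
  const links = [];
  let match;
  while ((match = anchorPattern.exec(withoutScripts)) !== null) {
    links.push(match[2]);
  }
  return links;
}

// Assumed page contents mirroring the chain described above.
const pages = {
  "index.html": '<a href="sub.html">Sub</a>',
  "sub.html": '<a href="subb.html">Sub B</a>',
  "subb.html": '<script>location.href = "subc.html"</script>',
  "subc.html": "<p>Only reachable through JavaScript</p>",
};

// Breadth-first walk over the in-memory site, following <a href> only.
function crawl(startUrl, site) {
  const seen = new Set([startUrl]);
  const queue = [startUrl];
  while (queue.length > 0) {
    const url = queue.shift();
    const html = site[url];
    if (html === undefined) continue; // unknown page, nothing to parse
    for (const link of extractCrawlableLinks(html)) {
      if (!seen.has(link)) {
        seen.add(link);
        queue.push(link);
      }
    }
  }
  return [...seen];
}

console.log(crawl("index.html", pages));
// prints [ 'index.html', 'sub.html', 'subb.html' ]
```

subc.html never shows up in the result, because the only pointer to it sits inside a <script> block that this kind of spider throws away.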
I also tried to search for direct links from other sites, but there are none.
From #6 of Google Webmaster Information [google.com]:
It is almost impossible to keep a web server secret by not publishing any links to it. As soon as someone follows a link from your "secret" server to another web server, it is likely that your "secret" URL is in the referer tag, and it can be stored and possibly published by the other web server in its referer log. So, if there is a link to your "secret" web server or page on the web anywhere, it is likely that Googlebot and other "web crawlers" will find it.
Could that be the explanation for it?