Forum Moderators: open
why not google define a "Recommendation(or rule?) of dynamic URL for search indexing" to help google to index those pages?
rule 1:
dyn-pages should provide "dyn-url-keys" info about its QUERY_STRING, works like "Vary" http header used by cache-proxy
dyn-pages which follow this rule may be FINE indexed, and marked as "well designed for SE index" dyn-pages
example page: http://sample.com/thread.php?id=1&page=1&forumpage=2
id=1 means threadid=1
page=1 means the first page of this thread
forumpage=2 means user click this from page 2 of the forum
and the key should be "id" and "page" not "forumpage"
then, put a tag
<meta http-equiv="dyn-url-keys" content="id;page">
in html head
and now, google is able to index this page as it is http://sample.com/thread.php?id=1&page=1 not http://sample.com/thread.php?id=1&page=1&forumpage=2
rule 2:
dyn-pages requested by non-existing/invalid value of "key" should return 404 for googlebot, and may return friendly 404 page for users.
dyn-pages follow rule 1 but not this rule, when reported by user, will be punished by google, same as doing whatever cheating google, this make web-developers stick to what rule expect, and help google-bot to remove cached but no longer exist content
NOTICE: the above message just a suggestion to google, and for discussion here, it may not used by google
any more idea about this same topic, reply here pls :)
A <meta http-equiv="dyn-url-keys" content="none"> option would also be nice - ie ignore the whole querystring.
I have some pages that use querystrings purely for tracking, the content is always the same. I am concerned about Google indexing every variation of the querystring and ending up with dozens of identical listings.
to amanda21:
should be <meta http-equiv="dyn-url-keys" content="">
instead of "none"
google care about not only how to index, also about how to anit-spam, anti-cheat
anyone who has idea about anti-* on this method, contribute pls
help google to index dyn-page also help yourself
that's why i post rule2 :)