Forum Moderators: open
the particular string for the page exists as a singular entity, therefore, the page exists as much as any other page exists. The spider grabs the browser output that is uniqely identified by that url.
I always found that alltheweb was actually one of the better spiders with dynamic urls.
ok
the variables to create the page string is big. we are talking about over 500k cells in various db´s.
so what are the tricks to get these variable strings listed in alltheweb?
less is always more when talking about query strings in general. Have you ever looked into tools like mod_rewrite? Take a look at this thread.
An Introduction to Redirecting URLs on an Apache Server [webmasterworld.com]
That thread is specific to apache but iis has mod_rewrite as well though I am not as familiar with it. They key is to either keep the query strings to minimum and when unavoidable or there are indexing problems a lot of people then use mod_rewrite.
Looking at good interanl linking structures always helps too.
None of this is really specific to ATW but is very good practice for all spiders.