Forum Moderators: coopster
The next question is disguising the URL. Without having access to ForceType and mod_rewrite functionallity I can change the path to foo.com/script.php/1/2. So what would be nice to know is do SEs stop at the? or the .php and does is version above searchable?
Finally, if I completely disguise the URL using whatever combination of devious means to give me foo.com/script/1/2.htm does the SE have any way of knowing this is a dynamically generated page. And if not is this something they are planning to start checking up on?
Thanks
Chris
I cannot answer for all SEs, but there seems to be an understanding that Google reads links with up to two parameters. Sometimes three, at least i've seen threads mentioning it.
Both of these seem spiderable, no problem AFAIK :
- foo.com/script.php/1/2
- foo.com/script.php/1/2.htm
>> Does the SE have any way of knowing...
SE's dont mind dynamically generated pages AFAIK. Spiders don't like getting lost in a web of URLs that look different but point to the same content, but that's something else.
To answer the question: Yes, anyone can tell. Refresh the page a couple of times, and discover that the last modified date is always now, then you will know. Of course you can generate static pages in fixed intervals, they don't have to be generated on the fly, then it's not that easy.
But again, SE's don't mind dynamic, they only dislike having their spiders shuffled around by URLs they dont "understand". Less parameters generally leaves less room for errors.
/claus
<added>a dot is a legit character in a file or directory name, just created /abc.def/ to test it, although i rarely see it used like that
</added>
[edited by: claus at 10:23 am (utc) on July 18, 2003]
You might like to try a site search on "dynamic pages", plenty of threads on how spiders handle complex URLs.
Say I have a dynamic page foo.com/script.php?x=1&y=2. How far will various SEs read this. Up to the .php or further.
In general, spiders are getting better at dealing with scary looking URLs. People tend to advise against going over 3 variables, and to avoid anything that looks like a session identifier.
The next question is disguising the URL. Without having access to ForceType and mod_rewrite functionallity I can change the path to foo.com/script.php/1/2. So what would be nice to know is do SEs stop at the? or the .php and does is version above searchable?
SE should take the whole thing, it shouldn't be a problem, although I've no hard evidence to be sure. Somebody else may be able to confirm.
Finally, if I completely disguise the URL using whatever combination of devious means to give me foo.com/script/1/2.htm does the SE have any way of knowing this is a dynamically generated page. And if not is this something they are planning to start checking up on?
Other than circumstantial evidence, there is no way to tell outside of your server whether a page is static or dynamic.