Forum Moderators: coopster
Now I'm thinking of lots of other ways php could benefit the site and probably make updating certain aspects of the site easier.
The site is live, yet very young. It has been crawled by gbot and should be included in the next update.
So, would it be wise to change the file extensions at this point and go full php for all pages or should I stay static? I've read around here that people are having no SE problems with php sites. What are your views on this subject?
Thanks for any input;)
Andreas
I've read around here that people are having no SE problems with php sites. What are your views on this subject?
Certainly nothing wrong with telling your server to process .html as .php, except some relatively minor overhead in that it will now parse all .html pages looking for PHP directives before serving. Since all my pages have PHP in them, this isn't a big deal for me, but it's something I've at least heard bandied about.
I do know that google has definitely crawled and indexed some of my dynamic pages, based on the fact that my résumé is the number 3 hit for a search on the law firm I used to work at, and my contact info page is the number three hit for my name. No intentional SEO on either one, and they'll never show up without the right query string. The query string itself, however, has been the same for several years and scross three different hosts.
AddType application/x-httpd-php .php .php3 .phtml .html
Thanks for the help. Now I can have my cake and eat it too.
One quick, off topic, question: Do spiders follow form action links or should I use querystring to pass info?
While this is true, I would not expect spiders to follow the URI given in the action attribute which specifies [t]he program that will handle the completed and submitted form [...]. The receiving program must be able to parse name/value pairs in order to make use of them. ([w3.org ]) Since a spider will know that the program will expect some parameters that will be passed by the form, it will not follow that URI.
This behavior has nothing to do with whether the action URI contains a query string or not. (That the form may be submitted via the GET method which will cause the data to be added to the URL as a query string, is just a coincidence. This fact cannot be used as an argument when deciding whether spiders will follow the URL given in the action attribute. It does not make sense for the spider to reach that decision based on a fact of the underlying transport protocol.) Furthermore this behavior is entirely unrelated to the question of whether a spider will follow a link to an URI that contains a query string (which it will in most cases).
For navigation use links which the spiders will follow, to receive user input use forms which the spiders wonīt follow since they donīt have to contribute valuable user input (which they are very well aware of. NOT really but they wonīt pretend they have something to contribute since SEs have no interest to do that.). Doing this you wonīt have any problems.
Frankly, the only reason I can think of to make a spider do such a thing is if its intended purpose is off-line browsing, rather than SE indexing.
For navigation use links which the spiders will follow, to receive user input use forms which the spiders wonīt follow since they donīt have to contribute valuable user input (which they are very well aware of. NOT really but they wonīt pretend they have something to contribute since SEs have no interest to do that.). Doing this you wonīt have any problems.
Furthermore this behavior is entirely unrelated to the question of whether a spider will follow a link to an URI that contains a query string (which it will in most cases)
We all know for a fact that no strings is always better. So if that is absolute then as we get farther from that model the more likely we are to find problems.
Clean url's unless absolute necessity says otherwise.
it definitely does. I have worked with sites that had great rankings then moved to a dynamic set up with long, 2 to 7 var, strings in the url. They effectively removed every listing they had.
Just by rewriting them to dir style has solved the spidering problem. It still creates the look of being 2 to 7 levels, or more, down in the directory structure but it does solve the spidering issues.
And frankly I donīt see any point in having a query string. One RewriteRule takes care of that. If you donīt have mod_rewrite, install it. If your hosting company does not have it, switch hosts. If you are running IIS you seem to have the money to buy a IIS rewriting engine.
dingman, I didnīt get that you were only talking about the truly ambitious spider author. ;)
Andreas
If you donīt have mod_rewrite, install it. If your hosting company does not have it, switch hosts.
How does one use rewrite rules to eliminate query strings?