Forum Moderators: Robert Charlton & goodroi
When I now search for keywords I notice that there aren't any php files in the search results.. only htm/html.
Has anyone else seen this problem? And do you know whether google will fix this?
I hope google will go back to normal and will recognize php pages again, because now my amount of visitors has dropped seriously
Last SERP shuffle put me ahead of many of my competitors, and it seems to me that there is no penalty of any kind related to the use of PHP.
When used correctly - with the exception of a question mark in the URL, It should appear to be a HTML page when viewed by Google.
I assure you - Google is still indexing them.
Google's guidelines discourage the use:
[google.com...]
You can find lots of information in your favorite search engine using a query like:
using mod_rewrite php session ids
Apache might be a good place to start and there is a webmaster's toolkit tool that might give you some ideas.
Thanks for all your help!
Laura
When I now search for keywords I notice that there aren't any php files in the search results.. only htm/html.
From Google's Webmaster Guidline.
I agree with blue_eagle. If you dont want to use a lynx browser use the sim spider (google it).
Check if the url contains sessions IDs that are not visible when normally browsing the site. Often php will use cookies for sessions and if cookies cannot be used on the client (eg. a robot) then it will append an ID to all internal URLs on teh site. This could be the problem.
"why do people post nonsense like this?
Why do people post useless posts like this? No help to anyone!"
My apologies to the original poster...(had a bad moment there...with just a residual twinge of my original response)
always work to make your php strings user friendly and make sure your?id= string has no more then 2 variables...
you should be okay for indexing...but the more descriptive the better...
If you format it this way:
?category= or?productid= etc.....
you will have a more descriptive url string..
Both Unix and IIS servers offers ways to achieve this...
The issue with this has always been that if a spider comes along and sees a question mark in a url string and then a series of equal signs...the bot sees this as a database reference (which it is) and if the bot wiggles its way into your database...it may not be able to find its way out...and you do not want a bot caught in your database in some horrible death loop...can bring down not only your site but an entire server as resources are eaten up...
However, at one point, I accidentally happened to use very high values for my ID variables, with figures like 32967, and at that point, new pages never made it into the index.
And based on what GG has said here previously, that ID strings might be confused with session IDs, it actually makes sense. No session ID would be as low 2, 3 or 4 digits.
Therefor, I have decided never to have more than 9999 content pages on any of my sites ;-)
i was looking at a link request the other day from a php site.
Somehow Google had got confused with some of his pages.
Searching for www.hissite.com/pagename.php Google was returning a completely fully indexed different URL.
How could this happen? a mod_write gone wrong redirecting the old url to the new with a [RP] flag. Or is this something more sinister.
i asked the webmaster about this, and he said he wasn't using any rewrite software.
Has any one got any thoughts on this?
Vimes.
All of my asp pages get indexed fine -- though I am looking for a solution to re-write my URLs without mod_rewrite but an ISS rewrite alternative (as my hosting doesn't support mod_rewrite on an IIS server).