Forum Moderators: Robert Charlton & goodroi
I've been reading this forum for a while and it's been a really great help, this is my first post I am truly stumped on this!
I think this is something googlebot specific.
I have made a dynamic website using php/mysql and tried to make it SE friendly using the rewriteengine in the .htaccess file to make my dynamic pages look like static pages.
The implementation works as I can follow the links with my browser and a few spiders have been through the links.
The problem is, Google just won't follow the links. I thought I was suffering from the "sandbox" effect, but the other night I created a single static page on my site, with a link to it from the main page(which has been indexed) and google spidered this page within 6 hours.
So the problem doesn't seem to be the sandbox effect, just that googlebot won't follow my RewriteEngine generated links.
I would be gratefull for any ideas that anyone has about this,
Thanks,
Nam
Thanks for that - I don't have a robots.txt file, do you think it would help?
I made another couple of static pages last night with normal links to them and they havn't yet been spidered - maybe it is just the sandbox. If the new static pages get spidered i'll know it's definately something to do with the rewriteengine generated links.
I've taken the robots meta tag out of PHP and put in in as straight html because of the quote marks issue, maybe that will help
If the url's are to deep, i've got a bit of a re-design on my hands...
Cheers
Google doesn't give a hoot about your directory structure it cares about your link structure.
Home page is first level, pages linked to from the home page are second, and so forth.
Do you have access to your server logs .... see exactly what G is indexing and check your robots.txt file.
In fact why don't you post your robots.txt file if it isn't a concern for you to do so.
I've not got a robots.txt file yet - do you think it would help?
Thanks to wizard I've gonverted the links to use dashes between variables instead of slashes, not googlebot yet but Mediapartners-Google/2.1 is alot more active eventhough i've been displaying adsense for weeks
Are you certain there is no robots.txt file on your site .... in a browser do a http//www.example.com/robots.txt where www.example.com is your site.
I have never seen G not follow links unless told not to you have only a few ways of telling G's bots to hold off.
1: robot meta tag in pages remove it default is for access.
2: robots.txt file remove it default is for access.
3: .htaccess banning.
4: server firewall rules.
5: script programming.
6: active conditional html.
Other than we can't schedule the bots visit I haven't a clue as to what is going on.
Here is my robots.txt file the site gets plenty of bot visits every day.
User-agent: *
Disallow: