Forum Moderators: martinibuster
Question: Are you allowed to have adsense on a page blocked by robots.txt? (I think that's covered and the answer is no, correct?)
So essentially, google will have to spider ever more urls. An insane number. If a bot gets trapped in some loop on dynamic pages it can get crazy with useless URLs. How can G possibly keep up?
Then when you have a change to the page, it is important that mediabot refreshes the page for relevant content in a timely manner.
If it was just about a visit and they didn't need to keep any of the data, it would be easier. But the Mediabot probably has to be faster and more complete than the regular googlebot. And it needs to keep data on each url it visits so it can provide relevant content.
To me this will be one of the biggest challenges google faces in scaling the system.
People have seen lots of problems with session IDs - the same thread will be spidered repeatedly with each viewer's session ID. The bandwidth will eventually ad up, especially on super-active SID sites.
The call to the Mediapartners bot happens when someone views a page.
Displaying the ad is completely a different process that doesn't involve the Mediapartners bot, so even if the rogue bot did go haywire, no Mediapartners issues would arise.