Sgt_Kickaxe - 9:32 pm on Sep 2, 2011 (gmt 0)
The canonical might help in some cases, but I already have canonical set and I still get crap in the index if I don't specifically block it out. Google is not 100% reliable in this.
Google is 100% reliable in that they will crawl all available data, the question is will they obey webmasters and not crawl what we say don't crawl. The answer appears to be no. I've set up several honeypot pages to see what Googlebot does in reality, the only unbiased answers come from testing for yourself.