tedster - 12:37 am on Feb 15, 2011 (gmt 0)
Welcome to the forums, gshannon, and thanks for the protection code. You're exactly right about how to identify a direct googlebot request compared to one through a proxy.
This case is still rather different, don't you think? The original Amazon forum DISALLOWS googlebot in robots.txt. So they're not getting crawled by Google at all - by intent.
Given that it is the actual support forum Amazon's cloud service - their Simple Storage Service itself - it looks like they don't want Google to rank them. Or maybe the robots.txt was hacked.