Does the bot just look at the title and URL? Also, does Google check URLs that look similar for duplicate content?
I ask because in the highly competitive domain name world, there are probably different sites with similar URLs.
However, there are some great research papers - written by Google engineers & staff - that you can read which shed some light on this subject.
Consider this research paper by Bharat [citeseer.nj.nec.com], one of the 'top' engineers at Google, imho.
It's very good reading and will more than likely shed some scientific light on the problem for you (I read it myself a while ago and found it informative).
Alternatively, try checking out labs.google.com/papers.html for more publications by their staff.
Or...the answer would be nowhere but in the hearts & minds of Google engineers. I prefer to think that, by reading those papers, you get a much better idea of what Google considers duplicate and what it doesn't.