Msg#: 4420786 posted 7:45 am on Feb 24, 2012 (gmt 0)
Msg#: 4420786 posted 9:35 am on Feb 24, 2012 (gmt 0)
i already disallowed it from robots.txt and noindex tag
Note that the robots.txt keeps Googlebot from spidering 1.html... which means that Google won't see the meta robots noindex tag. In that situation, if Google indexes any pages that link to 1.html, Google might return the url of 1.html in something like a site: search of the domain.
If you want to keep 1.html and references to it out of the index, my approach would be to use meta robots noindex and not to use a robots.txt exclusion. With the noindex visible to Googlebot, Google would spider the page but won't return any reference to the page in its index. Hard to say what they do with the information on the page... whether they use it to form any opinion about you... or just keep the noindex on record that the page isn't to be indexed.
Just putting rel="nofollow" on a link to 1.html is not an effective way to keep 1.html out of the index, as any other links to 1.html would allow the page to be found and indexed.
rel="nofollow" links are also black holes for PageRank, although it doesn't sound like PR is your issue.
Are you trying to keep the material out of the public eye, or keep it hidden from Google entirely?