Page is a not externally linkable
dstiles - 5:52 pm on Nov 11, 2010 (gmt 0)
As I said, I have preview bot blocked with 403. They are either coming at the site through the punter's IP or are sucking via googlebot OR through an unrecognised IP using a "real" browser identifier.
The latter may be true. The punter option seems more likely EXCEPT I can't see any proxying of the punter's IP so if that is true they are also falsifying the source IP, which I can't see happening unless they have become really devious!
As noted above, at least one of my sites (make that at least 2 now!) shows pics and furniture when it shouldn't.
A client's site has pics in cache view but not in preview EXCEPT this site did not block pics until quite late (probably May 2009) and those images ARE shown, even though this breaks the recommendation in robots.txt. This is difficult to determine absolutely since pics on some pages are old and some new.
Another client site shows pics even though robots says not to BUT only on some pages. These pics (AND furniture) have always been disallowed but again are in cache view (so google has been breaking robots.txt protocol for some time... never thought of that before regarding cache).
I'm guessing here that the missing pics are probably due to google not having scraped them yet.
Another client site we run has several iframes per page (not on all pages). This seems to have caused only minor problems to preview, which shows the full iframed page WITH contents for specific keywords (but not (always?) the pics but always the furniture). Furniture is shown but not product pics on the pages we've seen so far but again may be scraped yet for preview.