Is there any way to prevent Google and the other major SEs from caching/indexing just certain parts of a page? I cannot use robots.txt or meta robots on the page itself, because the rest of the page has to remain crawlable and searchable.
If I use character codes to spell out the names/phone numbers, will that prevent them from being indexed? Or maybe there's some special tag I can use?
Thanks!
- write the non-indexable content via external JavaScript served from a robots-excluded directory
- insert the content via a borderless iframe whose document contains a meta noindex element or is excluded by robots.txt
- use an image instead of text
Each of these alternatives has its own usability and accessibility downsides.
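For the robots-excluded directory idea, here is a rough sketch of how you could sanity-check your rules before relying on them. It uses Python's standard urllib.robotparser. The /private-js/ directory, the file names, and the is_crawlable helper are all made up for illustration, and the check only models crawlers that actually obey robots.txt.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: the /private-js/ directory name is an assumption.
ROBOTS_TXT = """\
User-agent: *
Disallow: /private-js/
"""


def is_crawlable(path: str, user_agent: str = "Googlebot") -> bool:
    """Return True if a robots.txt-compliant crawler may fetch the path."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(user_agent, path)


if __name__ == "__main__":
    # The page itself stays crawlable; the script holding the details does not.
    print(is_crawlable("/contact.html"))
    print(is_crawlable("/private-js/contact-details.js"))
```

The same check applies to the iframe variant if the framed document lives under the excluded directory.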
Note that using character entities won't prevent the text from being interpreted and indexed by search engines, since any HTML parser decodes them back to plain text.
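To see why character entities are no protection, here is a small sketch using Python's standard html module. The phone number is a placeholder and the two helper names are mine, not anything a search engine exposes; the point is only that decoding is a one-line operation for any parser.

```python
import html


def encode_as_entities(text: str) -> str:
    """Spell out every character as a numeric character reference."""
    return "".join(f"&#{ord(ch)};" for ch in text)


def decode_entities(markup: str) -> str:
    """Decode character references the way an HTML parser would."""
    return html.unescape(markup)


if __name__ == "__main__":
    encoded = encode_as_entities("555-0100")  # placeholder number
    print(encoded)
    print(decode_entities(encoded))
```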
a) - warn people that posting information on the internet is very public. Remember that it is not just the search engines that keep a record of those phone numbers: all kinds of other people use robots to retrieve and store web pages, for reasons ranging from the totally harmless to the very suspect.
b) - maybe use a method where contact details can only be seen if a user is logged in. This would prevent most search engines from seeing the phone numbers and so on. (WebmasterWorld does a version of this; check out the pages both logged in and not logged in.)
... however, both methods may affect the usability of your site and/or people's willingness to post.
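A minimal sketch of option b), assuming your site already knows whether the visitor is logged in. The render_contact function and its parameters are hypothetical, and how you determine logged_in depends entirely on your own session handling.

```python
from html import escape


def render_contact(name: str, phone: str, logged_in: bool) -> str:
    """Return the contact markup; details are shown only to logged-in users."""
    if logged_in:
        # Escape user-supplied values before placing them in HTML.
        return f"<p>{escape(name)}: {escape(phone)}</p>"
    # Anonymous visitors, including crawlers, get a placeholder instead.
    return "<p>Log in to view contact details.</p>"


if __name__ == "__main__":
    print(render_contact("Example Name", "555-0100", logged_in=False))
    print(render_contact("Example Name", "555-0100", logged_in=True))
```

Since a crawler normally requests pages without a session, it only ever receives the placeholder, while the rest of the page stays indexable.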