Welcome to WebmasterWorld Guest from 126.96.36.199
Forum Moderators: open
Google reads the entire file, and any activities that take place at that point seem to happen to the whole file. Then they store 100k, and everything after that seems to be limited to what they have stored.
Your best bet is to make sure that anything you care about having in the index is before that 101k point, but if you need a file to go past 100k for some other reason, then go ahead.
BigDave - I have seen the anchor text count well beyond the 101k size as well - I think a lot is based on the originating pages theme and PR as to how much they get counted
I also seem to recall that they showed up as backlinks. But I am far less sure about this. Backlinks are so iffy anyway that I never really put much faith in them.
I have no desire to test whether anchor text counts or not. I have never concerned myself with anchor text.
I no longer have that sort of navigation on my system, so I will no longer be able to verify that this currently applies. And experiment from a year ago are not guaranteed to still apply today.
What I can add is that on my site there are 3 reviews that I could find where the pages went past 100k. When I checked the links at the very end, those pages did not show up as backlinks, but neither did half the other links that should have.
Actually, the largest page that I could find that was listed as a backlink anywhere was only 64k. YMMV
Mmm. PDF files can have different length cutoffs. I should have said "run an experiment on HTML files and let us know what you find out." :) Also, as BigDave points, outlinks could have an effect even if they didn't fit into the cache. I don't know who BigDave is, but he is wise in the ways of Google.
I have always suspected the BigDave was an alias for GoogleGuy.
Naw, The last job I had in the silicon valley was 15 years ago, and I ain't going back. They will get my resume as soon as they open an engineering office on the Oregon, Washington, BC or New Zealand coasts.
It's just that 25 years of writing firmware, simulation and system diagnostics gives you a lot of experience in figuring out how things work.