Page is a not externally linkable
BigDave - 8:40 am on May 20, 2003 (gmt 0)
Let's suppose that the new algo with major changes needs the index data in an entirely new format. They changed the format with the latest crawl data at the time that they started testing the new algo in February. Now suppose that the deep crawl develops the equivalent of a diff between the last index and what will be the new index. With the march and april crawls the diff that is developed for the old style index will not apply to the new improved style index. Google decides that the new algo is a major improvement. They now have a hard choice. Move the new algo with old data, and then get the diffs straightened out. Or they can build and move a whole new index from scratch and still probably end up at least a month behind, due to the complexity of the process. So they decide to do the massive migration of the old data, and work on getting the diffs to apply to the new format. Depending on how they have the data stored, they may have to develop a March and an April diff, they may be able to do just an April diff, or even jump to a May diff if they decide to do it that way. Now here is another fun part. This would be the first time applying that diff to the new format. It will have to be tested first, which will make it a slower process than the old updates. It is unknown how much of the 2+ weeks between the deep crawl and the update was required to generate the diff. They may have already started working with the march or april data. We do not know. So yes, I can easily explain the February data being used for the current update. I may be wrong in the specifics, but this scenario is more likely than never having a deep crawl again.
This is all hypothetical. I am just presenting it as a possible option.