What do you mean, the software won't let me post five pages of blather in a single serving?
Chop, chop, slice.
Chapter One:
A year ago I did not know google existed. It’s just a place you go to look up stuff. Things I have learned in recent months:
1. A Watched Search Engine Never Indexes
I need to change my signature in another forum. So why don’t I? Because when I do, half a dozen people will race to google to look it up. (I know this from past experience.) The complete phrase does not currently exist anywhere on the Internet. So I added a paragraph to the appropriate page, sat back and waited for g### to re-index it.
For the next seven days, they tormented me by repeatedly crawling every single page in the directory-- except that one. They have now moved to Phase Two: putting it in the “We’ll index it when we get around to it” bin. Unlike WebmasterWorld forums which apparently do not trust their members to decipher a three-letter abbreviation, I do not find my words in g### an average of five minutes after uttering them. Unless, of course, I uttered them here.
2. An Unwatched Search Engine Always Goes Haywire
I have a page that I’m particularly fond of. Its title is, by necessity, a fairly well-known phrase. So it should be pretty gratifying that g### puts the page at something like #5 or 6-- slightly higher in Canada-- out of something in excess of a billion hits. (For you Europeans, that’s 10^9, not 10^12.) This ludicrously high number is because every single word in the phrase is a short, common word-- most of them so common, they wouldn’t even be listed as keywords. It would be hard to write a page of text that didn’t use them. (I checked. I’ve already used all six.)
Except, except, except... Except that I know darn well the people who enter this search string are not looking for my cute little page, or anything like it. They know it too, and stay away in droves. They’re probably looking for the locus classicus of the phrase, and swearing at google for giving them a bunch of irrelevancies. Sometimes I’ve edited pages after looking at search strings and figuring out what kind of information people are actually looking for. Not this time.
3. Put ’em Back the Way They Was
I got used to glancing at the keyword list in GWT, the one buried in the right column of the “dashboard”, and shrugging it off. I write a lot of linguistically oriented stuff, so it is probably inevitable that two of the top five were “word” and “verb”. (The really good words like “morphophonetic” are nowhere to be found.) Also “it’s”. Really. It’s. IT’S. This would be nice if I were selling ice cream treats in San Francisco, but otherwise it doesn’t get you far.
Then one day I looked-- and every single word was different. Most of them were personal names that have nothing to do with me. What the bleepity bleepity? G### had compiled a fresh Keywords list out of just one page-- and that one page is a MiSTing, so half the words aren’t even mine. (The original author can only bow his head in resignation. He wrote it in 1804, and by 1811 it had been successfully pirated.)
I’m wrong, of course. The familiar old monosyllables are still there, just 10-20 steps further down. It happened because this page, which google has somehow only just discovered although it has been essentially unchanged since early 2009, is the second-fattest page on the whole site. The #1 fattest page is an e-text made from a massively plagiarized novel whose favorite source was the book used in #2. It took a few more days for that one to be digested, but I now have even more of the same names. And some others, equally meaningless to me. Page #4 by weight is also an e-text. It’s a children’s book about rats, so it took some close searching to establish that they’ve keyworded it. I wonder why Whiskerandos rates, while the equally prominent Dwishtswatshiksky doesn’t? The biggest oddity was #3, which contains at least one name that should count for something. That’s assuming for the sake of discussion that a former prime minister of Canada carries more weight than the word “and”. Guess I should have slipped his name into the text a few more times.
To be continued...