Forum Moderators: open
From their front page: Google - Searching 3,083,324,652 web pages
Let's say a mere 1/2 of 1% of these involve questionable techniques under their guidelines (3,083,324,652 x .005 = 15,416,623)
and only 1/2 of 1% of these are reported (15,416,623 x .005 = 77,083)
Assuming an editor can keep the pace of reviewing/documenting/processing 1 page every 15 minutes and given a 2000-hour work year, you're looking at roughly 10 man-years of editorial work.
And that's at extremely conservative .005 multipliers. I'd hate to be coping with what I suspect are the real numbers.
Assuming an editor can keep the pace of reviewing/documenting/processing 1 page every 15 minutes and given a 2000-hour work year, you're looking at roughly 10 man-years of editorial work.
Could maybe also assume that Googles algo is less than perfect, and those on the other side of the fence as "the spammers" are sometimes unfairly penalised and making google help desk (and editors) well aware of it.
So you can double the workload of that editor straight off :)
No wonder why the PR rep here always says, 'they prefer scalable spam fighting' ...it'd be impossible to fight it by hand.
...also, it reminds me why so many engines went into the toilet - they forgot they were search engines, - not directories.
Heavens! Up to 154,000+ reports to look into.
I'd be there are a lot of people who also report competitors' sites that may be pushing the edge, but aren't actually "bannable" spam. Probably less than the actual spam numbers though. How 'bout we slap a purely theoretical 25,000 on top of the previous number to cover false-alarm spam reports (that nonetheless need to be checked into)?
Now were talking around 175,000 "reports" to be checked out of the current index. And we're up to what... around 23 man-years of work at the previous estimate of 1 report investigated per 15 minutes, during a 2000-hour work year?
Fine. I'll give them another half hour to remove my competitor from the index in that case... ;) They'd better get cracking though.... who knows how many new spam/false-alarm-spam/I'm-not-a-spammer pages will need to be checked after the next update!
IMO very low - you might be able to take that 175,000 figure and quadruple it and still be on the low side.