Forum Moderators: open
already thinking to express submit before Y! change their prices
I think a mixed solution should[will?] happen.Commercial -> Yahoo
Non-commercial -> Dmoz
I doubt Google's stupid enough to think that's a functional distinction. The line between commericial and noncommericial sites (and categories) is too fuzzy to assume all the editors will see the same line, and hiring referees to manage the line will just suck up Google's money.
I doubt it. If Google replaced ODP with Yahoo!, Google would become dependant on Yahoo!, and Yahoo! could pressure Google to do what it wanted by threatening to terminate the relationship and take away its directory. With ODP, an open source solution, even if AOL, the parent company of Netscape, terminated its relationship with Google and the ODP, Google would retain rights to use the existing ODP data.
By using ODP instead of Yahoo! as its data source, Google has more control and can offer a consistent service to its users.
The only loser would be ODP, but I doubt aol will care much.
>>There are obvious, automated ways to doing this.<<
You are the only person in the world who believes this.
Please describe your "automated" method for separating commercial and non-commercial sites. I'd really like to see it.
I talk of course about an already Google-ranked site.
Examples of my absurd automated methods:
1] -> Amount and 'area' of queries that, in the real searches, give this site high in the serps.
2] -> [less relevant] Maybe this site being listed in a 'commercial' yahoo dir.
cminblues
Please describe your "automated" method for separating commercial and non-commercial sites. I'd really like to see it.
i'm no programmer but my method would be.
define commercial for a start then
spider the pages...
mark down as commercial any pages with the following...
affiliate links, ad banners from known syndicators, shopping carts (any pages linking to ssl parts of a site could be a clue)
the list goes on and like beachboy i haven't given the concept my full attention either.
affiliate links, ad banners from known syndicators, shopping carts (any pages linking to ssl parts of a site could be a clue)
Commercial sites with shopping carts:
Make a Wish Foundation [wish.org] - Buy shirts & other commercial stuff
American Red Cross [redcross.org] - soliciting your hard earned cash
Commercial site with affiliate program:
American Cancer Society Master Card [cancer.org] - Don't the best affiliate spammers get the company to rebrand/private label the offering? Makes it easier to trick those ODP editors, right? They even set up a blatant doorway domain:
[americancancercard.com...] We could check for well known tactics like that, too.
Ad banners from known syndicators:
Every free-hosting site on the freakin web.
Maybe it's not so easy to automate this...
Did somebody say "humans do it better"?
By your definition of commercial sites, you've just disadvantaged the following blatant commercial endeavors. etc etc
as you well know any sophisticated system would have many many factors included not just a simple ban shopping carts algo, exceptions can be found programatically using keywords, lists of known charities and so on, given the inclination i think google could do it if they chose to prioritise the project
Ad banners from known syndicators:
Every free-hosting site on the freakin web.
yes but most free web space is provided by a handful of well known domains, it would be easy enough to filter them into the non commercial catagory if they didn't trip any other alarms
Maybe it's not so easy to automate this...
Did somebody say "humans do it better"?
i completely disagree, of course human reviews and catagorisations are good, but
1. genuine errors occur
2. fraudulant editors exist
3. sites can change
an automated system would also make mistakes but i expect no more than humans would.
<aside>
i cannot believe there is anyone in this business who doesn't admire the efforts of the genuine editors in the odp and elsewhere
BUT i also know that you don't have to look long in the 3 main directories...
ODP, yahoo and LS ... to see that there are large numbers of very commercial sites listed in either non commercial catagories or in the case of the pay directories are listed for free.
sometimes, just sometimes humans can stink
as you well know any sophisticated system would have many many factors included not just a simple ban shopping carts algo, exceptions can be found programatically using keywords, lists of known charities and so on
Absolutlely agree with you. I'd expect that to have any chance of properly deciding if a site is commercial or non-commercial, the sophisticated system would need to have maybe 100 or more factors. And then they'd probably want to monitor and tweak it every month.
Hey Larry, what if we set the affiliate link red flag at 3 links this month?
OK Sergey, but we'll have to cross reference that with the "Is There *Enough* Unique Content" filter so that we don't knock out too many Britney Spears fan sites like we did last month.
I think you nailed the key to this sub-topic debate right here:
if they chose to prioritise the project
To have a chance of accurately deciding if a site is really a non-commercial site is going to take a ton of money/effort/work/tweaking. I'd bet it'll take a lot more work than that invisible text filter. And we all know you never see sites with invisible text anymore on G.
Littleman said:
The Y! directory on google will be an incentive for webmasters to keep signing up with express submission. This is a way for google to scratch yahoo's back without costing anything.
But introducing the sophisticated system required with the AI to discern commercial *intent* would cost alot.
We may see Google drop the ODP for Y! but I'd say that the ODP/Y! combination that you've described would be too much messing around for too little benefit.
Maybe Google could create there own directory, this way all sites might be able to get listed.