Forum Moderators: open

Message Too Old, No Replies

Google still with old dmoz dump

         

SEOPTI

4:48 pm on Nov 28, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Tons of dmoz porn spam sites at Google in different categories, like
disabled people, children (because of expired domains).

They don't mind to update their dmoz results. Worst ever!

troels nybo nielsen

4:55 pm on Nov 28, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Is this Google stuff? Isn't it a dmoz problem?

Brett_Tabke

4:56 pm on Nov 28, 2002 (gmt 0)

WebmasterWorld Administrator 10+ Year Member Top Contributors Of The Month



The odp has not updated the RDF dump since last sept when the software that builds the rdf failed. They are working on a solution.

bluecorr

5:00 pm on Nov 28, 2002 (gmt 0)

10+ Year Member



what does rdf mean? Does this mean it's now pointless to submit to dmoz and that sites won't show up?

engine

5:10 pm on Nov 28, 2002 (gmt 0)

WebmasterWorld Administrator 10+ Year Member Top Contributors Of The Month



blucorr,
No, carry on submitting.
The RDF dump is what any other site uses to generate their DMOZ clone.

SEOPTI

5:14 pm on Nov 28, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



bluecorr, I believe it's worthless submitting to dmoz because google
won't fetch the sites.
Maybe they will solve the problem next year, who knows.

JonB

5:20 pm on Nov 28, 2002 (gmt 0)

10+ Year Member



i think dmoz is slowly becoming obsolete but becasue of the power they have (other sites are worse) it is difficult to dump them. like 32 and 64 bit processors - tehnology is longer time here but intel etc have to overclook "old" processors to keep compatibility. new proccessors i belive require different "everything" so most if not all program would not work (well, you cna emulate but that is not the key) so we have to keep the old structure.

they cant come up with something for 3 months?

if we comapre nubmer of sites in google and dmoz and number of spam sites in both i would say in dmoz is more spam(spam including all those affilaite sites placed in categories by that category editors&their friends + redirects/expired domains)! not to mention that sites in dmoz were all hand picked so there should be minimal spam.

Lisa

7:17 pm on Nov 28, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



This is horribly disappointing they can't get a new RDF dump, it doesn't take a rocket scientist to get some new hardware and get it working again. 10 days tops. Google should think about setting up their own directory. An Organization that takes more then 2 months is outrageous when their are countless people out there pinning expectations on it!

kstprod

7:24 pm on Nov 28, 2002 (gmt 0)

10+ Year Member



I have to agree how disappointing it is to have the rdf so late, especially to someone with a new site. Although I'm not complaining on this G update, my link for DMOZ sure would have helped, along with the Google directory listing.

Oh well, hopefully it won't be long, as they are working on it. Griping about it won't get us anywhere, so we might as well focus our efforts on other site stuff. :)

Hang in there, we'll live. It could be SO much worse, like, God Forbid, being dropped from Google entirely... :)

Karen

Dumpy

7:38 pm on Nov 28, 2002 (gmt 0)

10+ Year Member



I'm sure Google crawls the DMOZ directory and picks up everything in that manner. In fact, all the SEs that crawl sites are probably up-to-date. The sites that parse the directory using the RDF Dump are grossly behind in getting all the new sites since about Sept 17th....which means they are not appearing 100's of directory sites. It is estimated that well over 200,000 new sites are affected.

I would agree that if DMOZ were a profit center...the problem would have been fixed by NetScape within 24 hours. What disturbs me is that NO MESSAGE exists on the site about their problem. It is too bad that DMOZ is not truely an open source site...the problem could be solved by volunteers.

SEOPTI

7:52 pm on Nov 28, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



It's time for Google to replace DMOZ with a non-volunteer system with paid staff.

Dumpy

8:12 pm on Nov 28, 2002 (gmt 0)

10+ Year Member



I believe in the basic plan that DMOZ operates under...however, it needs to be disentangled from the AOL/Netscape alliance. In my opinion it should be spun off as a nonprofit corporation and become absolutely transparent and open source. The costs of operation are trivial without paid staff and with just a simple donation system and perhaps grants by corporations it could become the true powerhouse of the internet.

There is too much secrecy and lack of public relations for the weakening position it is devolving into under it's present management. The result of this present fiasco will be hundreds more directories changing over from parsing to crawling...and putting more pressure on their system.

rfgdxm1

8:29 pm on Nov 28, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



>I'm sure Google crawls the DMOZ directory and picks up everything in that manner. In fact, all the SEs that crawl sites are probably up-to-date. The sites that parse the directory using the RDF Dump are grossly behind in getting all the new sites since about Sept 17th....which means they are not appearing 100's of directory sites. It is estimated that well over 200,000 new sites are affected.

I know for a fact Google does spider the DMOZ. Thus, one will get PageRank benefit from a DMOZ listing even if the RDF dump is delayed like it has been. However, if you do to the Google directory itself, it is NOT current.

steveb

2:21 am on Nov 29, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Sites listed since the rdf failure (for example one month ago) show dmoz backlinks in this update.

SEOPTI

4:32 pm on Nov 29, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Problem are the DMOZ and GOOGLE directory expired domains spam sites, they still pass PR
because still in the GOOGLE directory.

Of course they have been removed from DMOZ months ago, but GOOGLE didn't get this data.
All expired domains GOOGLE directory spammers must be really happy now.

Marcia

12:24 am on Nov 30, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



If there's a domain that expired and the site is no longer appropriate for the category, just write the editor pointing it out. It only takes a couple of minutes to do, and helps lighten the load for them a bit.

dauction

6:50 pm on Nov 30, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Marcia,

"f there's a domain that expired and the site is no longer appropriate for the category, just write the editor pointing it out. It only takes a couple of minutes to do, and helps lighten the load for them a bit. "

thats a nice theory but in reality DMOZ editors that I have spoken to REFUSE to take out sites I have pointed out..

The problem remains...thosands of "volunteers" that their MAIN concern is inserting their own web sites.. I have a HUGE file of "editors past and presernt and their websites..the list grows everyday.

I see no concerted effort to "clean" up the rampant corruption.

Yes, of course their are some good editors..but by and large DMOZ is filled with cheats.

ciml

7:29 pm on Nov 30, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



dauction, I'm sure we'd all be surprised if none of the 53,280 ODP editors were cheats, just as we would be if they all were. The proportions really aren't Google news.

As Brett pointed out in message 3 [webmasterworld.com] of this thread, the ODP RDF dump problem seems to be the reason for the stale Google Directory listings.

Chef_Brian

3:54 pm on Dec 4, 2002 (gmt 0)

10+ Year Member



Hey Everyone,

Any news on the status of the "dump" .... sure hope this gets worked out in the coming weeks.

Brian

SEOPTI

9:09 pm on Dec 4, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



No news from Google and Dmoz.

SEOPTI

1:05 am on Dec 8, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Googleguy, can we get an update on this?

Robski

3:30 am on Dec 8, 2002 (gmt 0)

10+ Year Member



Glad to see this thread because I have been asking myself a few questions. Maybe someone here can provide some answers. How often does the Google directory get updated from dmoz.org? When was the last update? When did this problem with the rdf begin? When is the next refresh of the google directory supposed to happen? Finally, since google has the directory so prominently displayed as one of their main search tools, shouldn't they be doing something to help resolve this problem? This can't bode too well with their users for whom they so ernestly strive to bring "the most relevant content."

nebuhost

5:12 am on Dec 8, 2002 (gmt 0)

10+ Year Member



I wish they'd dump as often as I have been lately.

Its true I'd guess over 70% of dmoz editors are corrupt.

WebManager

2:55 pm on Dec 8, 2002 (gmt 0)



Google will use Yahoo! next - I'm convinced of it - it would be the only logical business decision.

g1smd

12:26 am on Dec 11, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Yesterday's RDF dump has only eleven errors in 460 000 categories. Just maybe the iterative hard work of try a fix and re-run, over the last couple of months is now nearing to an end. Lets hope for good news in the next couple of weeks. See [dmoz.org...]