Forum Moderators: open

Message Too Old, No Replies

When will the Google directory update occur?

         

NickH

10:03 pm on Feb 28, 2003 (gmt 0)

10+ Year Member



Now that dmoz is generating rdf data, does anyone know when Google will update their directory?

quotations

10:11 pm on Feb 28, 2003 (gmt 0)

10+ Year Member



That is probably what they are doing right now.

They may try to incorporate the 19 Feb version or there is a newer one dated 26-Feb-2003 11:33

lazerzubb

10:16 pm on Feb 28, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Yah, everyone can relax for a while longer. February is a short month, and when I said "take a few days off" in another thread, I meant that you could take a few days off ~GoogleGuy [webmasterworld.com]

And they will not have any bigger time differences between the directory update and the web update, so it's just to sit back and take a rest.

Also in another comment GoogleGuy stated, that the DMOZ RDF dump was not available when they started spidering, which probably means we will see a old RDF dump for this month's comming directory update.

quotations

10:38 pm on Feb 28, 2003 (gmt 0)

10+ Year Member



That would be very sad.

There are so many major changes since September 2002. The directory structure has been completely re-done in some areas.

Ceverett

11:49 am on Mar 23, 2003 (gmt 0)

10+ Year Member



Oif, this stinks, I've been in ODP very long time!

<jumps up and down and has a hissy fit>

Like a few years ....

g1smd

12:21 pm on Mar 23, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



There were no RDF dumps from 2002 late September until 2003 early February due to bugs, corrupted database entries, duplicate CatIDs, hardware problems, and various other problems. Since an RDF run was taking nearly a week, it was a long wait to see if it failed, then a fix was applied, and then you had to wait yet another week to see if it worked.

The RDF generating schedule was restored in early February, firstly with a special "noCatID" RDF dump, as there were still some problems with dublicate IDs in the database, and then with normal service once per week since then. There have been about 6 new RDF dumps produced after that.

Initially, there was a problem with some invalid UTF-8 characters creeping into the results, and that was fixed several RDFs ago as well. There is also an experiment to do some limited single branch RDFs which is ongoing.

The latest RDFs can be found on a new server at [rdf.dmoz.org...] rather than at the old location which is being phased out.

Google took a copy of one of the RDFs nearly a month ago, and updated most of their Directory database a week or so back. Normal service has resumed some time ago. The ODP has no control over how downstream users use the data, nor when and how often they update.

So, what exactly "stinks"?