
Forum Moderators: Webwork & skibum


DMOZ - Google

When will Google update its directory?

     
4:42 pm on Jul 1, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



It's taken longer to get into the Google directory than it has to get into the ODP.

Made it into the ODP in April, two months after submission. Thank you DMOZ volunteers! :) Still a no-show in the GD & here it is July.

A thousand pardons if this is taboo here, but any clue when Google will catch up? This is atypical, right?

4:56 pm on Jul 1, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Google Directory used to be updated a few days after the last data center was updated (more or less the same time as the deep crawl started), if it was updated. But sometimes it wasn't. That also depends on the availability of a new RDF dump of course. I got the impression that the latest RDF is more recent than the last Google Directory update, so it could happen any moment. Sorry, no guarantee.
5:20 pm on Jul 1, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



> but any clue when Google will catch up?

Google has never (AFAIK) been religious about updating the directory. When will it catch up? When it feels like it :(

I have a new site that got into DMOZ with very little difficulty, and am also periodically looking to see what Google has done with it. To date, nothing.

5:28 pm on Jul 1, 2003 (gmt 0)

WebmasterWorld Senior Member steveb is a WebmasterWorld Top Contributor of All Time 10+ Year Member



From GoogleGuy's previous statements, I've been expecting the Directory and new PR to show up two or three days after all the datacenters line up. They look like they have now, so my wild guess, considering Google's love of holidays and their move into a Brave New World of updating, is that the Directory and PageRank will update on US Independence Day, July 4th.
6:22 pm on Jul 1, 2003 (gmt 0)

10+ Year Member



Why isn't this in the Google forum? This isn't a DMOZ topic.
6:34 pm on Jul 1, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



> This isn't a DMOZ topic.

Last I heard this forum was about directories, not just about DMOZ.

Decided to check the Charter (always a good idea!) and found:

Also, questions about when various directories update are allowed here.
8:11 pm on Jul 1, 2003 (gmt 0)

10+ Year Member



The question wasn't about when DMOZ updates its directory, but rather when does Google update its directory.

Kctipton's point stands.

9:45 pm on Jul 1, 2003 (gmt 0)

WebmasterWorld Senior Member steveb is a WebmasterWorld Top Contributor of All Time 10+ Year Member



This question is about when Google will update the Google Directory, not its search database. This is the forum for Directory questions, and the thread title is clear the question is about the Google Directory.

added... the forum charter says to use the Yahoo forum for Yahoo topics. It does not say that Google Directory questions should be put in the Google News forum (maybe it should), but it does clearly say that Directory update questions should go here, so at this point this question is not misplaced... even if the basic answer will always be "nobody knows when Google will update its directory".

10:28 pm on Jul 1, 2003 (gmt 0)

WebmasterWorld Senior Member rfgdxm1 is a WebmasterWorld Top Contributor of All Time 10+ Year Member



From the point of view of some people who read here, the version of the ODP that is the Google Directory is more important. Webmasters care about the traffic that a directory listing drives to their site. I'm guessing that sites typically get more hits from the Google Directory than from dmoz.org. Can anyone here with high-traffic sites comment on how hits from the Google Directory compare with hits from dmoz.org?
10:29 pm on Jul 1, 2003 (gmt 0)

WebmasterWorld Senior Member steveb is a WebmasterWorld Top Contributor of All Time 10+ Year Member



That would be off topic in this thread, rfgdxm1.
10:39 pm on Jul 1, 2003 (gmt 0)

WebmasterWorld Senior Member rfgdxm1 is a WebmasterWorld Top Contributor of All Time 10+ Year Member



Good point about that question being off topic in this thread. All please ignore that.

And, reading the charter, I notice the URL for Resource Zone has now been removed. Interesting.

10:53 pm on Jul 1, 2003 (gmt 0)

WebmasterWorld Senior Member steveb is a WebmasterWorld Top Contributor of All Time 10+ Year Member



"I notice the URL for Resource Zone has now been removed."

Which would also be off-topic for this thread.....

Skibum/laisha/Brett... maybe it would be nice to add a line to the Charter that says exactly where Google Directory questions should go, here or the Google News forum. Some people will ignore it either way, but it might be nice for it to be explicit which forum is the appropriate one for Google directory questions.

11:15 pm on Jul 1, 2003 (gmt 0)

WebmasterWorld Senior Member g1smd is a WebmasterWorld Top Contributor of All Time 10+ Year Member Top Contributors Of The Month



The Google Directory is often 6 to 10 weeks behind whatever the ODP is doing. It seems to take Google a long time to publish the data that they download from the ODP. I have no idea what they have to do to it once they have received it. The ODP produces a new RDF about every week, so new data is always there to be had.
10:10 pm on Jul 2, 2003 (gmt 0)

10+ Year Member



In terms of priorities, I don't think updating the Google directory to the latest DMOZ version is at or near the top of their agenda.
4:28 pm on Jul 3, 2003 (gmt 0)

10+ Year Member



The ODP gets over 100,000 changes per month. Google updates about once a month.

A webmaster's concern over the progress of one of those changes is understandable; Google's lack of a corresponding concern is inevitable.

Somewhere in the Googleplex there's a dark cubicle with a little post-it note on the wall saying something like "27th of the month -- run 'reload_dir' script." If the usual denizen of that cube is sick on the 27th, Google corporate management probably won't notice.

4:42 pm on Jul 3, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



> It seems to take Google a long time to publish the data that they download from the ODP. I have no idea what they have to do to it once they have received it. The ODP produces a new RDF about every week, so new data is always there to be had.

They modify the directory structure, and add PR values. I have noticed that the PR values displayed next to the listings are often different than the PR values for the actual sites.

I don't know if there's any way they could put the thing on autopilot and rebuild quickly every time they get the RDF dump. But I use the Google directory because the ODP servers are very slow at times.

7:39 pm on Jul 3, 2003 (gmt 0)

WebmasterWorld Senior Member g1smd is a WebmasterWorld Top Contributor of All Time 10+ Year Member Top Contributors Of The Month



>> >> I have no idea what they have to do to it once they have received it. << <<

>> They modify the directory structure, and add PR values. <<

Oh, I was talking at the system level. They have to parse it, extract the data, and build it into some sort of database format, then manipulate that data. The first two items may not be straightforward as the ODP RDF dump is not pure RDF format, as the ODP dump was designed before the official standard was finalised, and therefore deviates from it in several very important ways, documented at [rodan.ncc.com...] .

9:50 pm on Jul 3, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



> Oh, I was talking at the system level. They have to parse it, extract the data, and build it into some sort of database format, then manipulate that data. The first two items may not be straightforward as the ODP RDF dump is not pure RDF format, as the ODP dump was designed before the official standard was finalised, and therefore deviates from it in several very important ways

Pure RDF format or not, wouldn't it be in the same format for every dump? There are even tools [dmoz.org] that can automate parsing. My guess is that modifying the data after it's been parsed is what takes time.
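
For what it's worth, the parse / extract / build-a-database steps discussed above could be sketched roughly like this. It is only an illustration under stated assumptions: the sample data is made up in the general shape of an ODP content dump, the `listing` table and the `load_listings` helper are hypothetical names, and the sample is well-formed XML, so the sketch does not deal with any of the dump's deviations from the RDF standard. It says nothing about what Google actually does.

```python
import io
import sqlite3
import xml.etree.ElementTree as ET

# Made-up sample in the general shape of an ODP content dump (an assumption,
# not copied from a real dump). Namespaces are declared so it parses as XML.
SAMPLE = b"""<?xml version="1.0" encoding="UTF-8"?>
<RDF xmlns:r="http://www.w3.org/TR/RDF/"
     xmlns:d="http://purl.org/dc/elements/1.0/"
     xmlns="http://dmoz.org/rdf">
  <ExternalPage about="http://example.com/">
    <d:Title>Example Site</d:Title>
    <d:Description>A placeholder listing.</d:Description>
    <topic>Top/Computers/Internet</topic>
  </ExternalPage>
  <ExternalPage about="http://example.org/">
    <d:Title>Another Site</d:Title>
    <d:Description>A second placeholder.</d:Description>
    <topic>Top/Arts</topic>
  </ExternalPage>
</RDF>
"""


def local(tag):
    """Strip the '{namespace}' prefix that ElementTree puts on tag names."""
    return tag.rsplit("}", 1)[-1]


def load_listings(stream, conn):
    """Stream-parse the dump and store one row per ExternalPage element."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS listing "
        "(url TEXT PRIMARY KEY, title TEXT, description TEXT, topic TEXT)"
    )
    count = 0
    # iterparse yields each element once it is complete, so the whole
    # file never has to be held in memory at the same time.
    for _event, elem in ET.iterparse(stream, events=("end",)):
        if local(elem.tag) != "ExternalPage":
            continue
        fields = {local(child.tag): (child.text or "").strip() for child in elem}
        conn.execute(
            "INSERT OR REPLACE INTO listing VALUES (?, ?, ?, ?)",
            (
                elem.get("about"),
                fields.get("Title", ""),
                fields.get("Description", ""),
                fields.get("topic", ""),
            ),
        )
        count += 1
        elem.clear()  # drop the children we have already processed
    conn.commit()
    return count


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    print(load_listings(io.BytesIO(SAMPLE), conn), "listings loaded")
```

If the format really is the same for every dump, a loader along these lines would only need writing once, which fits the guess that whatever is done to the data after loading is what takes the time.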

 
