Forum Moderators: open

Message Too Old, No Replies

Why can't google spider DMOZ and get the directory manually?

You'd think it would be in their best interest....

         

born2drv

8:12 am on Jan 27, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Wouldn't it be better to spider DMOZ several times a month and keep the directory fresh rather than wait for the rdf dump? Am I missing something here?

victor

9:20 am on Jan 27, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Spidering gets you all the pages you can find. It doesn't necessarily get you the correct directory structure.

One example: imagine during the days (ODP allows only one page per second per spider, so googlebot will run s-l-o-w) the pages are being spidered, the top-level cat "Recreation" is renamed "Sports and Recreations". Googlebot will either find it twice under different names or not at all.

Dreamquick

9:59 am on Jan 27, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Also you have the fact that a single data feed is easier to get and doesn't hammer the DMOZ site like a complete crawl would...

- Tony