Forum Library, Charter, Moderators: Webwork & skibum

Directories Forum

    
DMOZ Dump
Related Categories
AndAgain




msg:475490
 12:03 am on Jun 12, 2005 (gmt 0)

Does anyone know two things?

How are related categories dealt with as far as data design goes?

And

I have used a couple of tools that transform the RDF dump into data tables/txt files, but not only are related categories not dealt with, it is also very unclear which data columns are really there to work with.

Before I get busy, and to get this off my mind: who knows what columns are in the RDF dump to work with?

Thank you in advance,

AndAgain

 

g1smd




msg:475491
 6:14 pm on Jun 12, 2005 (gmt 0)

There are two files for the RDF.

One of them defines the category structure. The related-category (relcat) and @link information should be recorded in that file.

The other file just lists the URLs of the sites and which category they are in.

[rdf.dmoz.org...]
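
As for the data-design side of the question, one common way to hold this in tables is to give related categories and @links their own link tables rather than trying to squeeze them into columns on the category row. The sketch below is only an illustration of that idea using SQLite. Every table and column name in it is an assumption made up for the example, not something taken from the dump itself.

```python
import sqlite3

# Hypothetical schema: all table and column names are illustrative,
# they are NOT defined by the RDF dump.
SCHEMA = """
CREATE TABLE category (
    path  TEXT PRIMARY KEY,   -- e.g. 'Top/Arts/Music'
    catid INTEGER,
    title TEXT
);
CREATE TABLE related (        -- one row per related-category pointer
    from_path TEXT NOT NULL REFERENCES category(path),
    to_path   TEXT NOT NULL
);
CREATE TABLE symbolic (       -- one row per @link
    from_path TEXT NOT NULL REFERENCES category(path),
    label     TEXT NOT NULL,
    to_path   TEXT NOT NULL
);
CREATE TABLE listing (        -- one row per listed site
    url           TEXT NOT NULL,
    title         TEXT,
    description   TEXT,
    category_path TEXT NOT NULL
);
"""


def make_db():
    """Create an in-memory database with the sketch schema."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    return conn


def related_of(conn, path):
    """Return the related-category paths recorded for one category."""
    rows = conn.execute(
        "SELECT to_path FROM related WHERE from_path = ? ORDER BY to_path",
        (path,),
    )
    return [row[0] for row in rows]
```

Because a category can point at any number of related categories, a separate `related` table (one row per pointer) avoids guessing at a fixed number of columns.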

tschild




msg:475492
 8:58 am on Jun 14, 2005 (gmt 0)

The following files in the ODP RDF download directory contain a small subset of categories and listings so you can look at the data structure without handling gigabytes of data:

content.example.txt (for the site listings)
structure.example.txt (for the category structure, @links, related links, etc.)

(The TOS for this forum seem to preclude giving the URLs for these files, but you can just Google for them.)
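
To see which fields are actually there, it can help to stream through the structure sample and print what each category record carries. The following is a rough sketch only. The embedded sample, the namespaces, and the element names (`Topic`, `catid`, `Title`, `related`, `symbolic`, `narrow`) are assumptions about what the structure file looks like, so check them against structure.example.txt before relying on any of it.

```python
import io
import xml.etree.ElementTree as ET

# Hypothetical sample: element and attribute names are assumptions about
# the dump format and should be verified against the real example file.
SAMPLE = b"""<?xml version="1.0" encoding="UTF-8"?>
<RDF xmlns:r="http://www.w3.org/TR/RDF/"
     xmlns:d="http://purl.org/dc/elements/1.0/"
     xmlns="http://dmoz.org/rdf/">
  <Topic r:id="Top/Arts/Music">
    <catid>100</catid>
    <d:Title>Music</d:Title>
    <related r:resource="Top/Shopping/Music"/>
    <symbolic r:resource="Radio:Top/Arts/Radio"/>
    <narrow r:resource="Top/Arts/Music/Bands"/>
  </Topic>
</RDF>
"""


def local(name):
    """Drop the '{namespace}' prefix ElementTree puts on tag and attribute names."""
    return name.rsplit("}", 1)[-1]


def attr(elem, wanted):
    """Look up an attribute by local name, ignoring its namespace."""
    for key, value in elem.attrib.items():
        if local(key) == wanted:
            return value
    return None


def parse_structure(source):
    """Yield one dict per Topic element, streaming through a file-like object."""
    for _event, elem in ET.iterparse(source, events=("end",)):
        if local(elem.tag) != "Topic":
            continue
        row = {"path": attr(elem, "id"), "catid": None, "title": None,
               "related": [], "symbolic": [], "narrow": []}
        for child in elem:
            tag = local(child.tag)
            if tag == "catid":
                row["catid"] = int(child.text)
            elif tag == "Title":
                row["title"] = child.text
            elif tag in ("related", "symbolic", "narrow"):
                row[tag].append(attr(child, "resource"))
        yield row
        elem.clear()  # release the finished Topic's children


for topic in parse_structure(io.BytesIO(SAMPLE)):
    print(topic["path"], topic["related"], topic["symbolic"])
```

For the real file, pass an open binary file handle instead of the `io.BytesIO` wrapper. The streaming loop is there because the full dump is far too large to load in one go.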

WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved