Welcome to WebmasterWorld Guest from 54.162.76.55

Forum Moderators: Webwork & skibum

Message Too Old, No Replies

DMOZ Dump

Related Categories

     
12:03 am on Jun 12, 2005 (gmt 0)

Junior Member

10+ Year Member

joined:Feb 10, 2005
posts:49
votes: 0


Does anyone know two things:?

How are rel@ted categories dealt with as far as data design goes.

And

I have used a couple of tools that transform the rdf dump into data tables/txt files...but not only are related categories not dealt with but it leaves it very unclear as to what data columns are really all there to work with.

Before I get busy and to get this off my mind...who knows what columns are in the rdf dump to work with?

Thank you in advance,

AndAgain

6:14 pm on June 12, 2005 (gmt 0)

Senior Member

WebmasterWorld Senior Member g1smd is a WebmasterWorld Top Contributor of All Time 10+ Year Member Top Contributors Of The Month

joined:July 3, 2002
posts:18903
votes: 0


There are two files for the RDF.

One of them defines the structure. In there should be noted the relcat and @link information.

The other file just lists the URLs of the sites and which category they are in.

[rdf.dmoz.org...]

8:58 am on June 14, 2005 (gmt 0)

Junior Member

10+ Year Member

joined:Jan 27, 2003
posts:166
votes: 0


The following files in the ODP RDF download directory contain a small subset of categories and listings so you can look at the data structure without handling gigabytes of data:

content.example.txt (for the site listings)
structure.example.txt (for the category structure, @links, related links, etc.)

(the TOS for this forum seem to preclude giving the URLs for these files but you can just google for them).