| DMOZ Dump Related Categories |
AndAgain

msg:475490 | 12:03 am on Jun 12, 2005 (gmt 0) | Does anyone know two things:? How are rel@ted categories dealt with as far as data design goes. And I have used a couple of tools that transform the rdf dump into data tables/txt files...but not only are related categories not dealt with but it leaves it very unclear as to what data columns are really all there to work with. Before I get busy and to get this off my mind...who knows what columns are in the rdf dump to work with? Thank you in advance, AndAgain
|
g1smd

msg:475491 | 6:14 pm on Jun 12, 2005 (gmt 0) | There are two files for the RDF. One of them defines the structure. In there should be noted the relcat and @link information. The other file just lists the URLs of the sites and which category they are in. [rdf.dmoz.org...]
|
tschild

msg:475492 | 8:58 am on Jun 14, 2005 (gmt 0) | The following files in the ODP RDF download directory contain a small subset of categories and listings so you can look at the data structure without handling gigabytes of data: content.example.txt (for the site listings) structure.example.txt (for the category structure, @links, related links, etc.) (the TOS for this forum seem to preclude giving the URLs for these files but you can just google for them).
|
|
|