DMOZ clones

Forum Moderators: open

Message Too Old, No Replies

DMOZ clones

Is it worth to create one?

moltar

5:41 pm on Feb 13, 2004 (gmt 0)

I want to create a DMOZ clone, but only for a certain sub category, but fairly large one. Do you think it's worth the time, money and trouble? Will it rank high in SEs?

Do you know of any software/scripts that will parse RDF dumps into a database and then display it on the site? Something usable and customizable.

windharp

5:20 pm on Feb 15, 2004 (gmt 0)

Regarding the second question: [dmoz.de...] and the subcategory "Upload Tools" might help you :-)

AthlonInside

5:32 pm on Feb 15, 2004 (gmt 0)

What is the motif behind that? To draw traffic so you can redirect it to another site of yours? If that's the case, I don't think it is worth the time. And even it is not the case, i still don't think it is worth the time. ~~ Sounds a bit like nonsense ~~ Too Sleepy ~~ :)

moltar

5:01 am on Feb 16, 2004 (gmt 0)

THe motif is that I want to create a local directory of sites. Then use adsense to generate profit.

sem4u

9:58 am on Feb 16, 2004 (gmt 0)

Sounds like a good idea to me :)

jmccormac

10:22 am on Feb 16, 2004 (gmt 0)

A few Dmoz clones seem to be created specifically for Adsense. Generating a local type directory with real content is good thing but relying purely on Dmoz is risky.

Processing the RDFs from Dmoz is easy enough. And plug and play solutions based on php or perl are available - these pull the data from Dmoz dynamically thus saving you the hassle of developing your own system. (I could not find any decent programs to process the RDFs into MySQL so I had to write my own.)

Regards...jmcc

Teshka

11:25 am on Feb 16, 2004 (gmt 0)

>I want to create a DMOZ clone, but only for a certain sub category, but fairly large one. Do you think it's worth the time, money and trouble?

Just my opinion of course... I personally would build my own directory instead both because it offers something for the user that's not identical to something they can get elsewhere, and it's probably safer in the long run. The way Google totes "original content" I wouldn't be surprised if they eventually dropped dmoz clones out of their index.

>Will it rank high in SEs?

Why would it? There are ready 10,000 DMOZ clones out there. If you're linking to it from a high PR site, sure, but otherwise probably not.

moltar

1:28 pm on Feb 16, 2004 (gmt 0)

Well, the difference would be that I will remove all the dead/expired sites from my directory and also allow users to add their sites right into my directory.

jmccormac

10:50 pm on Feb 16, 2004 (gmt 0)

Well, the difference would be that I will remove all the dead/expired sites from my directory and also allow users to add their sites right into my directory.

On a small scale directory this is easy to do. However Dmoz is a rather large directory with at least 3.5 million links. The problem is that cybersquatters tend to target domains that are in the Dmoz directory so that a domain can change hands between Dmoz updates. This is a big problem with using the Dmoz dataset. Dead sites are easily flagged as such but if they are reactivated then they have to be checked again. The whole process of dead/expired/active checking has to be run on a continual basis and it has to be highly automated to be effective.

Everybody seems to think that running a web directory is easy until they try it and learn otherwise. On a small scale it is relatively easy but when you get to country level, it can become a fulltime job.

Regards...jmcc

Palehorse

1:00 am on Feb 17, 2004 (gmt 0)

I recommend gossemer threads (spelling?) I use it, and it lets you generate dynamic OR static pages, you can edit the site templates to make it totally original and it has it's own spider to check for dead links. :)

Really a sweet piece of software... No, I don't work for them.

moltar

4:28 am on Feb 17, 2004 (gmt 0)

jmccormac: I realize the difficulties involved in running a directory. Maybe not to full extent, but I know that it's not an easy task. And know that it needs continuous development and support.

Palehorse: I found the soft, but it doesn't seem to support DMOZ dumps. Did you write that yourself, or they actually include it, but don't say it on their site?

jmccormac

6:47 am on Feb 17, 2004 (gmt 0)

I realize the difficulties involved in running a directory. Maybe not to full extent, but I know that it's not an easy task. And know that it needs continuous development and support.

Good. In that case welcome to the club and good luck with your directory. :) Gossamer Threads is a good program and there is a flat file version available for non-commercial/personal use. The Gossamer Links product is a MySQL based version but it is expensive. However it has a lot more features and is held in fairly high esteem by those who use it. There is an earlier version, Links 2.0 which has a lower licence fee.

Dmoz seems to be getting its act together with updating the RDF files every week or so. The next update should be tomorrow. If your categories are not frequently updated, it may be better to write some kind of low speed crawler that will check the Dmoz pages for updates (the Page Last Updated data is included at the end of each page on Dmoz) and then use a web orientated system to pull the data in. The alternative is to slice your categories from the main content and structure RDFs, check your categories against the 'last updated' data and update accordingly.

Once you have the data in MySQL, it is then just a simple case of generating pages from this in whatever language suits. The hard part is trying to figure out what the Dmoz people were up to when they created the structure. It seems to be a case of fossilisation rather than a coherent structure.

Regards...jmcc

loxly

3:09 am on Mar 7, 2004 (gmt 0)

Yes it is worth it if you have a target audience and use DMOZ as a starting point and plan to let users update/add/delete their links and add appropriate content pages. Adding content and making a template with adsense and other links makes for a directory that should rank high in the search engines and add value to existing directories and give relevant sites another place for linkbacks that count towards google pr.

Copying the entire DMOZ is not effective in my opinion, but using the data to get a local or a content specific directory off the ground is what DMOZ is partially made for. It is Open Source data for sharing and using.

The gossamer-threads folks have a couple of good programs that work very well to set up directories that are interactive, i.e. people get to edit and add their own links. Sourceforge has some open source scripts that you should be able to modify and use also.

Don't set them up expecting to make big bucks, but they should pay for their own server space after a couple of months of being around.

victor

9:28 am on Mar 7, 2004 (gmt 0)

it may be better to write some kind of low speed crawler

Good idea

But let me highlight the term low speed -- it's as per their robots instructions:
[dmoz.org...]

g1smd

11:45 pm on Mar 7, 2004 (gmt 0)

There is a mirror at ch.dmoz.org that can also be spidered too.

DMOZ clones

Is it worth to create one?

moltar

windharp

AthlonInside

moltar

sem4u

jmccormac

Teshka

moltar

jmccormac

Palehorse

moltar

jmccormac

loxly

victor

g1smd

Join The Conversation

Moderators and Top Contributors

Hot Threads This Week