g1smd - 6:39 pm on Apr 9, 2004 (gmt 0)
The entire directory content has been converted to UTF-8 over the last few months. After converting 4 MILLION entries, there were about 20 000 encoding errors and glitches remaining, and all but the last few hundred have been fixed by hand or by running selective scripts in the last couple of weeks. Expect spidering of the ODP to improve a little more now, as well as usage of the RDF files produced from the end of the month onwards to be easier to handle too. A lot of work has gone into the conversion, and there are just a few glitches here and there to correct.