Forum Moderators: coopster

Message Too Old, No Replies

PHP class to parse Wikipedia articles?

How do I get HTML out of an article without MediaWiki?

         

rincey

7:01 pm on Aug 17, 2004 (gmt 0)

10+ Year Member



Hi,

after spending a day searching for a piece of PHP that parses a full featured Wikipedia article into nice HTML I am near throwing in my towel.

I tried to extract the parsing functions from the Mediawiki sourcecode - without success. Now I spent some hours trying the get Text_Wiki to work with a set of Mediawiki rules - without success.

So, how are you doing this? Is there any other PHP class out there which is able to handle this task?

Ad

coopster

10:26 pm on Aug 19, 2004 (gmt 0)

WebmasterWorld Administrator 10+ Year Member



>>So, how are you doing this?

I'm not ;)

If I was, I may use something like file_get_contents() [php.net] or another function to retrieve the file into a variable or array and then use regular expressions [php.net] to peel out what I wanted/needed. Well, that would be one way at least.