Forum Moderators: coopster
I am using cURL to access a remote search query. I am having no problems getting the data, but parsing it into something more usefull is causing me some trouble.
I want to strip out all of the "header data" (everything from <html>.......<table>[GOOD DATA]<more junk>)
I am guessing that Regular Expressions are probably the best way to go, but I have the slightest idea of how to begin. Can anyone share aome info to get me on my way?
Thanks!
HTML::Parser
[search.cpan.org...]
And:
HTML::TableExtract
I use em for tons of screen scraping stuff.
Bill
Anyway, here is a regular expression to get you started:
preg_match("/<table>(.*)<\/table>/Uis", $string, $matches);
print $matches[1];