/COLLATE/ but they weren't matching. When I noticed the input file was Unicode, I saved it as "non-Unicode" and tried again, and the patterns matched.
But I need to be able to parse the original Unicode.
I'm running Perl 5.8.1 on Mac OS X.
I reckon this is an easy one, but Googling has not helped. Any help is appreciated.
[perl.com...]
Unfortunately, there is currently no way to tell Perl that incoming data from an external file is Unicode; while you can write Unicode data out to a file, you cannot read Unicode data back in again. While you can work around this with tr///CU, it's obviously a serious shortcoming, which we hope will be addressed soon.