Regex ignore string not char class

Hi everyone,

I'm usually able to find things out using a regex reference guide online but I think I'm overlooking something here, it's probably simple but I can't figuring it out for the past couple of hours.

I have the following regex which works great:

preg_match_all("/href=\"?(.+?)[\" >]/i",$anchortitle_matches_str,$extintlink_matches);

It gives me all links on a site wether they are in <a href="bla.html"> or <a href=bla.html> or <a href=bla.html target=_blank> format.

So that's great.

Anyway, I'm trying to ignore ftp://, mailto:, javascript: etc.
I can do this by simply looping through my result array and ignore any results found using strpos or so, however, I know it has to be possible with regex.

Basically I'm trying to do this:

href=\"?mailto://Śftp://Śjavascript:(.+?)[\" >]

But the opposite. The above ONLY gives me links that DO contain ftp mailto and javascript but I'm trying to ignore the above.

I can't figure out how to properly use ^ or otherwise negate my unwanted links using the above method. Same goes for character classes, whenever I use [ ] regex just ignores them letter by letter so:

href=\"?[^mailto://](.+?)[\" >]

This simply ignores ANY link with an m a i l t o : or a / in it, that's not what I want, i want it to ignore links with mailto:// in it (hence the subject of thing post, ignore STRING!)

I hope that made sense.

Thanks in advance for any of those who can point me in the right direction!

Also I'm aware I could just look for links based on http:// or https:// but the problem is I'm also trying to find internal links so that's no solution :)

[edited by: eelixduppy at 1:29 pm (utc) on Jan. 14, 2008]
[edit reason] disabled smileys [/edit]

Regex ignore string not char class

Regex ignore string not char class

Omala

PHP_Chimp

d40sithui

Join The Conversation

Moderators and Top Contributors

Hot Threads This Week