Parsing Google URLs

I have an ongoing project with my PHP-based tracking script. I have one issue and one question out of curiosity.

Curiosity question:
After a recent PHP update, a script broke and then got fixed with the help of a couple of people from this community. After I turned on all of the PHP errors and notices (6143), I went after a notice that said: Undefined offset: 1
The code causing it was:

list(,$querystring) = split("\?", $querystring);

After looking around, I came up with this:

list(,$querystring) = array_pad(split("\?", $querystring),2,null);

I sort of understand that by doing this I have avoided an "empty" value, but it's still quite foggy to me. For example, what does the "2" in the code mean?
There are no errors and no more notices, and this part of the script works.
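
A minimal sketch of what the padding does, for reference. It assumes explode() in place of split() (split() is deprecated as of PHP 5.3 and removed in PHP 7), and the sample URLs are made up for illustration:

```php
<?php
// Hypothetical sample inputs; neither comes from the real script.
$withQuery    = 'http://www.google.com/search?q=php';
$withoutQuery = 'http://www.google.com/search';

// With no "?" in the string, explode() returns a single element,
// so list() has nothing to read at index 1. That is what triggers the notice.
$parts = explode('?', $withoutQuery);

// array_pad($array, 2, null) appends null until the array holds 2 elements.
// The 2 is the target length of the array, not an index.
$padded = array_pad($parts, 2, null);

list(, $qs1) = array_pad(explode('?', $withQuery), 2, null); // 'q=php'
list(, $qs2) = $padded;                                       // null
```

An array that already has two or more elements passes through array_pad() unchanged, so the padding only matters when the "?" is missing.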

An issue:
One part of the script deals with Google, both ads and organic search. This part works fine, except that from time to time the result includes the https:// from the parsed URL. Here is the code:

function google($ques, $querystring, $referer, $url)
{
    $patterns = array('/www\./', '/\.com/', '/\.co/', '/google\./');
    $replacements = array('', '', '', '');

    list(, $querystring) = array_pad(split("\?", $querystring), 2, null);

    $v2 = preg_replace("/^([^\&]+\&)*$ques=([^\&]*)(\&[^\&]+)*$/", "$2", $querystring);

    // check for google.com/google.co.country/google.com.country/google.country, with or without www.
    $country = preg_replace($patterns, $replacements, $url);

    // (country == 'google') => non-country case, www.google.com/google.com
    if ($country == "google") $country = "US";

    return array($v2, $referer . "-$country");
}

The code above basically gets the country code from the URL. If it's google.com, it comes out as 01-US (01 is a reference for Google from an array that is part of the script). If it's, e.g., Google from Germany, it comes out as 01-de, and so on.
Now, when that https:// shows up, it looks like this in the variable (a few examples):
01-https://de
01-https://google
01-https://fr
01-https://br
...
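
These values can be reproduced if $url still contains the scheme when it reaches google(). A minimal sketch, assuming that is what happens; the input strings are hypothetical and only the $patterns array is taken from the function above:

```php
<?php
// Same patterns and replacements as in google(); applied in order.
$patterns     = array('/www\./', '/\.com/', '/\.co/', '/google\./');
$replacements = array('', '', '', '');

// Hypothetical values of $url, some with the scheme left in.
$bare   = preg_replace($patterns, $replacements, 'www.google.de');          // 'de'
$de     = preg_replace($patterns, $replacements, 'https://www.google.de');  // 'https://de'
$dotcom = preg_replace($patterns, $replacements, 'https://www.google.com'); // 'https://google'
```

None of the four patterns touches "https://", so whenever the scheme survives into $url it also survives into $country, and the == "google" check no longer matches.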

My limited knowledge tells me that https:// should be part of the regex list in the $patterns array.
The script has more code, including a part where I addressed an http(s) issue I had in the past:

$url = preg_replace("|https?://([^\/]+)/.*|", "$1", $_SERVER['HTTP_REFERER']);
$nonurl = preg_replace("|https?://([^\/]+)/(.*)|", "$2", $_SERVER['HTTP_REFERER']);

but that obviously did not cover the issue I have outlined here.
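
One possible explanation, offered as an assumption rather than a confirmed diagnosis: the pattern requires a "/" after the host, so a referer with nothing after the host would pass through unchanged. The sketch below shows that behaviour and an alternative based on parse_url(); the referer strings and the referer_host() helper are hypothetical:

```php
<?php
// Hypothetical referers, not taken from real logs.
$withPath    = 'https://www.google.de/search?q=php';
$withoutPath = 'https://www.google.de';

// The existing pattern needs a "/" after the host. Without one it does not
// match, and preg_replace() returns the input unchanged, scheme included.
$pattern = '|https?://([^\/]+)/.*|';
$a = preg_replace($pattern, '$1', $withPath);    // 'www.google.de'
$b = preg_replace($pattern, '$1', $withoutPath); // 'https://www.google.de'

// Hypothetical helper: let parse_url() extract the host, and fall back to
// the raw string when no host can be found (e.g. an empty referer).
function referer_host($referer)
{
    $host = parse_url($referer, PHP_URL_HOST);
    return is_string($host) ? $host : $referer;
}

$c = referer_host($withPath);    // 'www.google.de'
$d = referer_host($withoutPath); // 'www.google.de'
```

Extracting the host this way avoids depending on what follows it, which is why it is sketched here instead of adding another entry to $patterns.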

In any case, how can I ensure that the http(s):// part does not show up?

Thank you

smallcompany
