Forum Moderators: open

Message Too Old, No Replies

Unique string at end of UA

         

wilderness

6:48 am on Aug 29, 2020 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



These began in my raw logs about mid-day yesterday.
Each string a different number, and following the closing quote of the last UA field..
EX: (opening and closing asterisks are part of string)
**0/156098**

Is this unique to my host?

jmccormac

8:56 am on Aug 29, 2020 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Scraper botnet? Check for CSS/images not being grabbed. Possibly using ISPs (compromised routers) from countries that don't usually send much traffic.

Regards...jmcc

wilderness

3:46 pm on Aug 29, 2020 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



jmcc,
thanks.
These strings are added to each and every log entry. Regardless of IP, file type, or UA.
On a full page request (html and all accompanying files, each request has a different unique string).

lucy24

4:04 pm on Aug 29, 2020 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Is this unique to my host?
Probably. Poring over Apache docs [httpd.apache.org] suggests there's nothing to stop them from adding a section with literal asterisks, and there are all kinds of things that might have a numerical value. They're just not included in default log configurations.

I don't know how communicative your host is--mine has grown less helpful over the years--but it can't hurt to ask. It might even be something they put in to track a server tweak, and then forgot to take out again.

jmccormac

4:10 pm on Aug 29, 2020 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Is there any pattern to the IP ranges? I've seen botnet activity where a unique string was being used in the UA but this looks slightly different. Is the traffic coming from ISPs or data cenrres? (Digitalocean, OVH, Leaseweb, Nobis etc)?

Also, has there been any change to the webserver's configuration?

Regards...jmcc

lucy24

4:12 pm on Aug 29, 2020 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



I've seen botnet activity where a unique string was being used in the UA but this looks slightly different.
That was my first thought, but then I reread OP and saw that he's describing something appended after the UA string.

wilderness

4:22 pm on Aug 29, 2020 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Once again, a different string to the end of each and every log entry. (lucy, I was unable to locate anything on google).

Another EX.
112.x.x.x - - [29/Aug/2020:03:49:14 -0700] "HEAD /houtai/login.php HTTP/1.1" 403 - "-" "-" **0/23637**
112.x.x.x - - [29/Aug/2020:03:49:14 -0700] "HEAD /admin/login.php HTTP/1.1" 403 - "-" "-" **0/22443**
112.x.x.x - - [29/Aug/2020:03:49:14 -0700] "HEAD /wang/login.php HTTP/1.1" 403 - "-" "-" **0/26660**
112.x.x.x - - [29/Aug/2020:03:49:14 -0700] "HEAD /jian/login.php HTTP/1.1" 403 - "-" "-" **0/39580**
112.x.x.x - - [29/Aug/2020:03:49:14 -0700] "HEAD /dede/login.php HTTP/1.1" 403 - "-" "-" **0/16846**
112.x.x.x - - [29/Aug/2020:03:49:14 -0700] "HEAD /dedea/login.php HTTP/1.1" 403 - "-" "-" **0/32524**
112.x.x.x - - [29/Aug/2020:03:49:14 -0700] "HEAD /chen/login.php HTTP/1.1" 403 - "-" "-" **0/34054**
112.x.x.x - - [29/Aug/2020:03:49:14 -0700] "HEAD /caiyuan/login.php HTTP/1.1" 403 - "-" "-" **0/45859**
112.x.x.x - - [29/Aug/2020:03:49:14 -0700] "HEAD /guanli/login.php HTTP/1.1" 403 - "-" "-" **0/42047**
112.x.x.x - - [29/Aug/2020:03:49:16 -0700] "HEAD /dede123/login.php HTTP/1.1" 403 - "-" "-" **0/30708**

jmccormac

4:28 pm on Aug 29, 2020 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Those seem to be Chinese probes for Chinese market CMSes. They might be coming from Chinese ISPs or from Alibaba ranges. The appended numerical strings look odd. They are being 403ed though.

Regards...jmcc

wilderness

4:33 pm on Aug 29, 2020 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



jmcc,
The IP, file types (and/or requests) and even UA are all irrelevant.

The focus is ONLY the string appended to end.

jmccormac

4:48 pm on Aug 29, 2020 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Are all the requests 403s or are there any 200s? It is possible to append a compression ratio/size with mod_deflate (Apache) and it could be something to do with the webserver configuration file in the logging format section.The asterisks are odd though.

Regards...jmcc

lucy24

7:06 pm on Aug 29, 2020 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



compression ratio
Huh. I couldn’t think what would fit the pattern of:
non-sequential
each number different
overall variation by a factor of 2-3, all the same order of magnitude

I thought it might be %I which is the size in bytes of the request, not the response, but unless I'm overlooking something, all those values are off by two orders of magnitude (should be in the hundreds, not in the tens of thousands).

Don, the reason we're asking about UA is just for reassurance that the UA is present in logs in other cases, whenever one is sent. (Malign robots of the cheapest kind don't bother to send a UA string, which makes them easy to block.)

My hunch is that the asterisks are delimiters supplied by the config file, not part of the string being logged. In fact it's something you might choose to do if you wanted to draw attention to a less-standard aspect of logs. (I was going to say “non-standard”, but there's a finite list of things that can be logged at all.)

wilderness

7:59 pm on Aug 29, 2020 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



lucy,
here's two requests from Google this morning.

66.249.73.39 - - [29/Aug/2020:05:06:29 -0700] "GET /MySub/Sub-sub/MyPage.html HTTP/1.1" 200 15395 "-" "Mozilla/5.0 (Linux; Android 6.0.1; Nexus 5X Build/MMB29P) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/80.0.3987.92 Mobile Safari/537.36 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)" **0/35617**
66.249.73.39 - - [29/Aug/2020:05:07:00 -0700] "GET /MySub/Sub-sub/MyPage.html.html HTTP/1.1" 200 15395 "-" "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)" **0/11271**

lucy24

9:29 pm on Aug 29, 2020 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Definitely mystifying. So in this specific case, the mobile Googlebot's request involved three ... something ... as much/many as the vanilla Googlebot for the identical page. But this seems to be a red herring, based on other log entries.

Could it be time? either
%D
%{us}T
where "us" (I assume that's an ASCII approximation of mu) in the %T form is microseconds.

Numbers in five figures would mean something in the tens of milliseconds. Is that a reasonable server processing time? (I have no idea, so I hope someone else does.)

wilderness

5:54 pm on Sep 13, 2020 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



17th day and strings continue.

robzilla

12:48 am on Nov 28, 2020 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Could be a bunch of things. Process ID, port number, bytes in/out, time taken to serve the request... check your httpd.conf for the LogFormat and you'll know. Or, as previously noted, ask your hosting provider if the Apache configuration is not under your control.

Seems harmless though, just some diagnostic probably. Maybe someone forgot to remove it.

tangor

6:03 am on Dec 1, 2020 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Just curious ... is this till happening?

Or has it gone away?