Forum Moderators: open
Q/
I am very sorry to have been a nuisance, and I have no wish to cause offence.
/Q
Terrific line for a bot UA.
not the browser or UA
SELECT
concat( f.ip, '/', f.mask ) ,
substring_index( from_unixtime( max( m.postertime ) ) , ' ', 1 )
FROM (
SELECT * FROM `smf_messages`
GROUP BY substring_index( posterip, '.', 3 )
) AS m
INNER JOIN smf_farms AS f
ON inet_aton( m.posterip ) & ( -1 << ( 32 - f.mask ) ) =
inet_aton( f.ip ) & ( -1 << ( 32 - f.mask ) )
GROUP BY f.ip
ORDER BY substring_index( from_unixtime( max( m.postertime ) ) , ' ', 1 ) DESC
54.80.0.0/12 2014-07-26 -- amazon
93.112.0.0/13 2014-07-26 -- voxility
93.115.80.0/20 2014-07-26 -- fullshop romania (voxility?)
54.192.0.0/12 2014-07-14 -- amazon
146.185.0.0/16 2014-06-11 -- HSI 100TB (was netsumo)
5.63.144.0/21 2014-06-09 -- HSI 100TB (was netsumo)
74.115.0.0/21 2014-04-01 -- anchorfree
91.108.180.0/22 2014-03-18 -- webexxpurts
65.192.0.0/11 2014-02-05 -- colostore (now some MCI/verizon?)
69.40.0.0/13 2013-09-04 -- windstream
2.232.0.0/13 2013-07-08 -- fastweb
69.174.0.0/17 2013-03-06 -- scansafe
209.251.192.0/19 2013-02-08 -- tampa time inc
93.114.40.0/21 2013-02-05 -- voxility
213.235.192.0/18 2013-01-26 -- austria tele2
209.68.0.0/18 2013-01-24 -- pairnet
69.48.0.0/12 2013-01-09 -- HSI/intergenia There's not much point to blocking 224.0.0.0/3 -- or indeed of referring to this sector in any way whatsoever -- since it appears to be perpetually unassigned (yes, even while 185 is being doled out in /22 slivers). Just makes a smidgen more work for the server.
224 to 239 is multicast so will not be used for normal web traffic of any kind. 240 up seems to be reserved for multicasting.
It turns out I have seen a handful of visits in 2014 from north of 224/8, and also north of 240/8. Hackers maybe?
As an example of this idea in practice, here is a list of the /8s that have never posted on my forum (in 5 years). So from my perspective, any candidates for a deny from /8 should be in this list.This kind of approach bothers me because it is so subjective. I did build a global map of IP addresses and owners as part of a mapping project a while ago. However I am not the average WW poster when it comes to this kind of thing and creating blocking ranges for webservers was not the main intent of the research. Ranges are redelegated and reassigned so it is possible that there could be ranges in various /8s that are human use rather than data centres.
but with essentially every request from a different /32 within the same /24
^(\d+\.\d+\.\d+)\.\d+ {blahblah} \1\.\d+ :: idly wondering if anyone reading these forums has ever had a visit from 175.45.176.0/22 ::
---------------------------------------------------------------x-x----xx
=--=-============-=========================-=-=====-==--=--==-======--== -----------------------------------------------------------------------x
===-=-============-======================-==-=-=====-==--=---=-======--= -----------------------------------------------------------------------x
-----------------------------------------------------------------------= --------xxx-xx-xxxxx--xxxxxxxxxx-xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx-xxx
==--------------====-==========-======-----------=-=----================ --x--x--x---x----x--x--x--x--x--x--x---x--x--x--x--x----x--x--x--x--x--x
--=--=--=---=----=--=--=--=--=--=--=---=--=--=--=--=----=--=--=--=--=--= ---x----------------------x---------------------x-x-------x--x---------x
--=-----------------=-----=--------------------------------------------= xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
=-==---==-=-=--=-=-======-=-===-=----==-------===----=------=-==-------= xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
======================================================================== My crawl delay is set at 100s, so they're actually obeying it to the letter... that's unusual for a bot.
Mozilla/5.0 (Windows NT 6.1; rv:12.0) Gecko/20120403211507_____Firefox/12.0
Plain clothes pinterest, it would seem