another day, another Yandex

         

lucy24

11:22 pm on Feb 25, 2018 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



This is not actually a new range, but it shows up so infrequently that it's worth a mention:

Yandex (including YandexBot and WMT)
87.250.224.0/19
^87\.250\.2(2[4-9]|[3-5]\d)
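
A quick way to confirm that the regex and the CIDR notation above describe the same block is to compare them octet by octet. This is only a sketch in Python: the helper name `mismatches` and the choice of `.1` as a test host are illustrative assumptions, not anything from Yandex or from a particular server config.

```python
import ipaddress
import re

# The range and pattern exactly as posted above.
YANDEX_NET = ipaddress.ip_network("87.250.224.0/19")
YANDEX_RE = re.compile(r"^87\.250\.2(2[4-9]|[3-5]\d)")


def mismatches(pattern, network, prefix="87.250"):
    """Return addresses where the regex and the CIDR block disagree.

    Walks every possible third octet under `prefix` and compares
    "regex matches" against "address is inside the network".
    """
    bad = []
    for third in range(256):
        addr = f"{prefix}.{third}.1"  # .1 is an arbitrary host in each /24
        in_net = ipaddress.ip_address(addr) in network
        in_re = pattern.match(addr) is not None
        if in_net != in_re:
            bad.append(addr)
    return bad


print(mismatches(YANDEX_RE, YANDEX_NET))
```

An empty list means the two notations agree for third octets 0 through 255. The `[3-5]\d` branch would also accept 256-259, which is harmless because those values cannot occur in a real address.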

keyplyr

11:44 pm on Feb 25, 2018 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Ever since Yandex announced that webmasters should *not* filter by IP range, I've just been doing random manual checks on the ones I don't recognise.

Have they narrowed their valid crawl ranges down to a manageable number now?

lucy24

4:54 am on Feb 26, 2018 (gmt 0)




"Have they narrowed their valid crawl ranges down to a manageable number now?"
In a word: No.

Although I knew the one given above was a Yandex range, I'd never added it to my Ignore list, which currently contains
:: shuffling papers ::
3 (three) /16 ranges
8 (eight) /18 ranges
3 (three) smaller ranges (/19 and /22)

And that's not counting 130.193.32.0/19, which includes Yandex Translate.

:: further business with raw logs ::

For 2018 only:
one from 5.45.207
lots from 5.255.250 (about 1/4 the total)
[ faker from 37.72 ]
lots from 77.88.6
some from 84.201.133
a few from 87.250.224
lots from 93.158.161
some from 100.43.81 and 100.43.85
one from 178.154.171
vast numbers (1/2-2/3 the total) from 141.8.143-144, almost all from the exact IP 141.8.144.18. Last year at this time it was 141.8.143.141; I think 141.8 has been their favorite neighborhood for quite a while.
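
For anyone wanting to produce a similar per-neighborhood breakdown from their own raw logs, here is a minimal sketch in Python. It assumes the client IPs have already been pulled out of the log lines into a list. The function name `tally_by_prefix` is illustrative, and the sample list is made up for the demonstration (it reuses two addresses from this post plus one invented host in 5.255.250), so the counts mean nothing.

```python
from collections import Counter


def tally_by_prefix(ips):
    """Count hits per first-three-octets prefix, e.g. '141.8.144'."""
    return Counter(ip.rsplit(".", 1)[0] for ip in ips)


# Hypothetical sample, NOT real log data.
sample = [
    "141.8.144.18",
    "141.8.144.18",
    "141.8.143.141",
    "5.255.250.7",  # invented host, for illustration only
]

counts = tally_by_prefix(sample)
for prefix, hits in counts.most_common():
    print(f"{hits:5d}  {prefix}")
```

Grouping on the first three octets matches how the list above is written. A single exact IP dominating one prefix, as 141.8.144.18 does here, would show up by running `Counter(ips)` on the unmodified addresses instead.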

keyplyr

6:26 am on Feb 26, 2018 (gmt 0)




I thought the IP ambiguity was limited to growing pains. I had expected more specific ranges after they got established in Palo Alto, but nope.

IMO having an explicit crawl range is required equipment for a major SE. How else are webmasters supposed to account?

lucy24

9:23 pm on Feb 26, 2018 (gmt 0)




Did you notice that I found a faker up above? It came through as a 403--not because I cross-check UA and IP, as one would with Google, but because it sent entirely different headers than the real Yandex.

I've long been intrigued by Yandex's combination of {vast array of IP ranges} with {clear preference for a specific IP down to the last digit}.
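
For comparison, the UA-versus-IP cross-check mentioned above (the approach used for Google, not the header-based test that caught this faker) could be sketched as follows in Python. The range list holds only the two blocks named explicitly in this thread, so it is deliberately incomplete. The function names and the `37.72.0.1` test address are illustrative assumptions.

```python
import ipaddress

# Only the two ranges spelled out in this thread; NOT a complete Yandex list.
KNOWN_YANDEX_NETS = [
    ipaddress.ip_network("87.250.224.0/19"),
    ipaddress.ip_network("130.193.32.0/19"),
]


def claims_yandex(user_agent):
    """True if the User-Agent string mentions Yandex (case-insensitive)."""
    return "yandex" in user_agent.lower()


def is_suspect(ip, user_agent, networks=KNOWN_YANDEX_NETS):
    """True when the UA claims Yandex but the IP is outside every listed range."""
    if not claims_yandex(user_agent):
        return False
    addr = ipaddress.ip_address(ip)
    return not any(addr in net for net in networks)


ua = "Mozilla/5.0 (compatible; YandexBot/3.0; +http://yandex.com/bots)"
print(is_suspect("37.72.0.1", ua))      # hypothetical address in 37.72
print(is_suspect("87.250.224.10", ua))  # inside the /19 posted above
```

With a list this short, genuine Yandex visits from 141.8 or 5.255 would also be flagged, which is exactly the maintenance burden that a sprawling set of crawl ranges creates.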

keyplyr

9:51 pm on Feb 26, 2018 (gmt 0)




"[ faker from 37.72 ]"
Reoccurring or just once?

lucy24

1:16 am on Feb 27, 2018 (gmt 0)




Just once. I don't see any part of 37.72 very often; it seems to be a mix of servers and ISPs in assorted countries. Going back over the past year, I find the occasional robot (including wp-login attempts), something claiming to be from uptime-eu.net, and one human image request. Shrug.

Admittedly it's unusual for a robot to swing by claiming to be Yandex and only request one interior page.

:: insert boilerplate about what goes on in the mind of a robot ::