Forum Moderators: open

Message Too Old, No Replies

LG/U8120/v1.0

ignoring robots, nocache tags

         

keyplyr

6:47 pm on Apr 5, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



217.12.8.122 - - [05/Apr/2006:10:17:46 -0700] "GET /images/file-name.gif HTTP/1.1" 200 2820 "-" "LG/U8120/v1.0"

Got a couple hundred of these this morning. No request for robots.txt, no HTML, just image file requests - all from directories disallowed by robots.txt. IP range is Yahoonet Europe. I'm guessing they're caching images for their mobile service (LG being a phone.) They're also ignoring the NOCACHE tags, but then they aren't requesting the webpages.

volatilegx

1:53 pm on Apr 7, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



You could always use mod_rewrite to deny requests for *.gif and *.jpg files without HTTP_REFERER of your domain.

keyplyr

11:56 pm on Apr 9, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Thanks volatilegx, I stopped them when I saw it. This was more of a FYI post identifying the spider and its behavior.

Ocean10000

8:18 pm on Apr 11, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



hello keyplyr

The UserAgent string you posted is a mobile phone browser. I am assuming its going though a proxie to get what it needs. Do a search on "LG U8120", and you will be able to discover more about this phone.

keyplyr

12:35 am on Apr 12, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Do a search on "LG U8120", and you will be able to discover more about this phone.

Yes thanks, I used to have an LG. Once again, I'm not confused to what it is or how to block it. The topic of this forum is 'Search Engine Spider Identification' and as such, I am identifying this user agent's behaviour.

Ocean10000

2:43 pm on Apr 14, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Just a quick question for you keyplyr. Do you log files track referrer information? If they did what referrer information if any was sent when those images were requested?

What I have found with many cell/mobile phone browsers is often html pages get grabed/cached by there provider and then proccessed into something phone will be able to deal with, and then the phone grab anything else it needs from there.