Forum Moderators: open
Host: 173.244.181.29
/
Http Code: 403 Date: Jul 02 23:25:20 Http Version: HTTP/1.1 Size in Bytes: 13
Referer: -
Agent: Mozilla/5.0 (Windows NT 6.3; rv:36.0) Gecko/20100101 Firefox/36.0
This one
Now I have one 1.2MB
None of the .htaccess files for my five sites is more than 6 kilobytes.
...it's also a matter of how much time you want to spend on this matter.Webmaster priorities vary greatly I would assume. Some are hobbyists, some run fan pages or family sites and others are in business, to varying degrees. My main site pays my monthly mortgage and most of my other bills, so I spend the amount of time necessary.
The longer it is, the more you yourself have to plow through when editing it.Well as I've said in other threads, I don't have very many rewrite rules, probably under a dozen lines so there's nothing to "plow through." However, it's probably a matter of perception. I'm used to working with my file. I know where everything is :)
I don't enjoy looking at logs and working on .htaccess, and prefer to spend my time in other ways
One of my sites has no link from the homepage to homepage itself.Well, no reason why any site would ever need a link from any page to itself. I'd count it as "worse than useless" because it confuses the user.
Anything that sends a referrer as root of that domain ... is banned on the spot.I've got a whole cluster of this kind of site-specific lockouts. mod_rewrite by itself can't globally block auto-referers,* worse luck, but some patterns can be ruled out. No matter how your site is laid out, there will be universals, like claiming to come from EXAMPLE.COM when the site is really www.example.com.
Well, no reason why any site would ever need a link from any page to itself. I'd count it as "worse than useless" because it confuses the user.Maybe you misundstood (or maybe I am) but it is standard practice for sites to link their logo back to the "home" page. Almost every major site does this, including all mine.
Anything that sends a referrer as root of that domain ... is banned on the spot.My version of that is - I block any "page" request that includes that respective page as the referrer. Luckily my server config does not redundantly include the same referrer page when the browser reloads.
it is standard practice for sites to link their logo back to the "home" page
Luckily my server config does not redundantly include the same referrer page when the browser reloads.
I block any "page" request that includes that respective page as the referrer.
Sure, but you don't need it to be an active link on the home page itself.Don't need to, but if I have the logo linked to the homepage on 400 pages. If the logo on the homepage doesn't do the same thing, I compromise the trust factor. IMO consistency is an asset. Visitors need to trust the site. Besides, I built it into the page header I include atop all pages. Not to have it on one page would cause a second header to be needed.
I doubt it has anything to do with the server. A browser simply doesn't send a referer on a refresh/reload; it's functionally the same as a bookmark request.Of course, that's what I'm saying... so when my script identifies this behavior, it is a bot.
How? I mean, how, mechanically? Do all page requests detour via a quick php script, or is it a very small site? (Other than the front page, I can only block auto-referers for a handful of specified html pages, each coded individually.)Well "detour" implicates leaving the normal path. My server-side scripting is more succinct than that IMO. This is what I meant when I said I got rid of most all the rewrites that once resided in htaccess. Depending on the request header, some hits run through cgi scripting that, if conditions matched, run several rules. The "no-self-referring" rule does allow for query string & parameter occurrence by method as well as host.
One of my sites has no link from the homepage to homepage itself. Anything that sends a referrer as root of that domain(http :// www.domain.tld/ ) is banned on the spot.
But that code doesn't work against the type of bots that I wanted to discuss when I started this thread, because they don't provide a referer.I invite you to consider the benefit of parsing request headers. Bots will usually send malformed header data that can be used to identify them as bots.
consider the benefit of parsing request headers
[edited by: bill at 12:59 am (utc) on Jul 13, 2015]
[edit reason] fix formatting [/edit]