Forum Moderators: open
They are asking for a file but using a malformed domain without the suffix:Err. No they are not.
GET /%7Emy.domain/rss.xml
/~my.domain/rss.xml. 10.20.30.40/~my.domain/rss.xml or yourhostingcompany.com/~my.domain/rss.xml They are asking for a file but using a malformed domain without the suffix:
GET /%7Emy.domain/rss.xml
Err. No they are not.
They are requesting the path and file /~my.domain/rss.xml.
My site is my-domain.org.uk/No. That is not what is happening. The GET request as logged shows only the path and file.
They are asking for my.domain/
Surely that IS a malformed domain? They've turned the hyphen between my and domain, into a dot.
10.20.30.40/~my.domain/rss.xml or yourhostingcompany.com/~my.domain/rss.xml robots.txt file loaded in your filespace because that file will appear to be at: 10.20.30.40/~my.domain/robots.txt or yourhostingcompany.com/~my.domain/robots.txt robots.txt file needs to be in the root of the requested domain to have any effect. .htaccess code ahead of all your other code to see if redirecting those requests to your own URL space makes any difference (it should do). RewriteCond %{THE_REQUEST} ^[A-Z]{3,9}\ /(~|%7E)my\.domain
RewriteRule (.*) http://www.example.com/$1 [R=301,L]
RewriteCond %{HTTP_HOST} !^(www\.example\.com)?$
RewriteRule (.*) http://www.example.com/$1 [R=301,L] [edited by: g1smd at 7:37 am (utc) on Sep 18, 2011]
don't want feedfetcher calling every forty minutes and ignoring robots.txt which only allows named bots. I know it can't read robots.txt now because of my useragent ban