Can google crawl site log files? After Google analytics?
I made a search with numbers and found .gz site log files as a result!
matrix_neo
12:05 pm on Nov 18, 2005 (gmt 0)
I have made a search to find out some information using a phone no, but ended getting lot of .gz files with site log I wounder how long it is hapening! Isn't sensitive data to be available on net?
Section_8
7:53 pm on Nov 18, 2005 (gmt 0)
Are your gz files in a separate private directory stored on the server? This tends to happen when they are not located correctly on the server side. Google doesn't really care what it finds, as long as it thinks it is relavent to the search string. So it will (for some odd reason) pick up gz files. Just make sure they are stored correctly. May take google a few days to drop the string from it's search database, but you'll be good to go. And the end user will end up with a dead link for those couple days it takes for it to go down.
matrix_neo
7:42 am on Nov 19, 2005 (gmt 0)
Thanks for your response my log files are in Logs folder and html files are in public_html folder both are in the same level and my index file is only inside the public_html and no way there is link to my log file then how google is crawling those files? can it access files that do not have any link whatsoever on the whole internet? Unless it does knowingly based on folder structure? Correct me if I am wrong?
Section_8
5:01 pm on Nov 30, 2005 (gmt 0)
Strange....I'm honestly not sure then. Dang... :( It really doesn't make much sense at all does it? Google, usually ONLY spiders directly linked documents. I'm at a loss.
vincevincevince
5:10 pm on Nov 30, 2005 (gmt 0)
I would imagine that there is a link... somewhere...
physics
5:30 pm on Nov 30, 2005 (gmt 0)
ended getting lot of .gz files with site log
Are they _your_ log files or someone elses? You never specified that...
physics
5:31 pm on Nov 30, 2005 (gmt 0)
In any case log files have been available on the 'net for some time ... this is the motivation behind log spamming.