
Google SEO News and Discussion Forum

    
Google indexed the content of my zip files
~15 mb each, wmv filename inside is listed
danimal




msg:775747
 5:36 am on Sep 2, 2005 (gmt 0)

the site: command search is bringing up a list of the zipped video files on my site... google has attempted to index all of the content of each of the zip files, it's showing up as gibberish text surrounding the filename that's inside the zip file.

some of these zipped video files are showing up as what google thinks is a page full of japanese content that it attempts to translate, lol.

i can find no reference here or in the google website faqs about this capability of crawling and indexing the actual content of zip files... needless to say, it's trashing the google index with worthless data.

several weeks ago this website took an inexplicable nosedive in the ratings, so i'm also wondering if this zip issue is relevant to that.

 

Brett_Tabke




msg:775748
 11:48 am on Sep 2, 2005 (gmt 0)

very interesting. Thanks for sharing.

kaled




msg:775749
 2:52 pm on Sep 2, 2005 (gmt 0)

Some time ago, several .exe files from my site were indexed. This wasn't a bug, they were correctly identified by Google as Windows executables. I cannot begin to guess why Google would be interested in .exe files and they are no longer indexed.

Google probably has the technology to recognise many file formats that they do not normally index. In this case, something probably went wrong with experimental code.

Kaled.

Chico_Loco




msg:775750
 4:02 pm on Sep 2, 2005 (gmt 0)

Can you check your server headers and let us know if your server is reporting the correct content-type for your zip files?

If the server is reporting them as text or some other file type, that would explain why Google tried to index them as text.
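
For anyone who wants to run this check themselves, here is a rough sketch in Python using only the standard library. It sends a HEAD request and reports the status code and Content-Type the server returns. The function names, the list of zip MIME types, and the example URL are my own assumptions for illustration, not anything specific to this site or to how Google decides what to index.

```python
import urllib.error
import urllib.request

# MIME types commonly sent for zip archives (an assumption; adjust as needed).
ZIP_TYPES = {"application/zip", "application/x-zip-compressed"}


def media_type(header_value):
    """Strip parameters such as '; charset=...' from a Content-Type value."""
    return (header_value or "").split(";")[0].strip().lower()


def head(url, timeout=10):
    """Send a HEAD request; return (status code, media type)."""
    request = urllib.request.Request(url, method="HEAD")
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status, media_type(response.headers.get("Content-Type"))
    except urllib.error.HTTPError as error:
        # Error responses (404 etc.) still carry headers, but they describe
        # the error page rather than the file that was requested.
        return error.code, media_type(error.headers.get("Content-Type"))


def served_as_zip(status, content_type):
    """True only if the file was found and labelled as a zip archive."""
    return status == 200 and content_type in ZIP_TYPES


# Example (hypothetical URL):
# status, ctype = head("http://www.example.com/videos/clip.zip")
# print(status, ctype, served_as_zip(status, ctype))
```

Keep in mind that a 404 response normally carries the Content-Type of the error page, so it says nothing about how the zip file itself is served. Check that the URL returns 200 before reading anything into the header.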

2hot2handle




msg:775751
 6:45 pm on Sep 2, 2005 (gmt 0)

Hi everybody,
This is an OLD problem. Google indexes .jpg, .gif, .zip, .arj, etc.

I notified Google about this.

[edited by: jatar_k at 7:02 pm (utc) on Sep. 2, 2005]
[edit reason] no email quotes [/edit]

danimal




msg:775752
 6:57 pm on Sep 2, 2005 (gmt 0)

that's a good idea... i found an online tool at the seoconsultants website, entered the full url of one of the zip files, and it returned a 404 not found error, with the content-type: text/html.

it's a reseller account on an apache server... the cpanel mime types show "application/zip zip".

danimal




msg:775753
 7:58 pm on Sep 10, 2005 (gmt 0)

just an update... it now appears that google has stopped listing the zip files as part of the site: search.

traffic to the site has not returned to previous levels, but i'm glad that they got the zip problem fixed.

caveman




msg:775754
 6:28 pm on Sep 11, 2005 (gmt 0)

Yeah, not new.

But this thread serves as a great reminder: If it's on a connected server, you're choosing to share. Don't be too consoled by the fact that the file is not viewable when doing a search. Neither are all of your site's backlinks.

...all of the world's information...

danimal




msg:775755
 7:27 pm on Sep 11, 2005 (gmt 0)

the zip files are linked on the webpage... i put 'em up as zips instead of wmv because it doesn't use as much bandwidth: you have to really want to see the content to go thru the zip hassle.

the point of this thread is that google does *not* post the actual content of zip files as search results... i believe that this was an abnormality that they fixed.

hopefully they will fix the bogus backlink reporting someday as well :-0
