Forum Moderators: open

Message Too Old, No Replies

Google just spidered all my Image files

Alt Tags

         

mmr82

9:31 pm on Dec 27, 2002 (gmt 0)

10+ Year Member



Hi,

Is that normal? Google just visited all my Image files nothing else. Is Alt tags what they are looking for?

Mohamed

NeedScripts

10:36 pm on Dec 27, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Is Alt tags what they are looking for?

May be not. I think it was after the images. Check out wonderful google search. :)

Finder

10:41 pm on Dec 27, 2002 (gmt 0)

10+ Year Member



Check your logs for the User-Agent "Googlebot-Image/1.0". This is the spider for Google's image archive [images.google.com].

The bot visited me today as well. Fortunately it obeys the robots protocol and didn't take any images.

Marval

1:06 pm on Dec 28, 2002 (gmt 0)

10+ Year Member



Just noticed that I had the image bot all over one of my sites yesterday and today...the only bad thing is it didnt ask for the robots.txt file until the most recent crawl...it was eating images all day yesterday without looking at the robots file.

born2drv

1:33 pm on Dec 28, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



If you guys don't mind me asking a basic question, what is the line you put in your robots.txt file to disallow google image bot?

Macguru

3:23 pm on Dec 28, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



If you happen to have all images in a single folder :

User-agent: *
Disallow: /images/

This will tell all image bots not to go in your images folder. If you have many images folders repeat with full path to all images folders.

User-agent: *
Disallow: /gizmo_folder/images/

User-agent: *
Disallow: /widget_folder/images/

If you want to disallow only Google, go for :

User-agent: Googlebot-Image
Disallow: /images/

teeceo

4:23 pm on Dec 28, 2002 (gmt 0)

10+ Year Member



what harm can come from getting your Images spidered? Is it a thing like if you have the same image then you'll get banned or something?

teeceo.

dingman

4:33 pm on Dec 28, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I think it's more a question of what benefit can come from it. For some sites, there might be a real benefit. For others, it's just using up bandwidth.

As examples: I have a number of slide-show sites. Two of them are travel photo sites and have quite respectable PR, and the whole reason for their existence is to showcase those photos. I'm not about to block a search engine that's interested in those site's graphics. On the other hand, some of them are sub-sections of my family domain with stuff like my aunt's wedding photos. The family site, while publicly accessible, isn't ever going to be of much interest to people outside my extended family. No real benefit to be had from serving those image files to spiders, just bandwidth use.

soapystar

4:53 pm on Dec 28, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



is
User-agent: *
Disallow: /images/

the same as

User-agent: *
Disallow: /images/ #

Macguru

5:15 pm on Dec 28, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



"#" is used to comment robots.txt files such as <!-- bla bla --> is used in HTML documents.

I would be surprised "#" could be a wild card of any kind.

You could use :

User-agent: *
Disallow: /images/ # Dont you dare indexing my images, stupid bot! ;)

creative craig

2:35 pm on Dec 30, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I just been hit over night by the Googlebot-Image/1.0. This is the first time that Google has done this for any of my sites, when does the image index update etc.

It was pretty active took all the images even image spacers!

Craig