Why does Google GET images?

Why not just HEAD them to see that the file exists?
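
For readers unfamiliar with the distinction in the question: a HEAD request returns only the status line and headers, while a GET also transfers the file itself. Below is a minimal Python sketch using the standard library's http.client. The function name `probe` and the host and path in the usage comment are placeholders for illustration, not anything taken from this thread.

```python
import http.client


def probe(host, port, path, method):
    """Send one request and return (status, Content-Length header, body bytes received)."""
    conn = http.client.HTTPConnection(host, port, timeout=10)
    try:
        conn.request(method, path)
        resp = conn.getresponse()
        body = resp.read()  # a HEAD response carries no body, so this is empty
        return resp.status, resp.getheader("Content-Length"), len(body)
    finally:
        conn.close()


# Hypothetical usage (placeholder host and path):
#   probe("www.example.com", 80, "/images/logo.gif", "HEAD")
#   probe("www.example.com", 80, "/images/logo.gif", "GET")
```

A HEAD is enough to confirm that a file exists and to read its size from the headers, but anything that needs the image data itself has to GET it.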

bcc1234

12:37 am on Dec 11, 2002 (gmt 0)

Do they keep the images cached somewhere?

How long does it take for the images to get indexed?

[edited by: bcc1234 at 12:40 am (utc) on Dec. 11, 2002]

deejay

12:40 am on Dec 11, 2002 (gmt 0)

I haven't noticed Google getting my images, although they obviously did at some stage when the site was with my old host, as my images are included in their image search. :(

Oh, as to whether they cache them: they must, because my images don't show hits in my logs when I search for them on Google.

[edited by: deejay at 12:42 am (utc) on Dec. 11, 2002]

bcc1234

12:42 am on Dec 11, 2002 (gmt 0)

Googlebot is making up to 7 hits per second on images.
It has already fetched a few thousand and doesn't seem to be stopping.

deejay

12:43 am on Dec 11, 2002 (gmt 0)

Yikes... that could get pretty heavy pretty quickly.

What IP is the bot using?

bcc1234

12:46 am on Dec 11, 2002 (gmt 0)

64.68.86.

Visit Thailand

1:25 am on Dec 11, 2002 (gmt 0)

Is there any way to stop Googlebot from caching the pics while still letting it in to get the URL?

deejay

1:54 am on Dec 11, 2002 (gmt 0)

[google.com...]

The link is to Google's FAQ. If you don't want your pictures indexed, you could ban Googlebot from your images directory if you have one, or you could ban it from grabbing *.gif or *.jpg.

Interesting - I didn't know you could specify a file type rather than an area.

jdMorgan

2:12 am on Dec 11, 2002 (gmt 0)

Just be advised that these extensions to the robots.txt protocol are Google-specific. Few (if any) other robots support wild-card file names or the Allow directive. If you use them, put these Google-specific directives into a Google-only User-agent section of robots.txt, and include a separate section for other 'bots.

Jim
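
To illustrate the separate-sections advice, here is a rough Python sketch that feeds a two-section robots.txt to the standard library's urllib.robotparser and asks what each robot may fetch. The directory names, the agent name "OtherBot", and the helper `allowed` are made up for the example. As far as I know, this parser does not interpret wildcard patterns, so the sketch uses a plain directory rule in the Googlebot section rather than *.gif or *.jpg.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: one section only for Googlebot, one for everyone else.
ROBOTS_TXT = """\
User-agent: Googlebot
Disallow: /images/

User-agent: *
Disallow: /cgi-bin/
"""


def allowed(robots_txt, agent, path):
    """Return True if `agent` may fetch `path` under the given robots.txt text."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, path)


if __name__ == "__main__":
    for agent in ("Googlebot", "OtherBot"):
        for path in ("/images/a.gif", "/cgi-bin/x", "/index.html"):
            print(agent, path, allowed(ROBOTS_TXT, agent, path))
```

Note that a robot matching its own named section follows only that section, so in this sketch Googlebot is not bound by the rules under `User-agent: *`.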

zechariah

3:48 am on Dec 11, 2002 (gmt 0)

This forum's robots.txt contains:

User-agent: Googlebot-Image
Disallow: /

That would most likely stop Google from fetching your images.
I hope this helps.
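
One way to sanity-check a rule like this before relying on it is to run it through a robots.txt parser. The sketch below uses Python's urllib.robotparser; the test paths and the helper name `may_fetch` are invented for the example, and the parser's answer only shows how a standards-following robot should read the file, not what Google's crawler actually does.

```python
from urllib.robotparser import RobotFileParser

# The two lines quoted above, as a robots.txt body.
ROBOTS_TXT = """\
User-agent: Googlebot-Image
Disallow: /
"""


def may_fetch(agent, path):
    """Return True if `agent` may fetch `path` under ROBOTS_TXT."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(agent, path)


if __name__ == "__main__":
    # Hypothetical paths, for illustration only.
    print(may_fetch("Googlebot-Image", "/images/logo.gif"))
    print(may_fetch("Googlebot", "/index.html"))
```

Under this reading the image crawler is shut out of the whole site, while the regular Googlebot is unaffected because the section names only Googlebot-Image.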