Timing out during crawling

Why does it happen?

         

tkarade

9:03 pm on Feb 3, 2003 (gmt 0)




Why do some URLs time out when a site is being crawled?
What does it mean when they time out?

Thanks,
TK

wilderness

1:59 am on Feb 4, 2003 (gmt 0)




The pages don't load fast enough, or the bot has a limit on how long it waits for a page to be delivered.

"206" (Partial Content): an incomplete page.

tkarade

7:18 pm on Feb 4, 2003 (gmt 0)




Thanks wilderness,

If you don't mind, could you elaborate a little more on your answer?

1. The pages don't load fast enough:
-> What are the reasons for this?

2. The bot has limit settings on how long it waits for page delivery:
-> Are these settings specific to the bot, or can they be changed some other way (say, through the operating system)?

Thanks,
TK

wilderness

9:58 pm on Feb 4, 2003 (gmt 0)




1. The pages don't load fast enough:
-> What are the reasons for this?

The page could have too many images, images that are too large, or a combined total of images and content that is too large. There could even be a troubled image that your site host is not serving up completely. Another example might be too much content: I had a page (since removed) that was cut and pasted from an MS-Access file. The page was 180 KB, and hardly any bot returned a 200 result on it.
I'm inclined to think similar things happen on the multitude of Flash and other media pages that approach a megabyte in size :(
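
As a rough illustration of why page size matters, the sketch below estimates how long a page takes to transfer at a steady throughput and compares that with a bot's wait limit. The 180 KB figure is the page mentioned above; the 56 kbit/s throughput and the 10-second limit are assumptions picked for the example, not the settings of any real bot.

```python
def transfer_seconds(size_bytes, bits_per_second):
    """Estimate the time to deliver size_bytes at a steady throughput."""
    return size_bytes * 8 / bits_per_second


def would_time_out(size_bytes, bits_per_second, wait_limit_seconds):
    """True if the estimated transfer takes longer than the bot will wait."""
    return transfer_seconds(size_bytes, bits_per_second) > wait_limit_seconds


page_bytes = 180 * 1024   # the 180 KB page from the post above
throughput = 56_000       # assumed effective throughput, in bits per second
wait_limit = 10           # assumed bot wait limit, in seconds

print(round(transfer_seconds(page_bytes, throughput), 1))   # 26.3
print(would_time_out(page_bytes, throughput, wait_limit))   # True
```

Real delivery time also depends on server load and latency, so treat this only as a lower bound on how long the bot has to wait.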

2. The bot has limit settings on how long it waits for page delivery:
-> Are these settings specific to the bot, or can they be changed some other way (say, through the operating system)?

Neither you nor I have any control over the bot's settings (algorithms).
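
To show where such a limit lives, here is a minimal sketch of a client that gives up when a page is not delivered in time. It is not how any particular search engine bot works; the function names `fetch` and `start_slow_server` are made up for this example, and the slow local server only stands in for a slow site. The point is that the wait limit is an argument chosen by whoever runs the client, so the site owner cannot change it.

```python
import socket
import threading
import time
import urllib.error
import urllib.request
from http.server import BaseHTTPRequestHandler, ThreadingHTTPServer


def fetch(url, timeout_seconds):
    """Return (status, body_length), or ("timeout", 0) if delivery is too slow."""
    try:
        with urllib.request.urlopen(url, timeout=timeout_seconds) as response:
            body = response.read()
            return response.status, len(body)
    except (socket.timeout, TimeoutError):
        # Raised while waiting for the response or reading the body.
        return "timeout", 0
    except urllib.error.URLError as error:
        # A timeout while connecting is wrapped in URLError.
        if isinstance(error.reason, (socket.timeout, TimeoutError)):
            return "timeout", 0
        raise


def start_slow_server(delay_seconds):
    """Start a local test server that waits delay_seconds before answering."""

    class SlowHandler(BaseHTTPRequestHandler):
        def do_GET(self):
            time.sleep(delay_seconds)  # simulate a slow page
            body = b"hello"
            self.send_response(200)
            self.send_header("Content-Length", str(len(body)))
            self.end_headers()
            self.wfile.write(body)

        def log_message(self, format, *args):
            pass  # keep the example quiet

    server = ThreadingHTTPServer(("127.0.0.1", 0), SlowHandler)
    threading.Thread(target=server.serve_forever, daemon=True).start()
    return server, "http://127.0.0.1:%d/" % server.server_address[1]
```

For example, with `server, url = start_slow_server(0.5)`, calling `fetch(url, 0.1)` reports a timeout, while `fetch(url, 5)` returns the page. The only thing the site owner can influence is the server side of that exchange, that is, how quickly the page is delivered.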