Page is a not externally linkable
- Search Engines
-- Sitemaps, Meta Data, and robots.txt
---- Crawler software for really big sites


Josuah - 8:32 am on Mar 17, 2011 (gmt 0)


Does anyone have experience with a crawler software on big sites with around a million pages? When is not possible to do it by parts or categories.

I have tried <snip> and some other, but they can't manage all the data and the computers runs out of memory or similar.

Thanks.

[edited by: goodroi at 10:57 am (utc) on Mar 17, 2011]
[edit reason] Please no product mentions [/edit]


Thread source:: http://www.webmasterworld.com/robots_txt/4282856.htm
Brought to you by WebmasterWorld: http://www.webmasterworld.com