What I have is a TXT file, a list of URLs. In fact, it's a list of every URL in an entire website, about 3000 entries long, harvested using a crawler a few weeks ago.
Since assembling the list, the site has gone through a massive reorganization, moved to a new platform, software, CMS, and everything.
I need a tool that will load up that list, smack each URL in it to see if it's still OK, and report which ones are not. A simple HEAD request will do; I only need to know the HTTP status: is it 200, 301, 404, 500, or what have you.
Is there any software that does exactly that?
wget - you surely have this installed somewhere already, and I'm pretty sure it can be put to this kind of use... but I'll let you work out the command-line options required ;) [gnu.org...]
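If you'd rather script it than dig through the wget options, here is a rough Python sketch of the same idea: read the list, send a HEAD request to each URL, and print anything that doesn't come back as 200. It uses only the standard library. The names `head_status`, `check_file`, and `urls.txt` are made up for illustration, and it assumes one URL per line in the file. Redirects are deliberately not followed, so a 301 shows up as 301 rather than as the status of wherever it points.

```python
import sys
import urllib.error
import urllib.request


class NoRedirect(urllib.request.HTTPRedirectHandler):
    """Refuse to follow redirects so 301/302 are reported as-is."""

    def redirect_request(self, req, fp, code, msg, headers, newurl):
        return None


def head_status(url, timeout=10):
    """Return the HTTP status code for a HEAD request, or an error string."""
    opener = urllib.request.build_opener(NoRedirect)
    try:
        req = urllib.request.Request(url, method="HEAD")
        with opener.open(req, timeout=timeout) as resp:
            return resp.status
    except urllib.error.HTTPError as err:
        # 3xx (not followed), 4xx and 5xx responses all land here.
        return err.code
    except (urllib.error.URLError, OSError, ValueError) as err:
        # DNS failures, refused connections, timeouts, malformed URLs.
        return f"ERROR: {err}"


def check_file(path):
    """Yield (url, status) for every non-blank line in the file."""
    with open(path, encoding="utf-8") as handle:
        for line in handle:
            url = line.strip()
            if url:
                yield url, head_status(url)


if __name__ == "__main__" and len(sys.argv) > 1:
    # Only report the URLs that are not a plain 200.
    for url, status in check_file(sys.argv[1]):
        if status != 200:
            print(status, url)
```

Saved as, say, `check_urls.py`, it would be run as `python check_urls.py urls.txt`. It checks one URL at a time, so 3000 of them will take a while. Also, some servers answer HEAD differently from GET, so anything odd in the report is worth a second look in a browser.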