I recently took on a client who's had several people work on their site in the past, and all of them left behind a mess of old, now-unused files.
I was wondering what y'all could recommend as a good way to get an index of all the currently available pages and images on the site, so I can delete everything that's not on that list.
I essentially want to crawl my own site and get a list of local file names, including images. It would make life a heck of a lot easier.
My guess is that there's some nice *nix command line tool that would do the trick with some crafty shell command, but I'm overlooking it.
Any tips would be appreciated. T.I.A.
You should check the documentation first to get the right syntax, but I believe it is something like:
wget -r -p --random-wait http://www.example.com/
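Once the crawl has finished, the mirrored copy can be compared against the real document root to see which local files were never fetched. Here is a rough sketch of that step, not a tested recipe for any particular site. The function name list_unreferenced and the directory names are made up for illustration, and it assumes the crawl was saved without the host directory (for example by adding -nH -P crawl to the wget command above).

```shell
#!/bin/sh
# list_unreferenced DOCROOT MIRROR  (hypothetical helper name)
# Prints files that exist under DOCROOT but were not saved under MIRROR,
# i.e. files the crawler never reached by following links.
# Assumed crawl command: wget -r -p --random-wait -nH -P crawl http://www.example.com/
list_unreferenced() {
    docroot=$1
    mirror=$2
    live_list=$(mktemp)
    crawl_list=$(mktemp)

    # Relative paths of every regular file, sorted so comm can compare them.
    (cd "$docroot" && find . -type f | LC_ALL=C sort) > "$live_list"
    (cd "$mirror" && find . -type f | LC_ALL=C sort) > "$crawl_list"

    # -23 keeps only the lines unique to the first list (the live site).
    LC_ALL=C comm -23 "$live_list" "$crawl_list"

    rm -f "$live_list" "$crawl_list"
}
```

You would call it as something like list_unreferenced /path/to/docroot crawl, where both paths are placeholders. Treat the output as candidates only: anything that is not reachable by following links (files used only by scripts, includes, pages that are live but orphaned) will show up too, and dynamic pages may be saved under different names than the files that generate them. Move things aside before deleting for good.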