Forum Moderators: phranque

Message Too Old, No Replies

Inherited Mess, easy clean up?

Quick way to index orphaned files?

         

riscit

1:35 am on Jan 27, 2005 (gmt 0)

10+ Year Member



Hello folks!

I recently took on a client who's had several people work on their site in the past, and all of them left a mess behind of old, now un-used, files.

I was wondering what ya'll could recomend to a good way to get an index of all the currently available pages and images on the site, so I can delete everything that's not on that list.

I essentially want to crawl my own site and get a list of local file names, including images. It would make life a heck of a lot easier.

My guess is that there's some nice *nix command line tool that would do the trick with some crafty shell command, but I'm overlooking it.

Any tips would be appreciated. T.I.A.

encyclo

1:50 am on Jan 27, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



If you want to rip your own site, then you could use wget. If you don't know it, it's a Unix/Linux command-line program, although there is also a Windows version available.

You should check the documentation first to get the right syntax, but I believe is is something like:

wget -r -p --random-wait
http://www.example.com/

riscit

2:10 am on Jan 27, 2005 (gmt 0)

10+ Year Member



Works great. Thanks.

> there is also a Windows version available

heh.. that's nice. ;)