I recently started working for a medium sized city with a gigantic, overgrown, and bloated website. Now I know all of you reading this are asking yourself "How bad can it be?" At last count earlier in the month we had over 14,000 pages only including HTML, CFX, and ASP pages on the internet and almost 10,000 HTML pages on the intranet. The previous web master had no idea how to organize, and truly manage a website, so he went about helping all of the departments train a web developer, and then pretty much gave them run of the ship. Needless to say, I'm the big ugly Heavey that's been hired to get things inline.
To first address this problem and help control this in the future we are moving to Microsoft CMS and implementing Urchin so that we can track page visitors and such. Because of the sheer size, we expect our implementation to take 18 months after having all of the templates created and the front and main pages redesigned.
The next tool that I see that we will need to move this project along is a link checker. It's most important job will be to find orphan files, so that we can dust off all the junk before we try cleaning up the rest. It should also be able to follow relative and static links, and different domain names (a whole other mess) seeing them as the same web. I also need it to create a text based report so that I can then use it to create an archiving script. As with any person who works for government, I want it to be free and do most of the work itself, so that I can find other ways of spending tax dollars. (That's a joke by the way.)
I don't have many requirements do I?
Any suggestions you have that meet any of the requirements would be greatly appreciated.