Forum Moderators: Robert Charlton & goodroi

Message Too Old, No Replies

Tired of 404's for old pages

Should I just bring them back online?

         

icedowl

4:13 pm on Jul 17, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



In February of 2005 I switched my main site from purely static pages to dynamic database driven pages (Mambo, now Joomla). The reason was that the old static version was getting to be a major headache to update and add new pages. At that time there had been around 400 pages. The site has now more than doubled in size due to the ease of adding new content.

Google still lists a few of these old pages in search as if they were still online. Not as supplemental either.

Links to these old pages from other sites still exist. I guess a lot of site owners don't check their links very often.

I've considered doing 301 redirects for those old pages, but the sheer quantity of them seems like it would be a nightmare to accomplish. It would also take me a month of Sundays and then some to add that many 301's. Not a task I really care to do.

I've tried simply blocking access to the folder that they had been in with no luck. Google just doesn't want to forget them.

If I bring them back, should I add a note to each page (a small task) to let any visitors know that they are on old pages?

Opinions? Possible downsides? Possible Upsides?

icedowl

10:24 pm on Jul 18, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I guess nobody knows. :(

SuddenlySara

10:45 pm on Jul 18, 2006 (gmt 0)



So do you have duplicate page names one as .html and the new ones .php or something?
If the old page names are not duplicate of your new page names and the old pages have unique content
and links to these pages...Bring them back in the navigation structure of your site?

Quadrille

11:36 pm on Jul 18, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



If you restore them, and you also have newer versions, then the duplicate issue will cause greater headaches, for longer.

Be sure all the pages (all orphans) are not only 'unlinked' but physically removed from the server - xenu is your friend.

Make sure you have a quality 404 page, that should not be a worry.

Monitor your stats; you''ll find the number of visitors to those pages is small and diminishing, as newer pages (that really exist) will tend to be above them (or replace them) in the serps.

If any pages do get more than a handful of visitors, 301 those pages alone.

'Ghost' pages exist more in the serps than in the real world - webmasters see them becase thy search for domain names. Visitors, searching for keywords, will rarely visit - and in most cases, a quality 404 page will suffice. It doesn't have to look like a 404 - visitors will think it's simply an index.

But check the stats to 301 any that really need it (very, very few, I bet)

icedowl

12:22 am on Jul 19, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Thank you. I'll just hope that they eventually die off in short order.

They definitely do not exist on the server and haven't for 17 months. I do have a 404 page that has a link on it to the home page of the current version of the site and good navigation to follow from there.

Page names (URLs) are entirely different. All of the old content was brought over to the current version, so those 400+ pages could be seen as duplicate content.

But check the stats to 301 any that really need it (very, very few, I bet)

That amounts to about 40 pages at last count.

Thanks!

Quadrille

1:05 am on Jul 19, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Sounds as if you have most bases covered, though more 301s than I'd have expected :)

Why not put some of those links on the 404 page; saving visitors that one click can make a serious difference - and arriving at an 'obvious' 404 page is not the best intro to your site.

May be worth double checking that none of your pages have links to the 'dead' pages - it's links that keep the ghosts alive, and 'internal' links are the worst!

icedowl

2:53 am on Jul 19, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Nope, no links to those old pages exist on the current site so that's not a concern. Putting links on the 404 page is an idea I hadn't thought of, although I have tried to cover most of them on the home page. It's just frustrating that Google can't seem to forget them.

jomaxx

3:10 am on Jul 19, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



You really need to fix your workflow. You should be able to set up 400 redirects in an hour or two at most. Get a powerful text editor, learn how to use macros and the DOS command line, add some imaginative problem-solving and you're off to the races.

Personally I hate serving up 404's, either to humans or to SE spiders, and even at this late date I suggest you go that route.

icedowl

3:35 am on Jul 19, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Well I'm not so sure it would only take an hour or two, but it can be done. I do work a full-time job and have sleep, etc. to also take care of.

Here's what I think it will take to accomplish this task:
1. All of the old pages are here on my PC. I simply have to locate the files that they are stored in and grab the URLs.
2. Collect the current URLs that are the equivalent of the old URLs.
3. Build a basic 301 that I can copy/paste and then plug in the proper URLs, matching old to new.
4. Paste the entire thing into my .htaccess, which sounds like it will be a huge addition. Do folks usually have such large .htaccess files?

I go back to the early 1980's with DOS but I just don't see how DOS could play a helpful part in this.

Well, I guess I'm off to the races. So to speak.

jomaxx

4:04 am on Jul 19, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



If all the pages are in the same directory, go to DOS and do something like "dir *.html >files.txt"

Bingo. Some simple editing and you've got a text list of all the page names in just a minute or so. That's the basis of your .htaccess file.

If they're in multiple directories, you need to go to the lowest one and do a "dir /s" instead, and massage the resulting file.

That may not be faster than whatever you're planning on doing, but that's why I brought DOS into it.

mcavic

5:49 am on Jul 19, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



If you want to keep as many visitors as possible, 301 redirects are really the only way to go.

g1smd

10:12 am on Jul 19, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Make sure that all of the 404 pages really do return a 404 status code in the HTTP header.

Lingering 404 pages are often caused by the fact that the 404 page is really a 302 redirect rather than a true 404 error.