Forum Moderators: open

Message Too Old, No Replies

Browsers getting 404 error

Some search engines convert mixed case to all lower case

         

Russ Dollinger

7:13 pm on Mar 29, 2002 (gmt 0)



I made the mistake of creating my website with lots of file names with upper case letters. For example XXX.html. Some browsers (for example All The Web, Vivisimo, etc.) show the link as xxx.html which results in a 404 error.
I tried creating a redirect page called xxx.html that would send someone to the correct page, but my HTML editor, and Windows, won't let me save the file to the same folder with the only difference being the case of the letters.
Is there some way around this, or do I have to bite the bullet and rename all the files to just lowercase and change all my links?

Lisa

7:25 pm on Mar 29, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



The all important questions come first! What OS are you suing? What Web Server are you using?

korkus2000

7:30 pm on Mar 29, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



What browsers are case sensitive? IE 6, Opera, and Netscape 6 are not on win 95 through 2K. Is this a horrible new XP bug? My mac has never had this problem of case sensitivity either.

pageoneresults

7:32 pm on Mar 29, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



> Is there some way around this, or do I have to bite the bullet and rename all the files to just lowercase and change all my links?

You know, I've been using lowercase for everything since way back when. And with XHTML just around the corner, lower case and upper case will have completely different results when it comes to coding. I'm not sure what effect it has on file names, haven't studied that far yet.

I just find it much easier to use all lower case as it keep my fingers away from the Shift key!

P.S. I think all browsers are case sensitive when it comes to URL's. Go up to your address window and change something from lower to upper and hit refresh, the forums also return a 404.

(edited by: pageoneresults at 7:35 pm (utc) on Mar. 29, 2002)

diddlydazz

7:33 pm on Mar 29, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



<<<----Is there some way around this

Yeah save them as different names ie FILE as file_1

and then rename when they are on your server, I presume it is a remote host ?

Dazz

Russ Dollinger

7:35 pm on Mar 29, 2002 (gmt 0)



I am using Windows 98 and Explorer 4.72.

My ISP is using an Apache system.

I am using HotDog 6.5 as the web editor.

JayC

7:48 pm on Mar 29, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



>>Some browsers (for example All The Web, Vivisimo, etc.) show the link as xxx.html which results in a 404 error.

I'm guessing that he doesn't mean "some browsers," but rather that some search engines are displaying the link with the uppercase characters converted to lowercase. So anyone clicking on that link, regardless of what browser they're using, gets a 404.

I don't have any sites using mixed case filenames so I can't test that theory, because I don't know of any sites in those named search engines for which that condition might exist!

Lisa

8:29 pm on Mar 29, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Well, if your ISP is running Apache. Then if you can alter the 404 page you could insert code in it to redirect.

/word.html (comes in)
(does not find document, goes to 404 page)
(inside 404 page you read the requested URL. Then you determine if lowercase was the problem)
(If it was a problem then you redirect back to the real file)
redirect /Word.html

But the long term solution is always use lower case in the URL.

If I notice a programmer who is working for me that doesn't use all lower case I tell them to correct it and never do that again.

I find most web programmers have this problem when they migrate from IIS to Apache.

IIS does not care what case you have.

(edited by: Lisa at 8:39 pm (utc) on Mar. 29, 2002)

Russ Dollinger

8:37 pm on Mar 29, 2002 (gmt 0)



Absolutely right. My major goof.
Search engines!!!

I would type in the web address that is getting screwed up, however, someone (wink, wink) would think I was spamming.

jdMorgan

4:19 am on Mar 31, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



You could also use Apache's mod_rewrite to fix this. However, a much simpler fix would be to search for and download a freeware program called "Multi-search-and-replace".

It will change any string in any selected group of files to any other string. Just have it search for the uppercase URLs in your files and replace them all with lowercase.

I used the tool a year or so ago, for this very purpose. :o There are probably several other programs that will do the same thing - that's just the one I found.

Jim

Russ Dollinger

6:04 am on Mar 31, 2002 (gmt 0)



I bit the bullet.

I changed all my file names to lowercase; I changed all the references to lowercase; I resubmitted my pages to several engines.

I discovered that while windows does not allow files with only an case difference in the name, Apache does.
I did not specifically remove the Uppercase files from the Server. I converted all the uppercase files in my computer to lowercase, and then I uploaded the lowercase versions.
I noticed that the uppercase files are still there getting some hits.

The main culprit of the whole problem seems to have been the fast spider that Lycos and Alltheweb use.

bird

8:05 am on Mar 31, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I don't see that Alltheweb and Lycos have any problems with mixed case. Check any WikiSite: they tend to have FileNames in CamelCase, and I can't find any of those that aren't listed correctly.

Somehow I suspect that the internal links on your site were inconsistent with regards to case. This is no problem when you test it on a Windows box at home, but becomes one once you upload it to a *nix server. Whatever the real reason, converting all names to something consistent is definitively a good idea.

Russ Dollinger

4:07 pm on Mar 31, 2002 (gmt 0)



I publish medical language books. For example, I publish a book for French speaking patients. It is called Pocket Medical French. So I called the page, PMF.html. Lycos and Alltheweb turned it into pmf.html. I did not submit the pages that were changed. They were spidered. I think by fastspider. If you want to see for yourself, you can build the page address appropriately. I am not allowed to list it.
Other pages submitted to Lycos with mixed case had no problems.
I checked very carefully on my site. There were NO times that I used a lowercase version of the name.
A very experience head of another forum couldn't figure it out either. So ....
I switched everything.
"Wisdom is the booby prize for being unwise."
Piet Hein (Danish philosopher)

pageoneresults

4:21 pm on Mar 31, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Hmmm, I just noticed that the sites I manage which are on a Windows 2000 box with IIS5, return the same results whether you use upper or lower case. I'll assume that this issue is specific to the server configuration.

richlowe

3:54 pm on Apr 8, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I remember seeing an option in htacess or perhaps the apache configuration files to tell Apache to ignore case. Not sure what it was, but it might be worth checking out.