Through research, we determined that this problem (losing pages in Google's index) was a compliance issue. When the code is compliant, search engine spiders can crawl the content more easily.
The solution? Make sure you use a DOCTYPE tag and an ISO Character Set Statement at the top of every Web page.
For example:
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.1 Transitional//EN">
<META HTTP-EQUIV=content-type CONTENT="text/html; charset=ISO-8859-1">
[webmasterworld.com...]
It is not the presence of a doctype per se that will help, but the absence of markup errors confirmed by the validation process. Incidentally, the doctype listed above is incorrect, so you should use one of the doctypes listed here:
[w3.org...]
If you are unsure, you should choose the doctype for HTML 4.01 Transitional:
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd">
Declaring a character encoding (charset) is equally vital, especially if you are using a non-English language for your documents (but it is important even for English). The exact charset depends on the document language - ISO-8859-1 is a common encoding for many western European languages (English, French, etc.).
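To show where the two declarations sit relative to the rest of the markup, here is a minimal page skeleton. It is only a sketch: the title and body text are placeholders, and it assumes the file really is saved as ISO-8859-1 (if your editor saves in a different encoding, the charset value should name that encoding instead).

```html
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd">
<html lang="en">
<head>
<!-- Charset declaration placed first in the head, before any text content -->
<meta http-equiv="Content-Type" content="text/html; charset=ISO-8859-1">
<!-- Placeholder title; HTML 4.01 requires a title element -->
<title>Example page</title>
</head>
<body>
<!-- Placeholder content -->
<p>Page content goes here.</p>
</body>
</html>
```

The doctype and charset on their own do not guarantee anything; the page still needs to be run through the validator to confirm the markup is free of errors.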