Forum Moderators: open

Message Too Old, No Replies

Google is not indexing/including a web page

What could be happening?

         

Gusgsm

11:22 am on Feb 24, 2004 (gmt 0)

10+ Year Member



Good morning,

I am perplexed. There's a given page that gives way to 5 subpages on a subject. No one of them appears in Google even if I search them with an 'special' term that I have included in the keywords so as to locate them (it's just my own name).

The starting code is:

<?xml version="1.0" encoding="ISO-8859-1"?>
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.1//EN" "http://www.w3.org/TR/xhtml11/DTD/xhtml11.dtd">
<html xmlns="http://www.w3.org/1999/xhtml" xml:lang="es">

<head>

<title>C&oacute;mo enviar los ficheros a la imprenta (1)</title>

<meta http-equiv="content-type" content="application/xhtml+xml; charset=ISO-8859-1" />
<meta name="generator" content="Arachnophilia 4.0" />
<meta name="audience" content="all" />
<meta name="robots" content="all" />
<meta name="distribution" content="global" />
<meta content="Gustavo S&aacute;nchez Mu&ntilde;oz, 2003" name="copyright" />
<meta content="
preflight,preparar,imprenta,fotomec&aacute;nica,preprensa,preimpresi&oacute;n,filmaci&oacute;n,filmar,imprimir,
separaci&oacute;n,colores,CMYK,RGB,indexado,color,directo,spot,punto,trama,lineatura,pantalla,
resoluci&oacute;n,cu&aacute;nto,cu&aacute;nta,necesario,ppp,ppi,lpi,lpp,PostScript,tinta,
ganancia,curva,env&iacute;o,tutorial,c&oacute;mo,apuntes,Gustavo,S&aacute;nchez,Mu&ntilde;oz,
digital,imagen,dibujo,gusgsm,artes,gr&aacute;ficas" name="keywords" />
<meta content="Un resumen de lo que hay que tener en cuenta al preparar un fichero para imprenta." name="description" />
<link rel="stylesheet" href="../estilos/gris.css" type="text/css" media="screen" />
<link rel="stylesheet" href="../estilos/dina4.css" type="text/css" media="print" />
</head>

<body>
...

All the pages validate all right as xhtml 1.1 and I'm not trying to reach a relevant position in Google or any commercial stuff. It's simply that it seems that I have broken a rule without knowing it?

Thanks in advance :)

PS. The carriage returns have been included in the keywords here so as not to break the browser window. The original does not have them.

creative craig

11:26 am on Feb 24, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Is the page taken from a database? Google can have problems indexing pages that have long urls from a database.

How long have the pages been online? Are the pages linked to from other pages that are spidered ok?

Craig

grahamstewart

11:31 am on Feb 24, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Given the number of entities appearing in your keywords (e.g. &acute;) why don't you use the UTF-8 encoding for your page instead of ISO-8859-1?

That way you can just use the actual characters instead of their codes. The resulting page will be smaller and much easier for you to maintain.

This may be relevant because I'm not sure how Google will respond to a keyword that contains an html entity, but it should respond correctly to normal UTF-8 characters.

Gusgsm

11:32 am on Feb 24, 2004 (gmt 0)

10+ Year Member



Craig,

Is the page taken from a database? No.

How long have the pages been online? Let's say two months and a half roughly.

Are the pages linked to from other pages that are spidered ok? Yes, that's the odd thing. The other pages are indeked all right. Not that they are popular or anything like that. It's simple that this particular page is nada - nihil - zero for Google.

I remember know that there was a system (an url or something like that) to know if Google bot has passed trough a given page. Would you like to brush up my memory, please? I am a bit Alzheimered today.

ta :)

Gusgsm

11:34 am on Feb 24, 2004 (gmt 0)

10+ Year Member



Graham,

As a matter of fact, I changed the encoding from UTF8 to ISO-8859-1 because I saw that my browser (IE 6 in W2K) was sometimes showing them in a wrong way (Chinese characters mixed with Latin ones) and with the ISO enconding that stopped altogether.