Thank you in advance for any help; sitemaps are driving me insane.
I downloaded a freeware program that builds a sitemap for you at the click of a button.
It's a pretty popular program from what I can tell.
I'll tell you exactly what I did. I know very little about this stuff, so I'm going to be painstakingly detailed so that anyone who knows what's wrong can answer (it's probably very easy).
In the sitemap program I extracted all the URLs from my site (this actually helped, because I found out that many URLs weren't linked from anywhere and weren't getting spidered!).
I clicked "generate Google XML"
Clicked "file" - "save map as" - google sitemap xml
Then I saved the whole thing as "sitemap_index.xml" in my root directory
and uploaded it via FTP.
When I clicked "submit sitemap" in Google, it gave me the following error:
Parsing error (Line 2) We were unable to read your Sitemap. It may contain an entry we are unable to recognize. Please validate your Sitemap before resubmitting
The text at the beginning of my sitemap is as follows:
<?xml version="1.0" encoding="UTF-8"?>
<!--Google Site Map File Generated by [example...] of site building site.net Sat, 25 Feb 2006 16:49:04 GMT-->
<urlset xmlns="http://www.google.com/schemas/sitemap/0.84">
<url>
<loc>http://www.mysite.com/example/outbound/usa/washington-dc.html</loc>
</url>
<url>
...followed by a list of URLs in that format.
It ends like this:
</url>
</urlset>
Thanks for any help anyone can give. These things are giving me a headache, though I do owe a lot to that sitemap generator for catching all those unlinked URLs.
<urlset
xmlns="http://www.google.com/schemas/sitemap/0.84"
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xsi:schemaLocation="http://www.google.com/schemas/sitemap/0.84
[google.com...]
A double dash ("--") inside an XML comment is not allowed by the XML spec and will make a parser reject the file; a single dash is fine. If the part of the comment you edited out contains two dashes in a row, that may be the problem, and since the comment sits on line 2 it would match the "Line 2" in the error. Or there may be some other problem character in the part that you've edited.
It would be quite odd for the program to inject a problematic comment line, but you never know. There's nothing "off" that I can see in the rest of what you've pasted in. It might be good to run the output through a validator yourself. Google recommends the XML validation tools listed on the W3C site:
[w3.org...]
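If you'd rather check locally before resubmitting, here is a minimal Python sketch of a well-formedness check using only the standard library. The function name first_parse_error and the sample comment text are made up for illustration, and this only tests whether the XML parses; it does not validate against the Google schema. It also shows the difference between a single dash and a double dash inside a comment.

```python
import xml.etree.ElementTree as ET


def first_parse_error(xml_bytes):
    """Return (line, column, message) for the first parse error, or None."""
    try:
        ET.fromstring(xml_bytes)
    except ET.ParseError as err:
        line, column = err.position
        return line, column, str(err)
    return None


# Hypothetical sitemap modelled on the one posted above; the comment text
# is a stand-in, since the real one was edited out of the post.
good = b"""<?xml version="1.0" encoding="UTF-8"?>
<!-- generated by example tool -->
<urlset xmlns="http://www.google.com/schemas/sitemap/0.84">
<url>
<loc>http://www.mysite.com/example/outbound/usa/washington-dc.html</loc>
</url>
</urlset>"""

# A single dash inside a comment is legal XML.
single_dash = good.replace(b"example tool", b"site-building-site.net")

# Two dashes in a row inside a comment are not.
bad = good.replace(b"example tool", b"example -- tool")

for name, doc in (("good", good), ("single_dash", single_dash), ("bad", bad)):
    print(name, first_parse_error(doc))
```

To check the real file, read it in binary mode (open(path, "rb").read()) and pass the bytes to the function; the reported line number should point at the offending line.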
For the program to automatically update the sitemap, you would need to have installed the application on the server and then also automated it somehow -- you described none of those actions.
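As a rough idea of what "automated on the server" could look like, here is a minimal Python sketch that rebuilds a sitemap from a list of URLs. It is not how the freeware program works; the names build_sitemap and write_sitemap, and where the URL list comes from, are assumptions for illustration. Something like this would still need to be scheduled (cron or similar) to run on its own.

```python
import xml.etree.ElementTree as ET

# Namespace taken from the sitemap posted above.
SITEMAP_NS = "http://www.google.com/schemas/sitemap/0.84"


def build_sitemap(urls):
    """Return sitemap XML as UTF-8 bytes for the given iterable of URLs."""
    # Emit the sitemap namespace as the default (unprefixed) namespace.
    ET.register_namespace("", SITEMAP_NS)
    urlset = ET.Element("{%s}urlset" % SITEMAP_NS)
    for address in urls:
        url = ET.SubElement(urlset, "{%s}url" % SITEMAP_NS)
        loc = ET.SubElement(url, "{%s}loc" % SITEMAP_NS)
        loc.text = address  # '&' and '<' are escaped on serialization
    body = ET.tostring(urlset, encoding="unicode")
    declaration = '<?xml version="1.0" encoding="UTF-8"?>\n'
    return (declaration + body).encode("utf-8")


def write_sitemap(path, urls):
    """Write the sitemap to path (hypothetical helper for a scheduled job)."""
    with open(path, "wb") as handle:
        handle.write(build_sitemap(urls))


if __name__ == "__main__":
    sample = [
        "http://www.mysite.com/index.htm",
        "http://www.mysite.com/page.html?a=1&b=2",
    ]
    print(build_sitemap(sample).decode("utf-8"))
```

Building the file through an XML library rather than by pasting strings together means special characters in URLs get escaped for you, which removes one common source of parsing errors.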
I have a similar problem. When I validate, I get this:
Validation Status:
Your XML file at [mysite.com...]
Does Not Validate to the Google Schema Definition because of a parsing error.
--------------------------------------------------------------------------------
Reported Errors:
/sitemap/testbed/3aa8ad1cde2a1786b21e5d132dafed3a.xml:1: parser error : Document is empty
^
/sitemap/testbed/3aa8ad1cde2a1786b21e5d132dafed3a.xml:1: parser error : Start tag expected, '<' not found
^
In order to detect what's wrong I just made a sitemap03.xml containing this:
<?xml version="1.0" encoding="UTF-8"?>
<urlset
xmlns="http://www.google.com/schemas/sitemap/0.84"
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xsi:schemaLocation="http://www.google.com/schemas/sitemap/0.84
http://www.google.com/schemas/sitemap/0.84/sitemap.xsd">
<url>
<loc>http://www.mysite.com/index.htm</loc>
<lastmod>2006-1-23</lastmod>
<changefreq>weekly</changefreq>
<priority>0.6</priority>
</url>
</urlset>
I really have no clue what to do, since everything I try gives me the same result...
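Two things may be worth ruling out here, sketched in Python below. "Document is empty" and "Start tag expected" suggest the validator saw nothing, or something other than "<", at the very start of what it fetched, so the first helper looks at the raw leading bytes. Separately, lastmod is supposed to use the W3C date format (YYYY-MM-DD), which "2006-1-23" does not match; that would not explain an "empty document" error, but it could fail schema validation later. The helper names describe_start and bad_lastmods are made up for this example, and the date check only looks at the date part of the value.

```python
import re
import xml.etree.ElementTree as ET

# Date-only W3C forms: YYYY, YYYY-MM, YYYY-MM-DD (any time part is ignored).
W3C_DATE = re.compile(r"^\d{4}(-\d{2}(-\d{2})?)?$")


def describe_start(raw):
    """Classify what sits at the very start of the raw file bytes."""
    if not raw.strip():
        return "empty"
    if raw.startswith(b"\xef\xbb\xbf"):
        return "utf-8 bom"  # legal in XML, but worth knowing about
    if not raw.startswith(b"<"):
        return "leading junk"  # e.g. blank line before the XML declaration
    return "ok"


def bad_lastmods(xml_bytes):
    """Return lastmod values whose date part is not in W3C date format."""
    root = ET.fromstring(xml_bytes)
    problems = []
    for element in root.iter():
        if element.tag == "lastmod" or element.tag.endswith("}lastmod"):
            value = (element.text or "").strip()
            date_part = value.split("T", 1)[0]
            if not W3C_DATE.match(date_part):
                problems.append(value)
    return problems


# The test file from the post above.
SAMPLE = b"""<?xml version="1.0" encoding="UTF-8"?>
<urlset
xmlns="http://www.google.com/schemas/sitemap/0.84"
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xsi:schemaLocation="http://www.google.com/schemas/sitemap/0.84
http://www.google.com/schemas/sitemap/0.84/sitemap.xsd">
<url>
<loc>http://www.mysite.com/index.htm</loc>
<lastmod>2006-1-23</lastmod>
<changefreq>weekly</changefreq>
<priority>0.6</priority>
</url>
</urlset>"""

print(describe_start(SAMPLE))
print(bad_lastmods(SAMPLE))
```

For the "empty" case it is best to run describe_start on the bytes actually served at the sitemap URL rather than on the local copy, since the two can differ if the upload went wrong or the server returns something else for that address.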