|For all sitemap lines (URLs), can you just cut out the http://domain.|
For all sitemap lines (URLs), can you just cut out the [domain.com...] and, instead, leave the /page1.html etc etc ?
We have several alias domains, and this would be a LOT easier than having to "commit" to a single one of those domains. This way, without the specific domain, the page 1.html will assume (will it?) whatever alias is being spidered/page1.html.
The other advantage is that we'll only need 1 sitemap, instead of having to do [alias1.com...] before all of the sitemap page lines, and also instead of having to include the links to the other alias domains' sitemaps (which, obviously are identical).
In other words, do we have to put the is it (?) "canonical", I mean the "http://www.alias1.com" before every page, or, can we just start each page with a slash, and the spider will assume the domain it knows it's already crawling?
Thanks. Basically, I tripped across this questoin in converting my site's canonical links (or formal [domain.com...] to without those precursors so that all links work perfectly on ALL of our alias domains for that site...but then I realized, when I did my find and replace (I replaced the [domain.com...] with nothing) that it also replaced all those strings in my sitemap.xml. SO! Question is: will Google's crawler allow a list of pages like:
If so, that would save me having to create (actually I already have them) sitemaps for all the alias domains. Simply put, I'd like to just use one sitemap if possible with relative domains. Can I do this?
This is a much simpler version of the question: Can sitemaps be done using relative paths? Or only absolute paths? Thanks.
|<loc> required URL of the page. This URL must begin with the protocol (such as http) and end with a trailing slash, if your web server requires it. This value must be less than 2,048 characters. |
why do you want multiple alias domains crawled by google?
Because a ton of users arrive via any of our other domains.