Forum Moderators: Robert Charlton & goodroi

Message Too Old, No Replies

Subdomains and Sitemaps

Should you have 1 sitemap per subdomain

         

boskeo28

6:28 pm on Oct 10, 2006 (gmt 0)

10+ Year Member



I'm a new member -- I've learned a great deal from these forums. Thanks everyone.

My site is a niche financial site. Let's say my main domain on which I have adsense is: <snip> I have submitted a sitemap to Google via Google Sitemaps and they have indexed most of my pages, if not all.

Now let's say I have a subdomain like charts.EXAMPLE.COM -- do I need to submit a separate sitemap for this? The thing is, I have submitted a separate sitemap for this subdomain, but it isn't being crawled....at least most of the pages haven't been indexed. Yet.

Overall I currently get between 500 and 1500 page views a day after about 18 months in the persent form, and just had my first $100 month recently. Any tips for financial sites? Theoretically I have thousands of pages, since information is generated for stock tickers, but these dynamically-generated pages don't contribute much in terms of Adsense income, so I'm working on building a lot more static content.

[edited by: martinibuster at 7:25 pm (utc) on Oct. 10, 2006]
[edit reason] Removed URL. See TOS [webmasterworld.com]. [/edit]

Car_Guy

6:32 pm on Oct 10, 2006 (gmt 0)

10+ Year Member



Include everything in one sitemap.

jay5r

7:22 pm on Oct 10, 2006 (gmt 0)

10+ Year Member Top Contributors Of The Month



They must be in separate sitemaps because Google will ignore any URLs in the sitemap that are not within the directory that the sitemap was served from.

For example the sitemap http://www.example.com/users/~fred/sitemap.xml can only contain URLs in the directory http://www.example.com/users/~fred/

So, by extension you can't put the URLs for one host in the sitemap of another host - hence one (or more) sitemap per subdomain.

For the authoritative source for this policy can be found at the follow URL under the heading "Location of Sitemap Files"...

[google.com...]

"The location of a Sitemap file determines the set of URLs that can be included in that Sitemap. A Sitemap file located at http://example.com/catalog/sitemap.gz can include any URLs starting with http://example.com/catalog/ but can not include URLs starting with http://example.com/images/."

boskeo28

9:11 pm on Oct 10, 2006 (gmt 0)

10+ Year Member



Thanks....I guess separate sitemaps are correct after all.

(Sorry, didn't know that the domain I used as an example was a *real* one! Shoulda checked.)

Car_Guy

11:18 pm on Oct 10, 2006 (gmt 0)

10+ Year Member



Google will ignore any URLs in the sitemap that are not within the directory that the sitemap was served from.

I wasn't aware of that, but now that you've mentioned it, things would get pretty crazy if it wasn't done this way.

Thanks for clarifying.

RichTC

11:36 pm on Oct 10, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Car_Guy,

In effect Google will treat the sub domain as it would a new detatched site - unless you have lots of content specific to the sub domain it might be best to keep the content within the main site.

You need another site map because its a different site.

Im not a big fan of sub domains, we work on one site that has 12 subs but it holds a mass of specific content and can support it, used correctly the subs can be a good idea but often i see them used by webmasters the wrong way ie:-

Currently a site that has a sub domain for every area of the uk ranks in google. I find the entire site spammy as hell, it has no unique content just data from other sites its part of and it doesnt need this volume of sub domains but does it for seo purposes - Google should crush this kind of site but it ranks well in google (for now).

Good luck

dasuzer

12:14 am on Oct 12, 2006 (gmt 0)

10+ Year Member



Should I make a sitemap.xml for each directory?

example.com/

example.com/abc_State/
example.com/def_State/

Would I make a seperate sitemap.xml for each state directory? Or should I have One sitemap.xml that cover both states in the main example.com/ directory

Thanks for your help

dasuzer

1:42 am on Oct 13, 2006 (gmt 0)

10+ Year Member



Thanks for all your replies to me :)

jay5r

3:08 am on Oct 13, 2006 (gmt 0)

10+ Year Member Top Contributors Of The Month



dasuzer - not unless you've got a good reason for it (like you need to break up a big sitemap, or you use different tools to create the sitemaps for each directory)...

webdeb

3:15 pm on Oct 13, 2006 (gmt 0)

10+ Year Member



Hi there....
another lost soul looking for answers ;-]
Our 10 year old site dropped off the face of the google-world....
It is a Doctor's site with health information as well as products for her practice.
Above-board, legit and long-standing.
Win 2003 IIS.

My client decided to have SubPlus create sitemaps and such and do a submission campaign.....without talking to me...... because she saw a huge drop in traffic over the summer (sounds like the subject of a few other threads here ).....
The first submission was Aug 25th and I didn't know that it has happened.

The week of Sept 10, I was in production on a CFusion upgrade to the site.
I just changed our product catalog from .htm to standalone .cfm pages.
No passed parameters except to the checkout engine.
The .htm pages have been between #1 and #10 in Google for YEARS.
The .cfm pages still have the same < head > content as the .htm files they came from. Verbatim.

The DAY I flipped to the new pages I was told that there would be 'another round' of submisions that day.

The sitemap SubPlus rendered had every file in the site listed including duds and pages with our IP..... (we've never before submitted a map to anyone)
FAT mess.....
oh, yes, and SP did it without adjusting the robots file
Suddenly we are no longer found by Goo (just Goo) AT ALL except by IP and by ONE keyword phrase for thewayup.com which brings up a subCatalog page.
try it --> site:thewayup.com

I took the sitemap and scrubbed it and the directory clean , as well as all dead links old prod pages off of the server or reDirected appropriately.

so what the heck's my question?......

about procedure.
I can't find anywhere in one place an accepted list of how to maneuver around with the sitemap (what should go on it).
I have been pulling my hair out about which way to write the robots file to avoid having the redirected and new pages seen as dupes in the sitemap, and not somehow seem to be spamming.

PHYSICAL PAGES - dupes vs spiders
I have diligently done individual reDirects for all of the old .htm files.
Do I REMOVE the physical file or LEAVE the file?
If I leave it, which of the following are best practice:
strip it (like the Validation file Goo uses)?
....or
does there need to be a link, or just a reference, on the page to the new one [ not a < meta > reDirect]? eg: "page has moved to"
....or
do I leave the < head > content as it was with the previous ranking so the page is stil found?

ROBOTS.txt -
would I write the disallow for the .htm or the .cfm? [reDirect is from .htm to.cfm]
--> intent : to keep the page rank for the .htm pages ..........it's way too late for me to do it Right-the-first-time, but maybe tthe answer will save someone else a headache......

SITEMAP -
do I include the .htm or .cfm pages? (which, of course relates directly to the robots file)
.... and
do I use [thewayup.com...] or [thewayup.com...] in the map if our site is in DMOZ, and therefore Goo, as [thewayup.com...]
......or
can I use relative paths to avoid any confusion between the two?.... I know that the SE's see [thewayup.com,...] [thewayup.com...] and [thewayup.com...] as different URL's , so I imagine relative wouldn't work... but it might...

....and

related........

PAGE RANK
Goo Splits PR between [thewayup.com...] and [thewayup.com....] Correct? If we've been doing the same thing for all of these years, why now do I need to 301 [thewayup.com...] to [www....] thewayup.com? or do I need to?.....
....and
Can one link damage PR on a "Related Links" page with 120 other links?

PTR
As I 301 [thewayup.com...] to [thewayup.com,...] do I need to change the PTR? should I leave it alone?

please know I have spent a month in 'crash-course' mode ... researching...
I promise I'm not starting from scratch ;-]
Thank You in advance for your answer