Forum Moderators: open

Message Too Old, No Replies

Template pages not being indexed by Google

         

tintin99

10:41 am on May 26, 2004 (gmt 0)

10+ Year Member



Hi,

I have a couple of sites which I've designed using Dreamweaver templates and I've found that Google has only indexed the header and footer (ie the template bits) but not the page content.

I've been searching for previous postings of this problem but every post I've found on the subject suggests that it should work OK.

Does anyone have any ideas of what I should look for?

Many thanks in anticipation or your help!

Bob.

troels nybo nielsen

12:06 pm on May 26, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Welcome to Webmaster World, Bob.

I wonder if you may have technical problems. Have you validated those pages? And do you have a robots.txt file?

Marcia

3:52 pm on May 26, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I've seen several sites with similar problems, and it isn't just DW templates. It can happen where there is no sign of DW template or Library item. From what I've seen Google just doesn't seem to like the very top of the pages being identical all across the whole site lately.

Wail

4:07 pm on May 26, 2004 (gmt 0)

10+ Year Member



From what I've seen Google just doesn't seem to like the very top of the pages being identical all across the whole site lately.

What an odd thing for Google to decide to do. I imagine a large percentage of the Internet is like that. I do suspect that Google is being reluctant to do more than just peak at the first page or two of new sites, though.

tintin99

4:32 pm on May 26, 2004 (gmt 0)

10+ Year Member



Thanks for the welcome and the replies!

Yes, it's strange. One site is older than the other and I'm fairly sure that it used to be indexed OK on Google, but it certainly isn't now.

I just tried validating the pages and there were a few fairly minor errors (TYPE and ALT tags missing) but not enough to stop Google I think. I did have a robots.txt file on the more recent site but removed it recently in case it was causing the problem.

The more recent site also has a different title for each page so you would think Google would pick up on that and recognise that the pages were different.

At the moment the only fix I can think of is to detach the pages from the templates.

Bob.

PatrickDeese

4:36 pm on May 26, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



What I have observed is that it seems that G grabs the first few bytes of a page and if it "tastes" the same it will classify the pages as "similar" to one another.

I had a problem with a site with several hundred pages being "ignored" like this and solved it by moving the <h1> tags higher up in the page and rewriting them a bit so that G recognized the content as unique.

They used to read something like

<h1>Example.com's Page About Red Widgets</h1>

and changing them to read

<h1>The Red Widgets Page at Example.com</h1>

seemed to help alot as well.

tintin99

4:54 pm on May 26, 2004 (gmt 0)

10+ Year Member



In fact the summary that Google gives consists of the first few words of the header and the last few words of the footer. If I click on the 'Similar pages' link it says 'no other pages found' or words to that effect!

I'm not sure if I'm allowed to post urls here...perhaps that might shed some light?

Bob.

rogerd

5:20 pm on May 26, 2004 (gmt 0)

WebmasterWorld Administrator 10+ Year Member



Don't post the url, but do try some tools to see what might be causing this problem. Check the Search Engine Tools [searchengineworld.com] page and try the Sim Spider and HTML validator for starters.

PatrickDeese

5:30 pm on May 26, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Just occurred to me - perhaps you accidentally allowed the template call pages from your own hard drive.

I had a client that made a site in front page and it looked fine from his computer, but it was broken for everyone else because everything was being called from his own hard drive

<img src="C:\Documents and Settings\User\Desktop\Foo\photo.jpg">

He was going crazy because it looked fine to him, but was broken to everyone else.

tintin99

6:02 pm on May 26, 2004 (gmt 0)

10+ Year Member



All looks fine on Sim Spider. I can see all the other linked pages and spider them as well. All the meta tags come up fine too. It all looks as I would expect.

Other people I know have viewed both sites with no problems so the pathnames must be OK.

The HTML validator returned an unexpected server error, but I tested pages on W3C and did get some errors as I mentioned before but nothing serious (eg missing 'alt' tags).

Wierd!

Bob.

Marcia

6:36 pm on May 26, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



By any chance Bob, do you happen to have a header graphic up top that links back to the homepage with the same keyword phrase in the alt attribute throughout the site?

Patrick, I've seen a lot of pages out there with the snippet bollixed up like that. Nothing to do with linking to the HD - they've just got the alt attribute listed with ... after it in the snippet and it stops short there - that's the whole description.

tintin99

8:05 pm on May 26, 2004 (gmt 0)

10+ Year Member



I think you're onto something Marcia! On one site that's exactly what I have and the first part of the Google description is in fact the alt tag for that image. This site does have a different title on each page though.

For the other site, now that I look at the Google entry, I find that the description is just the footer on its own. This is just a set of text-based hyperlinks and a copyright notice. The header on this site (which contains an image with no link or alt tags and text-based hyperlinks) has been ignored. This site has the same title on each page.

Does that give you any more clues?

Bob.

wanna_learn

11:17 am on May 27, 2004 (gmt 0)

10+ Year Member



I suspect that lately there has been some bad treatment by Google where the situation is ::
Similar Header area of site , I mean similar Image, Alt, and obviously lot many webmasters Link the Top Image with Home page...so similar Link.

Google dont seem to be liking it anymore or have unknowingly messed up somewhere.

caveman

11:50 am on May 27, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I also agree that Marcia could be on the right track. We've got an older site that had a bunch of subpages vanish recently. The pages were all from a single template (not DW). We knew a lot less then, so even titles and META's were nearly identical.

I think G decided that all these pages were too close to one and other. We're waiting right now to see if our fixes did the trick.

Marcia

3:44 am on Jul 8, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Ah, finally found this thread! There have been more and more pages disappearing, a lot of them being templated type pages.

There's a little bit more of an insight available now since Google has started showing a text-only version of the cache. When the images are linked, the alt attributes are showing up just as they would if they were regular text links.

So not only is it the factor of the page top being identical throughout the site, but if the alt text happens to be the main keyword phrase for the site and there are a lot of other links with the same anchor text it can go well over the top for percentage of occurences of identical anchor text.

It's harder to spot and not easy to find too many, and it doesn't have the same significance, but alt attributes in graphics that "aren't" links are also turning up here and there - they just look like normal page text so they don't stand out too much and it's harder to catch, especially if there's a lot of text on the page.

ogletree

3:48 am on Jul 8, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



I have thousands of pages with the same top and I rank great.

Marcia

4:54 am on Jul 8, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



ogletree, do they all by any chance have a graphic with keyword alt text that's linking back to the homepage?

I'm trying to get to the bottom of it because I've seen a good number running into problems and I personally always expect the top graphic to link back. Plus, it creates difficulties for people using something like Dreamweaver with a library item up top.

my3cents

4:56 am on Jul 8, 2004 (gmt 0)

10+ Year Member



I don't use any templates and my site is a few years old.. I recently started seeing goole have several of my pages with the description "ARTICLES ABOUT US CONTACT US PRIVACY POLICY SIGN IN HOME PAGE" which is the text on the top line of navigation for my entire website. I am also seeing several pages have the description "CONTACT US HOME PAGE...(c) Mysite.com 2000-2004 all rights reserved" which is the bottom line of text on every page of my website.

The entire site is hand coded, no templates, librabries or anything else other than straight html and a few images. All of the pages showing this have at least 500-1000 words of content related to the title and meta description of the page and all of these pages have lost their rankings.

ogletree

5:01 am on Jul 8, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



I don't have any alt text. There are several links in the template. There is a logo that goes to the home page and several other links to support, sign up, my account noraml ecomerce stuff. I don't use DW. I just have php includes. My pages are created using a MS access db and a perl script. I know there are better ways of doing it but I am not a web developer I use what I know.

Marcia

5:41 am on Jul 8, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



>>I don't have any alt text. There are several links in the template. There is a logo that goes to the home page

OK, so then there's no alt attribute being picked up as link text from the logo that goes to the homepage.

tintin99

7:56 am on Jul 8, 2004 (gmt 0)

10+ Year Member



I recently discovered this tag:

<!-- robots content="noindex" -->
<!-- /robots -->

...which you can wrap around an area of the page that you don't want search engines to index. I'm going to try adding it around my headers, footers and menus to see if it makes any difference.

Any thoughts on whether it should work?

Bob.

IRWINjim

11:33 am on Jul 8, 2004 (gmt 0)

10+ Year Member



I had a similiar problem. My issue was that I named the DW templates area 'description', 'title', etc.

What I found was that Google was reading the template area as the 'description' and nothing was in the area. It missed the real 'description' and 'title'.

I've sinced renamed my template area and things are good.

billygg

2:13 pm on Jul 8, 2004 (gmt 0)

10+ Year Member



question for everyone, this is kinda a serious issue here that i came acrossed. i work for an online travel company. this is my question, we put the site up about a year ago, and slowly are adding things. we have over 3,000 dymanically driven pages of content built in a directory fashion. google LOVES the site, right now. we have same top nav, same logo with a link back, no alt text, every page has different content, hotels, property listings etc. we did a dynamic descrip and keywords tag bout a month ago, google loved it, picked them up, all different to the context of city name and what is offered in the area. my question is, just recently we added h tags. h1 and h2. we also did this all dynamic. now, in the h1 and 2 they are pretty much identical, except city name changes. this is acrossed the board, for all pages. we just did this bout 2 days ago, will this type of h tag get us blocked on subpages. every subpage is different per city, titles everything, but, the h tag is almost identical, cause its dynamic. except city name which is pulled in based on id. any suggestions, im afraid after reading this, it could be an issue. :(

bether2

2:20 pm on Jul 8, 2004 (gmt 0)

10+ Year Member



Almost all of the pages on my main site use Dreamweaver templates. Not the same template for all pages, but in one case about 300 pages use the same template. Have not had any problems with being spidered, indexed, rankings, etc.

Heh, now I'm starting to worry...

Beth

billygg

5:17 pm on Jul 8, 2004 (gmt 0)

10+ Year Member



i know, i read this, and i worry, LOL, it it should be all good. lets hope :)

ogletree

6:31 pm on Jul 8, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



billygg sounds good. I would not worry about it. As long as your pages have at least 15% difference in them you should be ok. I think where some people have problems is where they have a template and very little content on the page. You need to have several paragraphs that are at least 15% different on each page. Also I would add links to other internal pages on each page. If you have a dynamic site you may have domain.com/page.php?=asldkfj type URL's. if you can get away from that using mod rewrite or ISAPI rewrite for MS that would help as well. How often do you get a total site spider? How many pages get spidered a day?

HayMeadows

6:38 pm on Jul 8, 2004 (gmt 0)

10+ Year Member



I have thousands of pages with the same top and I rank great.

100% agree....Webmasterworld has a lot of pages with similar bikini's as well. No problem ranking here.

4eyes

6:41 pm on Jul 8, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I have thousands of pages using the same template.

Mine are still OK, possible because I have used css to position the different high up in the HTML.

Might be worth someone changing a page or two to see if the problem is still there when the similar bits are further down the html.

jcoronella

7:37 pm on Jul 8, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I guess my question would be how unique is the content part of your page? Is it unique to your site?

BeeDeeDubbleU

8:42 pm on Jul 8, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



My site has over 80 pages all with the same masthead and alt text linking back to the home page. These are created from a common DW template. I am currently ranking very high, with number one positions for some of my keywords.

I would be very surprised if there was a Google penalty of this since, (as someone said earlier), half of the internet must have been created this way. I would assume that the boffs at the 'plex would have considered this. Surely they would know that they would be damaging many legit sites if they penalised this?

This 42 message thread spans 2 pages: 42