
Google SEO News and Discussion Forum

    
Webmaster Tools - Weird Duplicate Title Problem
mtreasure




msg:4584863
 11:15 am on Jun 17, 2013 (gmt 0)

Hi,

In my Webmaster Tools, (Optimisation / HTML Improvements) I'm getting a lot of pages appearing with 'Duplicate Title Tags'. However, they are the same page, so I don't really understand why Google is thinking that it is more than one page, e.g.

WebmasterWorld Information: We don't allow specifics
/WebmasterWorld/google-seo/no-specifics/
/WebmasterWorld/google-seo/no-specifics/index.html


So, it is treating the folder '/' as one page and '/index.html' as another, according to their report. However, this is only for some of the pages.

Please let me know what I should do to stop these 'errors' appearing on Webmaster Tools. I can't see why it is thinking that there are Duplicate Title Tags in the first place.

Please help me! Many thanks,

MARTIN

[edited by: goodroi at 12:28 pm (utc) on Jun 17, 2013]
[edit reason] Welcome to WebmasterWorld, now please go read the forum charter [/edit]

 

Savanadry




msg:4584917
 1:38 pm on Jun 17, 2013 (gmt 0)

Are you using WordPress? If you are, it would be very easy to add canonical links through a plug-in like Yoast.

If you're not using WordPress, then you need to add a canonical link in the <head> of both pages, each pointing to the same correct URL.

<link rel="canonical" href="http://www.example.com/WebmasterWorld/google-seo/no-specifics/">

For more info:-
[support.google.com...]


BTW, on a technical level, if only one page actually exists, it seems there are some problems with your .htaccess file, but a quick fix would be to just add the canonical link.

mtreasure




msg:4584919
 1:56 pm on Jun 17, 2013 (gmt 0)

Many thanks Savanadry.

Yes, there is just one file, i.e. index.html.

For example:
http://example.com/europe/england/somerset/bath/
http://www.example.com/europe/england/somerset/bath/index.html

They are both pointing to the same file. Hope that helps to clarify things a little better. What should I do to correct this, and why is Google thinking that they are two pages? :)

Cheers,

MARTIN
.

[edited by: Robert_Charlton at 9:04 pm (utc) on Jun 17, 2013]
[edit reason] examplified domain [/edit]

mtreasure




msg:4584920
 1:57 pm on Jun 17, 2013 (gmt 0)

Meant to say, no, not Wordpress.

netmeg




msg:4584921
 2:03 pm on Jun 17, 2013 (gmt 0)

Google thinks they're two URIs (not pages) because they ARE two URIs.

Think about it - in the bath directory, you could have an index.html, an index.htm and an index.php, and they could all be very different. Which file is the default just depends on how your webserver is configured (and Google won't know that). So it takes everything it finds and indexes it separately.
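To illustrate the point about server configuration, here is a minimal sketch of what such a setting might look like on Apache. The list of file names is a hypothetical example, not taken from the poster's site. Apache serves the first listed file that exists, and a crawler has no way to see this ordering:

```apache
# Hypothetical example: when a directory URL such as /bath/ is requested,
# Apache serves the first file in this list that exists in that directory.
DirectoryIndex index.html index.htm index.php
```

Note that this directive only chooses which file answers a request for the directory. It does not redirect anything, so /bath/index.html would still be reachable as a separate URL.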

I would probably make sure that /bath/index.html redirects to /bath/, and likewise for every other directory URI. I don't want any index.anything in Google.

mtreasure




msg:4584929
 2:12 pm on Jun 17, 2013 (gmt 0)

Thanks. I put this in the .htaccess file last week:

DirectoryIndex index.html

Would that not fix it? It's much easier for me to link to the files as index.html. I really appreciate your thoughts here.

netmeg




msg:4584945
 2:34 pm on Jun 17, 2013 (gmt 0)

I would never link to the files as index.html. Never, never, never.

And no, I don't think that .htaccess line will do it. If you're using Apache, best to ask in the Apache forum.

mtreasure




msg:4584947
 2:38 pm on Jun 17, 2013 (gmt 0)

Thanks.

>I would never link to the files as index.html. Never, never, never.

Why do you say that? I've done it for ten years and get around 3 million unique visitors a month, so it never seemed to do any harm. This is a new thing that has suddenly started happening after I did a bit of restructuring. If there is a good reason not to do it, then I'll remove the index.html links.

mtreasure




msg:4584952
 2:58 pm on Jun 17, 2013 (gmt 0)

I really do appreciate your help here and would love to get this sorted. If I take out all links to index.html - do you think that this will resolve this problem over time?

netmeg




msg:4584994
 4:41 pm on Jun 17, 2013 (gmt 0)

Read this. Twice.

[webmasterworld.com...]

Robert Charlton




msg:4585086
 9:18 pm on Jun 17, 2013 (gmt 0)

After you read that one twice, give this one a shot as well....

Duplicate content from index.html
http://www.webmasterworld.com/google/4452586.htm [webmasterworld.com]

-----

WebmasterWorld Information: We don't allow specifics

And a mod's note: I also recommend that you reread the Google Forum Charter [webmasterworld.com], which explains our linking policy. We do not offer public site reviews. Use example.com instead of your own domain.

I suggest you read the Charter two or three times as well.

mtreasure




msg:4585185
 6:22 am on Jun 18, 2013 (gmt 0)

Thank you. This is all very helpful. I have removed all links on the site to index.html now. Will this resolve the problem in time, naturally, or should I put something in the .htaccess file or similar? If so, what would you recommend?

[edited by: mtreasure at 6:25 am (utc) on Jun 18, 2013]

mtreasure




msg:4585186
 6:24 am on Jun 18, 2013 (gmt 0)

I should have said that the link to index.html was on around 12,000 pages. I'm guessing it will sort itself out in time, but what about other sites linking to us as a ../index.html link?

phranque




msg:4585199
 7:10 am on Jun 18, 2013 (gmt 0)

you need to add a rule to your configuration that redirects all requests for the default directory index document to the directory itself (i.e. a url with a trailing slash) with a 301 status code.

that will provide the proper response to handle referred requests that include the index document file name in the url as well as search engines requesting to crawl these legacy urls.

linking internally to the canonical urls is a good signal of quality and intent but as long as requests for the index.html url resolve to a 200 OK response then your google index problem won't resolve itself.
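As a rough illustration of the kind of rule described above, here is a minimal sketch. It assumes Apache with mod_rewrite enabled, an .htaccess file in the document root, index.html as the only index file name, and www.example.com as the canonical host name. All of these are assumptions, so adjust them and test before relying on it:

```apache
# Sketch only: assumes Apache with mod_rewrite enabled and that this
# .htaccess file sits in the document root of www.example.com.
RewriteEngine On

# Only act on URLs the client actually asked for. THE_REQUEST holds the
# original request line (e.g. "GET /bath/index.html HTTP/1.1"), so the
# internal DirectoryIndex lookup for "/bath/" does not match here and
# cannot cause a redirect loop.
RewriteCond %{THE_REQUEST} ^[A-Z]{3,9}\ /([^/]+/)*index\.html[?\ ]

# Answer with a 301 pointing at the same path minus the file name.
RewriteRule ^(([^/]+/)*)index\.html$ http://www.example.com/$1 [R=301,L]
```

With a rule along these lines in place, a request for /europe/england/somerset/bath/index.html should receive a 301 to /europe/england/somerset/bath/, while a request for the directory URL itself is still served normally with a 200.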

mtreasure




msg:4585200
 7:17 am on Jun 18, 2013 (gmt 0)

Thank you for that. I suspected something like that would be necessary. Please would you mind explaining exactly what I need to do? I'm not sure where to start with this exactly. Your patience here is much appreciated!
Martin

phranque




msg:4585252
 8:21 am on Jun 18, 2013 (gmt 0)

btw welcome to WebmasterWorld, Martin!


as netmeg posted earlier:
>If you're using Apache, best to ask in the Apache forum.


this thread would be a good place to start.
.htaccess redirects for index.html, index.php and index.htm to /:
http://www.webmasterworld.com/apache/4485314.htm [webmasterworld.com]

try some code - if it doesn't work, report what you tried and your results and you will get some help in the Apache forum.
