Forum Library, Charter, Moderators: phranque

Webmaster General Forum

    
Identifying Duplicate Content Sites/Non-TLD Sites
Duplicate or Other TLD Site?
jmccormac




msg:4388743
 1:12 am on Nov 19, 2011 (gmt 0)

I've spent the last few weeks on a project categorising approximately 2 million .eu websites. One of the final issues is determining whether a website is genuinely a .eu website or a site from another TLD being served under a .eu domain. The theory is that a site belonging purely to another TLD will have no <a href= links pointing at .eu hosts, whereas a genuine .eu site will usually have .eu or site-relative anchors. (I've also used the link rel="canonical" element to identify some non-.eu sites, as the canonical element is supposed to be domain specific.)
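A minimal sketch of that heuristic, for illustration only. It assumes the page HTML has already been fetched, and the names used here (LinkCollector, classify, IGNORED_HOSTS, MIN_NAV_LINKS) are hypothetical, as are the ignore list and the threshold of three links. It checks the canonical element first, then counts which hosts the anchors point to.

```python
from collections import Counter
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse

# Assumption: hosts treated as stats/social noise; extend as needed.
IGNORED_HOSTS = {"facebook.com", "twitter.com"}
# Assumption: how many links to one host count as "navigation".
MIN_NAV_LINKS = 3


class LinkCollector(HTMLParser):
    """Collects <a href> values and the rel="canonical" target."""

    def __init__(self):
        super().__init__()
        self.hrefs = []
        self.canonical = None

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "a" and attrs.get("href"):
            self.hrefs.append(attrs["href"])
        elif tag == "link" and (attrs.get("rel") or "").lower() == "canonical":
            self.canonical = attrs.get("href")


def host_of(url):
    """Lower-cased hostname with a leading 'www.' removed."""
    host = urlparse(url).hostname or ""
    return host[4:] if host.startswith("www.") else host


def classify(page_url, html):
    """Return (label, host) for a page served from a .eu URL."""
    parser = LinkCollector()
    parser.feed(html)
    parser.close()
    own = host_of(page_url)

    # A canonical element pointing at a non-.eu host is the strongest hint.
    if parser.canonical:
        canon = host_of(urljoin(page_url, parser.canonical))
        if canon and canon != own and not canon.endswith(".eu"):
            return ("non-eu-canonical", canon)

    # Resolve relative anchors against the page URL, then count hosts.
    hosts = Counter()
    for href in parser.hrefs:
        absolute = urljoin(page_url, href)
        if urlparse(absolute).scheme not in ("http", "https"):
            continue  # skip mailto:, javascript:, and similar
        host = host_of(absolute)
        if host and host not in IGNORED_HOSTS:
            hosts[host] += 1

    eu_links = sum(n for h, n in hosts.items() if h == own or h.endswith(".eu"))
    if eu_links == 0 and hosts:
        top, count = hosts.most_common(1)[0]
        if count >= MIN_NAV_LINKS:
            return ("suspect-other-tld", top)
    return ("eu", own)
```

Note that site-relative anchors resolve to the .eu host here, so a page is only flagged when every remaining anchor is absolute and points away from .eu.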

Some outbound links will point to stats sites or social media networks. Setting those aside, does the following logic hold up: a site whose apparent navigation links all point to the same non-.eu website is actually that non-.eu site being served as a .eu site, and is therefore a duplicate content website? Or would it be necessary to compare these pages with the corresponding pages on the other TLD website to see whether they are identical, and thus duplicate content?
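If the comparison route turns out to be needed, one possible way to do it is sketched below. This is an assumption on my part rather than a tested method for this data set: it strips markup, breaks the visible text into three-word shingles, and scores two pages by Jaccard similarity. The names TextExtractor, shingles and similarity are hypothetical, and both pages are assumed to have been fetched already.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects lower-cased visible words, skipping script and style."""

    SKIP = {"script", "style"}

    def __init__(self):
        super().__init__()
        self.words = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth:
            self.words.extend(data.lower().split())


def shingles(html, size=3):
    """Set of overlapping word tuples of the given size."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    words = parser.words
    if len(words) < size:
        return {tuple(words)} if words else set()
    return {tuple(words[i:i + size]) for i in range(len(words) - size + 1)}


def similarity(html_a, html_b, size=3):
    """Jaccard similarity of the two pages' shingle sets (0.0 to 1.0)."""
    a, b = shingles(html_a, size), shingles(html_b, size)
    if not a or not b:
        return 0.0  # no text on one side, so no evidence of duplication
    return len(a & b) / len(a | b)
```

A score near 1.0 would suggest the .eu page and the other TLD page carry the same text; where to set the cut-off is a judgement call that would need checking against known duplicates.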

Regards...jmcc

 


All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved