homepage Welcome to WebmasterWorld Guest from 54.234.228.64
register, free tools, login, search, pro membership, help, library, announcements, recent posts, open posts,
Become a Pro Member
Home / Forums Index / Google / Google SEO News and Discussion
Forum Library, Charter, Moderators: Robert Charlton & aakk9999 & brotherhood of lan & goodroi

Google SEO News and Discussion Forum

    
How is article duplication viewed in Google News?
ieconomists




msg:4599649
 8:06 am on Aug 6, 2013 (gmt 0)

Hi all,

Was hoping someone here could help me a little. I am looking at buying a website that is indexed with google news.


It is indexed in google news, which is crucial. However, I have noticed many of the articles on the site appear to be copies of articles that are from big famous publications.

Is this a major problem? Is there a good chance of the site losing its google news status?

As a side note, my intention upon taking over the site is to post original content. But I am just worried about this old content affecting things and losing google news indexing, which in effect will mean I purchase a lemon.

Thanks in advance.

 

Robert Charlton




msg:4599657
 8:55 am on Aug 6, 2013 (gmt 0)

Hi ieconomists, and welcome to WebmasterWorld. As I think you've guessed, Google news doesn't like copied content...

Google News general guidelines
https://support.google.com/news/publisher/answer/40787?hl=en [support.google.com]

Journalistic standards. Original reporting and honest attribution are longstanding journalistic values. If your site publishes aggregated content, you will need to separate it from your original work, or restrict our access to those aggregated articles via your robots.txt file.

Uniqueness is mentioned several times in the guidelines. In your situation, a big concern would be getting reported by the originators of the material.

I certainly would want to duplicate anything that Rupert Murdoch owned, eg. He's been particularly outspoken about news copying. Understandably, other content originators might also be upset if they see you using their material without permission.

ieconomists




msg:4599665
 9:03 am on Aug 6, 2013 (gmt 0)

hmm, this puts me in a very tricky situation, I really want this website. But this could be a deal breaker

engine




msg:4599669
 9:10 am on Aug 6, 2013 (gmt 0)

Welcome, ieconomists

Are we talking about news from press releases (distribution services, such as PRWeb, BusinessWire, etc.) on the existing site, or from other news sites?

ieconomists




msg:4599680
 9:46 am on Aug 6, 2013 (gmt 0)

well, as an example some of the articles are duplicate articles from Businessweek.,

ieconomists




msg:4599694
 10:21 am on Aug 6, 2013 (gmt 0)

I actually just checked further, and the articles are pretty much copied, but certain words throughout the article are changed and substituted for words that mean the same thing. If this changes anything, i dont know.

Also, ive checked back as far as two months and they have been doing this same thing. They have likely been duplicating these articles for over 3 months.

Can anyone give me advice here, do you think its worth the risk of buying this site considering this?

Also, just want to add that the responses and warm welcome here has been great. Will definitely be sticking around :) thanks all.

engine




msg:4599797
 5:46 pm on Aug 6, 2013 (gmt 0)

The fact that there's some customization makes it look as if a journalist has written the content from a news release. The extent of that re-writing would give me cause for concern.

The site has, obviously, done enough to pass muster with google, but that doesn't mean it will remain.

If the price is right, go for it and get working on minimizing the duplication.

JS_Harris




msg:4599905
 6:02 am on Aug 7, 2013 (gmt 0)

I don't think Google hates the actual copied content as much as they dislike not finding anything they haven't seen elsewhere. I wouldn't focus on exact order of words anymore as Google seems to classify pages by meaning and intent more now than ever before.

Most of the times the basic facts will be repeated and quotes are the norm so some measure of copying is expected. Every site publishing the story needs to have something to add to it or it is filtered out as duplicate.

ieconomists




msg:4599907
 6:13 am on Aug 7, 2013 (gmt 0)

Ok.

Im getting a bit worried now that its indexed at all now. Its weird, when I search some article headlines I find them in google news and others I do not.
Is it possible that google news does not index every article a source of theirs publishes?

Robert Charlton




msg:4599917
 7:11 am on Aug 7, 2013 (gmt 0)

Is it possible that google news does not index every article a source of theirs publishes?

Amazon doesn't get all of its pages indexed in Google, and it's quite likely that a news source doesn't get all of its articles indexed... but I can't say that with any certainty.

You might try running some tests. Identify a site comparable in authority and size to the site you're evaluating, and then run comparable searches. I'd search for headlines or text strings in articles of comparable age, with and without quotes.

Also, try a site:domain type search (ie, use the search operator), in this format...

site:example.com "quoted query string here"

Note that there should be no space between the site: operator and the domain.

It's possible for topical web content, once it's off the front page and has disappeared from current news, not to show up in a search of the entire web... unless it received some external links or the site itself is well optimized. You may not have a way of judging that.

Both the quoted search (for a unique text string) and the site: operator search help narrow that down considerably... and would be a better indication of what's in the index.

I'd search for old articles in the regular google.com, not in news.google.com.

Global Options:
 top home search open messages active posts  
 

Home / Forums Index / Google / Google SEO News and Discussion
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved