Forum Moderators: Robert Charlton & goodroi

Message Too Old, No Replies

Finding duplicate content

         

realmaverick

1:49 am on Jul 3, 2014 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Hey guys,

I hired a writer who borrowed huge chunks of content in many of her posts.

I used copyscape to diagnose the issue and had a shock when I found it. I went through and fixed everything copyscape picked up. But other than go through 30 3,000 word articles line by line in Google, is there an easier way I can double check?

Many thanks.

brotherhood of LAN

2:16 am on Jul 3, 2014 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Copyscape has an API.

FranticFish

8:07 am on Jul 4, 2014 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Google short snippets (say 15-20 words) in speechmarks and see what you find.

Planet13

2:26 pm on Jul 4, 2014 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Have the writer re-write the articles?

netmeg

2:50 pm on Jul 4, 2014 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Something like this situation, I'd use a paid tool for, like Raven or Screaming Frog.

realmaverick

2:23 pm on Jul 7, 2014 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Thanks for the replies.

I've used Copyscape on each page and they all pass OK. The previous pages I found with snippets of borrowed info I either removed or rewrote. I also removed any quotes just to be sure.

I've googled hundreds of snippets from random pages and cannot find any copied content.

Netmeg, I have screaming frog and am aware how to find internal dup content, but not sure how to use it to find external?

The website is 2.5 months old. We publish 4-5 articles per week and they average 2-3k words per article, the articles are written by professionals with 25+ years experience in the field. Each article is shared well via social media and our Facebook page already has 10,000 followers.

But Google is acting as though we don't exist. Maybe this is more of a sandboxing period vs a penalty?

brotherhood of LAN

2:32 pm on Jul 7, 2014 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



I'd double check there are no spidering issues i.e. barriers of entry to your content. robots.txt, potential hacks, quirk in markup etc.

realmaverick

2:42 pm on Jul 7, 2014 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I've ran Screaming frog a few times. It appears to pick up all of the pages, nothing out of the ordinary. Google has also indexed 100% of the content too.

netmeg

2:46 pm on Jul 7, 2014 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



I somehow don't think duplicate content is your issue, but maybe.

How much of your content (percentage wise) is this one duplicating writer responsible for?

realmaverick

4:04 pm on Jul 7, 2014 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Probably 10-20% of the articles had chunks of duplicate content in them. The other articles all appear to be fine. Even those with duplicate content, also had more original content vs duplicate.

I haven't personally experienced the sandbox effect for a long time. But after this experience, I googled it and found a fair bit of recent chatter about it.

We haven't actively built any links, which may also be part of the issue. I didn't want for our brand new website to be under the radar for unnatural link building patterns. But we've picked up a few along the way.

netmeg

5:48 pm on Jul 7, 2014 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Well it's a pretty new site. Things don't just pop to the top anymore unless they're something really unusual, newsworthy or viral. I think you just have to be patient and build your way up.

realmaverick

1:30 am on Jul 18, 2014 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Yeah, it could well be an age thing. I didn't expect instant success. But I didn't expect 6 visitors from the volume of articles we have and the amount of attention we get elsewhere.

One thing that's been crossing my mind all too often is the hosting. It's hosted with a website that contains lots if thin content (largely image based). I wonder if that could effect it? Because it's only 2 sites on the dedicated server, it's clear own them both.

Is it worth switching to it's own server, even to eliminate the possibility? Or a little too much hit and hope?