Forum Moderators: open
I checked some of my competition listed in the top 10 and did find one site that scraped some text from me.
What percentage is considered duplicate content? What percentage would trip this filter and why would my site be the one considered the duplicate content?
Even when I do link:www.domain.com
it shouws results for:
domain.com
domain.com/index.shtml
www.domain.com/index.shtml
why would they show the same page as being a backlink to itself?
also, why is G not seeing that index.shtml IS the same as domain.com? It knows it's duplicate content, even grabbed the exact same description for all four listings.
I'm going to guess that this is something google will fix.
I am also seeing inctances where the same page is indexed more than four times, including tracking urls from paid ads. This really does not seem like something they would want to happen and it seems like something that would be simple to fix.
To confuse matters, I did find a site that copied verbatim a paragraph, my title, and my description.
I also had my site on two servers at the beginning of May to help google find me during the transition from one server to the next.
So, I'm baffled as to which of these, if any, are the culprit of this duplication filter problem.
What a MESS!
Should I wait for the index to settle?
Should I fill out a spam report?
Should I be concerned about having www.mysite.com and mysite.com both listed? Note that www.mysite.com and mysite.com only are returned in results when I search for the content of my site.
Thanks!
I just highlighted the first paragraph of my site and pasted it into the search box. Low and behold a site came up with my same title. I clicked on the site and it took me to amazon.com. When I click on the cache, its my site!
What should I do?
What you should do:
1. Your competitor is using cloaking. Fill in a spam report.
www.google.com/contact/spamreport.html
2. Your competitor is stealing your copyright content. Fill in a copyright report for google. I can't find URL right now.
If you read message 23 of this thread, it will give you details on what to do when someone copies your content.
"In order to show you the most relevant results, we have omitted some entries very similar to the 54 already displayed.
If you like, you can <link>repeat the search with the omitted results included.</link>"
If you're seeing one of your own pages listed as a duplicate (e.g. domain.com/index.html shows up instead of www.domain.com/index.html), I wouldn't worry very much. That page should still appear in searches, and probably we can make it into the canonical version over time.
If someone else has copied your page, then you should try to solve the issue with them. In the worst case, you could escalate it to assert your ownership of your page.
This is not really a Google issue, other than the fact that "&filter=0" might allow you to find someone who copied your page. Even if we removed the other person's page (which we wouldn't do without a correct DMCA request, because we can't tell who really owns the page), you could still have the same problem with the other person's page showing up in other search engines.
So: Google tries to be the best reflection of the web that we can, but if someone has copied your page, that's a copyright issue, and not a spam issue. I encourage you to work with the other person to resolve the issue, because even if you did a DMCA request with Google, you'd still have to worry about those pages showing up on other search engines.
One of my index pages (was there last month during dominic :) ) has also been caught out by this filter but I am struggling to find a site who has ripped me off. It is a competitive keyword with 3,000,000 in the results but thankfully it is not an important keyword for me as it is a generic relatively non-commercial term. Saying that, it was ranked no 8 last month (and for the last year or so) but this update with the &filter=0 applied it has gone up to no 5. It seems strange that Google finds my index page more relevant this update but chooses not to display it!
Are there any other causes GG or should we sit tight and wait until the dust settles?
but I am struggling to find a site who has ripped me off.
Go to your site and pick out a few unique phrases that are 5-10 words in length (but don't choose a phrase that contains your site or business name, as infringers will usually change that to their own site or business name). Then head over to the fi google server, and type in your unique phrase with " " around it. The " " means it will look for that exact phrase, with the words in that specific order. Be sure to add the &filter=0 on the end, or click the "repeat the search with the omitted results included."
You can try a few of your different unique phrases, and if someone has copied your content, chances are it will show up this way. I find many copyright infringers this way.
?
What should I do?
Change your .htaccess file to redirect the yourdomain.com to www.yourdomain.com (or vice versa) so Google can sort it out for the next update.
I think maybe some of the confusion has come about since people can now do directories as directory.yourdomain.com instead of just the traditional www.yourdomain.com/directory/
Because one site is coming up without the filter, I think it is judging the second as a duplicate. The change to .htaccess should correct it, and hopefully your rankings will change accordingly with the next update.
Dominic gave us a bad affiliate URL. Esmeralda has ensured that this bad URL is history. But my index page does not show for my main two keyword phrase - it shows for three or more keyword phrases. For the main keyword phrase, Google shows the index page of a sub-directory and that result is buried on the 5th or 6th page.
Unlike textex, my index page is *not* cloaked. I do not use cloaking anywhere on my site. I do not have hidden links nor do I have links from guestbooks. Hard work did pay off and Google is showing the number of true backlinks to my site - 490 now from 73 that Dominic slashed them to.
Can anyone please shed some light on why this is happening?
What do you think of a case scenario where you have one site but google has it under multiple domain names.
For example:
i have a site that is listed under www.domain.com/
but i also have a secure domain name that is used for the parts of the site that require ssl (but i can use one shared cert).
so the same content is found at ht//domain.domainnameofsecureurl.com/
What are the implications of this scenario?
Thanks!
[edited by: rustybrick at 10:25 pm (utc) on June 17, 2003]
BTW, you will need to remove your last Google link, as linking to specifics goes against the TOS. Most questions and comments can be answered without knowing your specific site name or URL.
Good theory, but not quite the solution. Google has filtered one of my main site pages as a duplicate. Did some checking and found some moron who copied my main page and posted it on four different domains. My site has been top of the SERP's for a year, this guys domains are six months new. My site has 100 backlinks (some unrecipricated PR8's), this guys domains had 10 backlinks. Google chose to think MY site was the duplicate. LOL, the computer doesn't seem to be able to make a SMART decision re: who owns the duplicate and who owns the original. FWIW
Anyway, sent a C&D, guy removed my stolen content - but, since the content is no longer there, a DMCA won't help because the page no longer has my content on it. I have gotten it taken care of, but MY site is now being penalized. It should have a top 8 listing on the #1 keyword, instead has none on that keyword.
So, while you can take care of the issue CAUSING the problem, taking care of the problem itself (dup penalty) does not have a solution, aside to wait until the next update, by which time, my site may hit the same issue again. Sigh.