Welcome to WebmasterWorld Guest from 3.233.226.151

Forum Moderators: Robert Charlton & goodroi

Message Too Old, No Replies

Google, Bing, Semantic Search and Ranking for explicit terms not on my site

     
6:56 pm on Aug 22, 2014 (gmt 0)

Junior Member

5+ Year Member

joined:July 29, 2014
posts:47
votes: 0


Hi,

I have ONE article/tale in my site that is receiving organic traffic for adult keywords that it should not rank for(about 50k visits by now).

Please note that it is not an English website and I'll give some fake information about the keywords and content of the page being targeted.

The title of the page is:
Children tales: A tale made by my 10 year old daughter - Mary and the Storm

The content of the page is about a girl that finds that her father(fisherman) is smuggling guns to earn extra money to feed her.

This tale was actual written by this child and there is absolutely nothing erotic about it, nothing. Although the site has categories targeting erotic content, this pages are only available for registered users and the content is hidden from search engines/guests.

People are finding this page mostly when searching on mobile phones and using keywords such as:
Erotic tales father and daughter
Erotic tales father eating daughter
Incest tales father and daughter
Father eating daughter
My daughter seduced me
I ate my [...]
Eating my [...]
Father eating [...]
...(there is around 200 variations)

Note that in my language, eating and fu** have similar meaning.

The words tales, father, daughter and eating can be found on the page.
Erotic, incest, virgin, fuc** and other adult keywords cannot.

I though this could be an issue with Google's Hummingbird algorithm but strangely, Bing his having the same behavior. This is such an isolated/extreme case that It makes me wonder if Bing is scrapping Google results or vice-versa. How can the both engines agree on this ERROR?!

I did search on both Bing and Google to find links to this page (with safe search disabled) and found only 2 links from safe pages.

Do you have suggestions/tools to find out why is this page ranking?
I did find a blogspot blog with adult content linking to my domain. Should I disavow this blog?
Should I remove erotic categories/articles from my site?
What do you think about this Google/Bing handshake?

I don't want to be associated with pedophile content :(
Thank you!

[edited by: brotherhood_of_LAN at 7:05 pm (utc) on Aug 22, 2014]
[edit reason] removed a couple of more explicit phrases to avoid upsetting readers [/edit]

9:54 pm on Aug 22, 2014 (gmt 0)

Preferred Member

joined:Oct 15, 2011
posts:429
votes: 0


Have you looked to see how the page translates using Bing and Google? I've seen a lot of translation hack jobs, and that may help to explain why you are getting traffic for keywords you think you shouldn't.
10:14 pm on Aug 22, 2014 (gmt 0)

Junior Member

5+ Year Member

joined:July 29, 2014
posts:47
votes: 0


@turbocharged

Can you be more explicit? I'm not understanding. You want me to translate the article from language A to language B? The article is in language A and it is ranking for Language A keywords. What is a translation hack job?
10:51 pm on Aug 22, 2014 (gmt 0)

Junior Member

5+ Year Member

joined:May 16, 2014
posts:141
votes: 0


Note that in my language, eating and fu** have similar meaning.

The words tales, father, daughter and eating can be found on the page.


It seems to me both search engines are responding to on-page factors more than links. If your categories containing erotic content aren't indexed, that is also probably not a source of the issue.

A no-index on that page should prevent potentially unwanted searchers from showing up on your site while still allowing user access to the page. If you don't want to no-index, then I'd work on attempting to remove or use synonyms for "eating".

I realize you might not want to alter your daughter's work, but I think the ambiguity of "eating" in your language is probably the main issue.
11:24 pm on Aug 22, 2014 (gmt 0)

Junior Member

5+ Year Member

joined:July 29, 2014
posts:47
votes: 0


I realize you might not want to alter your daughter's work, but I think the ambiguity of "eating" in your language is probably the main issue.


Just to clarify 2 things that I poorly explained.
1- Not relevant but it's not my daughter, it's the poster's daughter.
2- "eating" is not a synonym for a bad word in the dictionary. They have similar meaning on a sexual context(slang, jargon). I believe the same happens in English with "playing" or "riding".

"Then Mary and his father sat eating the food that the father brought from home"

This is a quote from the original text where the "eating" word appears. It appears only once! There is nothing ambiguous in this sentence from an human point of view but somehow it gets the bots confused.

Thanks for your suggestions Clay.
9:09 am on Aug 23, 2014 (gmt 0)

Junior Member

5+ Year Member

joined:May 16, 2014
posts:141
votes: 0


You were the original poster, so when you say it's the poster's daughter I just naturally think .... nevermind

In English:
Male bird = c#ck
Cute cat = p#ssy

Either one is fine within the context of a relevant discussion. Combine the two and now you are in a different realm.


"Then Mary and his father sat eating the food that the father brought from home"

Mary and HER father ate the meal the father brought home.

You don't pay attention and waste people's time trabis.
11:24 am on Aug 23, 2014 (gmt 0)

Preferred Member

joined:Oct 15, 2011
posts:429
votes: 0


The article is in language A and it is ranking for Language A keywords.

I misunderstood the original post and thought you may have been receiving traffic from another language. Obviously that's not the case.

Somewhere, probably on an onsite factor, is telling the search engines that it is erotic content. If your navigational display on that article page includes links to erotic content with erotic words, that combined with the author's words may be just enough for search engines to make the association. If this is the case, then Clay_More's suggestion to use noindex is the fix if you do not want to remove the navigational references to the erotic content.
11:41 am on Aug 23, 2014 (gmt 0)

Junior Member

5+ Year Member

joined:July 29, 2014
posts:47
votes: 0


@Clay_More
You were the original poster, so when you say it's the poster's daughter I just naturally think .... nevermind

I never said I was the poster. My site has several posters, just like this site.

Please note that it is not an English website and I'll give some fake information about the keywords and content of the page being targeted.

...

"Then Mary and his father sat eating the food that the father brought from home"


Mary and HER father ate the meal the father brought home.


I was trying to translate the quote From the original language into English word by word. Yes, It is HER father. I'm not an English speaker.


You don't pay attention and waste people's time trabis.

"What we've got here is failure to communicate"
12:09 pm on Aug 23, 2014 (gmt 0)

Junior Member

5+ Year Member

joined:July 29, 2014
posts:47
votes: 0


If your navigational display on that article page includes links to erotic content with erotic words, that combined with the author's words may be just enough for search engines to make the association.


Under each article I show a list of other articles from the same user. It could be that this user also posted erotic content and that there was a flow of link juice.
But...This poster/user happens to be a nice lady that only posts about love, peace, birds, rivers, flowers....

I could not find internal links from/to this page that could relate it to erotic content.

[edited by: brotherhood_of_LAN at 4:37 pm (utc) on Aug 23, 2014]

12:11 pm on Aug 23, 2014 (gmt 0)

Senior Member from GB 

WebmasterWorld Senior Member brotherhood_of_lan is a WebmasterWorld Top Contributor of All Time 10+ Year Member Top Contributors Of The Month

joined:Jan 30, 2002
posts:5044
votes: 59


I'd maybe start by having "nofollow" on any page that you know has explicit content, to avoid permeating any signals from those pages.

mod's note... the thread is 'noindexed' so there's no problem discussing the topic, but let's avoid discussing explicit terms where they can be avoided. Thanks.
4:53 am on Aug 25, 2014 (gmt 0)

Junior Member

5+ Year Member

joined:May 16, 2014
posts:141
votes: 0


@trabis

On this site, you were the OP talking about an article on your site.

If you are looking to solve the issues of the poster on your site, then I think it's already been covered.

@boL

Tried to make the examples non-offensive while still attempting to convey the concept words in conjuction could be problematic.

I'm guessing the OP is dealing with a Portuguese/Brazilian issue, but it's all likely based with on-page factors
.
2:07 pm on Aug 25, 2014 (gmt 0)

Junior Member

5+ Year Member

joined:July 29, 2014
posts:47
votes: 0


I'm guessing the OP is dealing with a Portuguese/Brazilian issue

Caught!

I have other pages with father, daughter and eating and none is having this problem.

Following your example, having "male" and "bird" in different paragraphs should never be a problem, should it?!

This is a UGC site so I have many articles and this is the only one(that I noticed) which is ranking for adult terms.

I did some extra research and found:
- This article was posted on 08/2010, on 08/2012 it started ranking, on 08/2013 and 05/2014 it doubled the traffic, on 01/08/2014 it lost 2/3 of the traffic. I does not match the hummingbird release date.
- The title and 1st paragraph can be found in 2 sites that generate spun content.
- Google shortens the title hiding the category(tales->children)


I'll try changing the title from:
A tale made by my 10 years old daughter - Mary and the Storm
To:
Mary and the Storm
and see if it will make a difference. That should help get rid of the "daughter" and "10 years old" keyword.




4:30 pm on Aug 25, 2014 (gmt 0)

Senior Member

WebmasterWorld Senior Member planet13 is a WebmasterWorld Top Contributor of All Time 5+ Year Member Top Contributors Of The Month

joined:June 16, 2010
posts: 3828
votes: 31


Somewhat related...

I had a piece of wall decoration and I gave a measurement of the decorating in inches:

"Widget Sign, 10 Inches Wide"

So google started ranking that page for things like:

"Inches symbol"

"inches sign"

"sign for inches"

Basically, people who were looking for the symbol for inches (in the US we commonly use " for inches and ' for feet) were getting served my page in the google results.

This went on for at least a year or two until one day google finally decided that it had gotten it wrong.
4:53 pm on Aug 25, 2014 (gmt 0)

Junior Member

5+ Year Member

joined:July 29, 2014
posts:47
votes: 0


@Planet13

Did Bing had the same behavior? It is so strange seeing both SE agreeing on this.
4:56 pm on Aug 25, 2014 (gmt 0)

Senior Member from GB 

WebmasterWorld Senior Member brotherhood_of_lan is a WebmasterWorld Top Contributor of All Time 10+ Year Member Top Contributors Of The Month

joined:Jan 30, 2002
posts:5044
votes: 59


Try googling/bing(ing?) the page with the terms you believe it ranks for, with safesearch on and off. It may well be that your site is flagged as non-explicity but the word proximity alone is putting you near the top of the results (a best of the rest).
5:25 pm on Aug 25, 2014 (gmt 0)

Junior Member

5+ Year Member

joined:July 29, 2014
posts:47
votes: 0


@BOL

Interesting!
Searching for "tale father and daughter" with safe search disabled brings lots of unsafe results. These unsafe results contain keywords such as "erotic", "incest", "pregnant", "seducing". This keywords are being associated with my page because they are present in these other pages ranking for "tale father and daughter".

My conclusion:
It seems like Google semantic engine runs on an unsafe mode. Safe looking queries get associated with unsafe meanings. On safe mode, Google looks for all safe and unsafe meanings, grabs all results and then filters out unsafe ones.

Also, if your site is only about BLUE WIDGETS and the majority of websites about BLUE WIDGETS are also about RED WIDGETS, you may find yourself ranking for RED WIDGETS!

Thank you very much, this was productive :)
2:17 pm on Aug 27, 2014 (gmt 0)

Senior Member

WebmasterWorld Senior Member planet13 is a WebmasterWorld Top Contributor of All Time 5+ Year Member Top Contributors Of The Month

joined:June 16, 2010
posts: 3828
votes: 31


have you checked to see whether people are LINKING TO that page using those sexually related keywords?

or, maybe it is because someone has a directory that posts links to erotic stories and that page links to your page, too. (even if it doesn't use sexually related keywords on the specific link to your site).
8:23 pm on Aug 27, 2014 (gmt 0)

Junior Member

5+ Year Member

joined:July 29, 2014
posts:47
votes: 0


@Planet,

I did not find any suspicious link to this specif page.

I have scanned the links given by WMT and found a suspicious blog with many pages linking to my main page (root).

Regarding this specific blog:
- It is an adult(gay) blog, you need to be +18 to enter it.
- The content is very explicit and Google is crawling it regularly (according with the dates of the WMT export file)
- Very little traffic (I estimate around 20 visits day)
- It was created early 2011 according to the "online" widget.
- I have received a total of 26 visits from this site, the first one was on 3/2011 and last one on 12/2013
- My site appears in every page of this blog in the top of a feed widget that shows last entries from sites that the owner is tracking. Actually, it is 2 do follow links, one for my root domain and other for the last post on my site(the second link goes trough a feed burner proxy)
- site:gaydomain.com "mysite-name" shows 299 indexed pages

Regarding my link profile:
- I have about 730 root domains being blogspot 1 of them.
- Blogspot shows 2621 pages with links to my domain.

I have contacted the owner of this domain to remove my site from his feed list but I am still waiting for an answer. I don't believe this blog is a very strong signal for Google. It is an isolated case.

Please read my previous post. I think I already know why Google is ranking this page for keywords that cannot be found on the page.