Forum Moderators: open
I have noticed few times that if I search for keyword1+keyword2+keyword3+keyword4...keyworn Some sites come up high in the search results with URL of
example.com/index.htm?keyword1+keyword2+keyword3+keyword4...keyworn
now this weired. I can understand the site itself track my search terms and than show it in the URL, but how google can be fooled into indexing it. or do i miss something.
does anyone else notice it? it is usually brings porn sites.
anyone?
The search terms used link to a dynamically generated page with the terms in the title.
Googlebot follows the link off the 'terms used to find this site' page and the dynamically generated page gets indexed.
Does than make sense?
I have a friend who began to receive a growing amount of spam by email about a year ago. Being one of the contact points for a UK national organisation, many other websites had printed his email address, and it was being spidered and harvested there. The website of the organisation itself had protected the email address by using document.write statements to stop it being spidered. I was aware of some of the sites that had published the email address in plain text or as a clickable link, and we contacted them to get the address removed. We used Google to look for more, but aware of various log spam scams, and fake sites, I always used just name@subdomain in the search box, rather than the full name@subdomain.domain.com email address, to find sites that had included the email address in their content. We found loads of genuine sites and contacted all of them to get the address removed from their pages.
However, at the end of the SERPs was a set of odd results: a group of pages where the URL was just the IP number and a filename 123.456.789.123/filename.txt, and the Google description just a short list of email addresses, including the email address we were trying to get removed. We did not click this link. This looked like some sort of scam. We copied the URL from the Google results, and pasted it into a new window (didn't want to send any sort of referrer information to the site). This site just listed thousands of email addresses as plain text. Most wierd. Is that the results file of an email harvester or what? We got most of the other (genuine) sites to remove the email address from their content, a long time ago, but still have no idea what to do about the last email list site. They have no other content than these long lists of email addresses and the server root does not allow access. How do we get them to drop the email address off their site?
Here's the relevant bit to this thread (but it will make no sense unless you read all of the above first). A few weeks ago, I accidentally clicked the link to this email list site (thereby sending a referrer of www.google.com/search?num=100&q=name@subdomain to the site. This week some new results have appeared in the SERPs for the name@subdomain search. One of them is www.name@subdomain.spamsite.com, which has a Google description which includes my partial email address name@subdomain (not the full name@subdomain.domain.com email address), as well as a load of random words. Clicking this link opens at least 10 windows and tries to set 7 or 8 cookies. It is a porn site (of course). Unchecked, it crashes Mozilla after about 30 seconds. So now there is a site with a URL that includes my extremely rare name@subdomain search, a search that I suspect has not been carried out by any other person on the planet except me.
The other new results also look like spammy URLs, and these also offer the partial name@subdomain email address in the Google description along with many other random words. Clicking any of those results either gives a 500 Server Error or goes to the same porn site with multiple popup windows.
In the SERPs the original sites with just the very long lists of email addresses as content were the very last sites in the results. The new spam porn sites are listed near the end of the results, but are placed just before the original email list sites in the results. All of these scam sites are listed after the normal looking results.
This sort of spam really stinks. Morals of the story. If you search for an email address, don't use the full address. If you have a website, don't leave the email address in full view.
[edited by: g1smd at 10:29 pm (utc) on Jan. 15, 2004]