Forum Moderators: open

Message Too Old, No Replies

Whats the big problem with cloaking [part 3]

         

WebGuerrilla

4:40 pm on Oct 19, 2000 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



continued from part 2 [webmasterworld.com]

Vista,

The reason this site is ranked #1, even though the title and description doesn't even contain the words real estate isn't because they are using any kind of spam techniques. This is simply a bi-product of AV's theme indexing.

If you have supporting content on your site for a particular phrase, it is quite common for AV to return the home page as a match even if the phrase only shows up in a hyperlink. If the hyperlink takes you to additional content that makes extensive use of the phrase, AV will give you credit for it but it won't return the page that actually includes the terms.

We have a couple clients that are experiencing this 50%-70% of the time. As we develop and submit new optimized content, the number of home page matches increases. Even when the new phrase is only included in a hyperlink.

IMHO, this is one of the flaws of site matching vs. page matching. In the example you gave, I would have clicked on one of the other listings because the title and descriptions contained my search phrase. And all the AV data we have collected suggests that most searchers react the same way.

I would much rather have a lower rank with an accurate title and description than a #1 position with a general description.

vista

8:33 pm on Oct 19, 2000 (gmt 0)



Dave_Stewart,
1) The search was for: costa rica real estate directory and not for costa rica internet directory
2) I know for fact they spam the SE's with thousands of pages

Yes, they are placed number 1.
Yes, I have my site in top 10 as well.
No, I don't mind the competion and I love to learn and get better. That's why I am here. :)
I'll bet you $1000 that I will beat them. :)

vista

8:40 pm on Oct 19, 2000 (gmt 0)



Well said WebGuerrilla and it's a very important find at leat for me. Thanks!

That means: instead to go just for costa rica real estate perfectly optimized pages, better would be to also include 100 links to RELAVENT information on: costa rica real estate directory.
That would really have to score great (in case of AV anyway). Right?
And then cloak the succker! :)

lizzie

6:25 pm on Nov 1, 2000 (gmt 0)

10+ Year Member



That discussion above about how one person felt it was inappropriate spam on Alta Vista when he searched for "Costa Rica real estate" and others felt it was right on target proves my point:
don't censure anything, put it all out there so each of us can choose what we want to look at.
If you think a listing is inappropriate just skip over it and look through the first 20 or 30 listings and look at the ones that sound relevant to your particular search.

vista

6:41 pm on Nov 16, 2000 (gmt 0)



Ok lizzie, that's one solution. The other is to make my own search engine :)

Only thing I am trying to say: the quality and accuracy is important NOT the quantity.

jessi

5:48 am on Nov 17, 2000 (gmt 0)

10+ Year Member



vista I really agree with that we need more relevant page. Me and my boss had recently made a page all ready to go but we had not yet submitted to the SEs or directories. The next moring while looking at the new site posted in yahoo there it was a mirror of our new page with another companies name imagne that!
And when we try to look at the source code it was cloaked can you believe that, having to look at your hard work snatched while you were sleeping (great movie by the way ) and they still had the guts to cloak the page. of course we took action immediatley. everything is ok now. This is a diffrent point of view what do you think lizzie?
( by the way I'am a newbie also )

jbeeze

9:55 am on Mar 3, 2001 (gmt 0)



Hi. I am really a newbie. Just picked up Spider Cloak hoping to keep competitive sites from stealing my directory structure (which includes a bunch of literature I have legitamate rights to make available ONLY in the context of my site.)

Is there another way to keep Web Bandit and code ripper and such from accessing my source files?

Another thought, if an alliance of search engines was created to spider new submissions twice, once with the spider address as usual, and another with spidering capability from variable IP, wouldn't a large disparity in the ratings between these two allow filtering of Spider Cloak type technology used for spamming?

DaveAtIFG

4:52 pm on Mar 3, 2001 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Welcome to WmW jbeeze!

>wouldn't a large disparity in the ratings between these two allow filtering of Spider Cloak type technology used for spamming?

This is one the shortcomings of cloaking, generally an SE that wants to detect cloaking will simply field a spider from an "unknown IP" and compare results. We refer to them as shadow spiders. INK for one, was doing this and may be still, although I believe most SEs have concluded that cloaked pages represent only a tiny percentage of the spam problem they are constantly combatting and have focused their attention elsewhere.

Cloaking done right is a good deal of work, you'll spend considerable time watching your log files for these rogue spiders as well as maintaining a dozen cloaked pages for every one human viewable page that you cloak...

>Is there another way to keep Web Bandit and code ripper and such from accessing my source files?

I haven't given this a lot of thought but off the top of my head, if they provide a unique USER_AGENT string, you could ban by user agent in .htaccess.

Air

5:13 pm on Mar 3, 2001 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Hello jbeeze, welcome to the forums at WebMasterWorld.

>Is there another way to keep Web Bandit and code ripper
>and such from accessing my source files?

Yes, you can deny access by IP Address to any visitor including core rippers.

Some examples HERE [webmasterworld.com] if you are on a Apache web server.

>Another thought, if an alliance of search engines was created ...
Not only possible but already in place, although not an alliance really, just spiders from the same search engine using different IP addresses, multiple User Agents, retrieving the same page multiple times, and ignoring robots.txt at times. If your cloaking method does not handle these situations favourably then your pages will be flagged for further review, or an automatic penalty applied.

...ooops we posted at the same time Dave - a double answer - cool :)

jbeeze

10:38 pm on Mar 5, 2001 (gmt 0)



Thank you all for help and patience!

Yes I am on Apache.

I AM new to search engines and spiders. I'm reading TCP/IP for dummies. I am sure I could phrase those questions I have a lot better when I'm clear on my terminology.

I want to learn the art of optimizing pages for a few of the more known spider bots. Where on the forum to go??

DaveAtIFG

5:13 pm on Mar 6, 2001 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Air started a thread at [webmasterworld.com...] that has just what you're looking for.

jbeeze

6:11 am on Mar 9, 2001 (gmt 0)



Thanks Dave and Air!

question: from Air's list, "Cloaking on non promotional domains tends to do better than straight promotional domains. Likely due to the boost from a real domain appearing in directory listings."

I'm not sure what this means.

Also, I would like to know if people are using cloaking to present non flash.swf data to Search Engines, while presenting flash.swf files to visitors? Is there any way an engine could highly rate a site which was primarily .swf content? If not, it's really important to overcome this inability of the engines. I think www.flashkit.com is a good example. It appears that they use a lot of .swf content and that this would make most of their site complete gibberish to an html text favoring engine.

WebGuerrilla

10:13 am on Mar 9, 2001 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member




A non promotional domain is the address of the actual site where you ultimately want visitors to land. A promotional domain is a second site that is setup only to deliver traffic to the main site.

Cloaking from your main site works better because overtime, you will build link popularity to url's that are delivering cloaked content to spiders. Along with getting a boost from directory listings, you also get the added benefit of greatly increasing the number/frequency of spider visits.

If you use a secondary or promotional domain to serve cloaked pages and then you rediret visitors to the "real" site, you will have a more difficult time getting ranked for competitive phrases because no one will ever link to the promotional domain.

Regarding flash, yes many sites use cloaking to deliver content to spiders. It really is the best way to deal with flash content. You can also incorporate some javascript that will allow you to check and see if the human visitors have flash installed. That way, you can give them the proper page without forcing them to click a link to get the page they can view.