Welcome to WebmasterWorld Guest from 54.159.89.7

Forum Moderators: Robert Charlton & aakk9999 & andy langton & goodroi

Message Too Old, No Replies

Top 20 Stealth Links - Getting your url in front of Search Engines by nontraditional means

Getting into the "URLs Live Every Where" mindset.

     
2:02 pm on Apr 16, 2009 (gmt 0)

Administrator from US 

WebmasterWorld Administrator brett_tabke is a WebmasterWorld Top Contributor of All Time 10+ Year Member Top Contributors Of The Month

joined:Sept 21, 1999
posts:38057
votes: 12


In another thread on Virtual Page Rank [webmasterworld.com], we talked about ways that you can link to a site other than a direct http href link or links that may not show up on the link command radar.

Lets start a list of the ways sites can link, refer, or point urls to your pages other than direct hrefs. Mainly, we are after ways that SE's such as Google may run into and use urls they find:

  1. another site links to your graphics ( img src=http://www.searchengineworld.com/gfx/logo.png )
  2. a site links to your javascript files
  3. a site links to your css files?
  4. rss feeds and other xml feeds that people can link to without notice or referrals necc being generated.
  5. links in email that some se's can read (yahoo mail, hotmail, Gmail [webmasterworld.com])
  6. links marked with noindex
  7. links marked with nofollow
  8. raw urls within javascript or js comments
  9. raw urls within css or in css comments
  10. urls within meta data of graphics and video files
  11. urls within html comments
  12. urls within the head section, meta data of a html page, or alternate html entities (alt, name, id, etc)
  13. links or pages that maybe surfed while visitor has page rank engaged on the toolbar
  14. the target of a constructed, obfuscated, or encrypted js url (hidden until executed)
  15. links behind pay walls that Google can spider via webmaster tools
  16. Domains that have been 301'd with links.
  17. Links in Flash movies (games, quizzes, etc).
  18. non href'ed url's. (raw url on page http://www.webmasterworld.com)
  19. Links in any documents other than web pages e.g. .doc, .pdf, .txt, etc.
  20. blocking a page in robots.txt should make it blocked from bots, but they still spider it.
  21. Domain registrations/Whois and DNS data
  22. Links in form data.
  23. Links in other Google produced software (gadgets, widgets)
  24. NonTraditional pages (irc, twitter, UseNet, Yahoo, or Google Groups.
  25. Advertising links (AdWords/Yahoo), and other services like Maps.

What else? Wow - blew through 20....
I will update as we go. Thanks to everyone who pitches in...

[edited by: Brett_Tabke at 3:57 pm (utc) on April 19, 2009]

2:09 pm on Apr 16, 2009 (gmt 0)

Senior Member

WebmasterWorld Senior Member bwnbwn is a WebmasterWorld Top Contributor of All Time 10+ Year Member Top Contributors Of The Month

joined:Oct 25, 2005
posts:3492
votes: 3


16. Domains that have been 301'd with links.
2:26 pm on Apr 16, 2009 (gmt 0)

Senior Member

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month

joined:Sept 25, 2005
posts:1039
votes: 84


Links in Flash movies (games, quizzes, etc).
2:32 pm on Apr 16, 2009 (gmt 0)

Preferred Member

5+ Year Member

joined:Dec 19, 2007
posts:404
votes: 0


its entirely plausible that websites could be credited for non-internet links. There is no logic that says the value of a website is determined only by its internet presence.
Would this count?
2:40 pm on Apr 16, 2009 (gmt 0)

Full Member

10+ Year Member

joined:Aug 31, 2002
posts:284
votes: 0


>>raw urls within javascript or js comments

OR/AND

anything in javascript that might a look just a little bit like a URL enough for G to spider it

(it doesn't take much IMO)

2:47 pm on Apr 16, 2009 (gmt 0)

Senior Member from US 

WebmasterWorld Senior Member pageoneresults is a WebmasterWorld Top Contributor of All Time 10+ Year Member Top Contributors Of The Month

joined:Apr 27, 2001
posts: 12169
votes: 55


Would http references in the cite attribute count?

<q cite="http://example.com/">Quote</q>
2:48 pm on Apr 16, 2009 (gmt 0)

Administrator from US 

WebmasterWorld Administrator brett_tabke is a WebmasterWorld Top Contributor of All Time 10+ Year Member Top Contributors Of The Month

joined:Sept 21, 1999
posts:38057
votes: 12


yes pg1 it would. I guess I consider that "meta" data like alt or name, or id.

> anything in javascript that might a look

Ya, that is where I was going with #8 raw urls within javascript or js comments and #14 the target of a constructed, obfuscated, or encrypted js url (hidden until executed)

2:56 pm on Apr 16, 2009 (gmt 0)

Senior Member from US 

WebmasterWorld Senior Member pageoneresults is a WebmasterWorld Top Contributor of All Time 10+ Year Member Top Contributors Of The Month

joined:Apr 27, 2001
posts: 12169
votes: 55


Links in any documents other than web pages e.g. .doc, .pdf, .txt, etc.
3:18 pm on Apr 16, 2009 (gmt 0)

Senior Member

WebmasterWorld Senior Member 10+ Year Member

joined:Feb 21, 2005
posts:2259
votes: 0


blocking a page in robots.txt
non-google toolbars accessing the page you view
unprotected traffic stats
5:00 pm on Apr 16, 2009 (gmt 0)

Full Member

10+ Year Member

joined:Oct 4, 2004
posts:289
votes: 0


What about links in desktop widgets like the Vista sidebar - that how I got to this page, when Brett twittered it.
5:49 pm on Apr 16, 2009 (gmt 0)

Senior Member

joined:Jan 27, 2003
posts:2534
votes: 0


I guess what we may be talking about here is content discovery (although that doesn't necessarily mean passing much value) and attribution (that should always pass value, right?). For a search engine, it makes sense (to an extent!) to pay more attention to URLs that other people are paying more attention to.

I think we've covered most of the common ones and also the less common ones :) but a few that come to mind:

- Analytics data (toolbars, javascript data collection, etc.). OK, a visit isn't exactly a vote (but is a link?) but enough of the right visits is certainly saying something

- Domain registrations and DNS data - Google have access to newly created domains, and also to DNS files, which will contain references to website hosts - some more frequently than others

- OCR - Google have mentioned this a few times (in connection with PDFs, for instance, and many images contain "watermark" URLs and suchlike).

6:17 pm on Apr 16, 2009 (gmt 0)

Preferred Member

5+ Year Member

joined:Nov 2, 2006
posts:410
votes: 0


Links in contextual ads
6:34 pm on Apr 16, 2009 (gmt 0)

Full Member

10+ Year Member

joined:July 12, 2002
posts:207
votes: 1


Links written on billboards, cars and other outdoor objects with help of Google Map.
7:51 pm on Apr 16, 2009 (gmt 0)

Full Member

10+ Year Member

joined:Aug 17, 2004
posts:217
votes: 0


great thread, unfortunately I don't have anything to contribute (it seems to have all been said), but great, great posts.

Thanks Brett, this is why I come to WebmasterWorld frequently, I'll be bookmarking this one :D

8:06 pm on Apr 16, 2009 (gmt 0)

Administrator from US 

WebmasterWorld Administrator brett_tabke is a WebmasterWorld Top Contributor of All Time 10+ Year Member Top Contributors Of The Month

joined:Sept 21, 1999
posts:38057
votes: 12


Well, I think content discovery is ultimately what we want to discuss. If a SE can't get at the link, it probably shouldn't be in the list.

> widgets

Aps are interesting. I thought about it a long time (such as twitter aps), but I don't see a method of that url getting into a se bot. How could it?

> billboards

You know Dinkar, you may have something there. We know for a fact, that Gbot can read text inside of graphics (Google Catalog proved that in spades). Do you think they are running OCR routines over the Google Street View maps? Then they could feed any hits, back into Google maps in order to increase the quality of maps somehow.

It is all in the community Greenleaves. Just trying to give back some of what I get.

> Receptional Andy
> Domain Registration and DNS data

Big time. Nice catch.

[edited by: Brett_Tabke at 8:21 pm (utc) on April 16, 2009]

8:22 pm on Apr 16, 2009 (gmt 0)

Senior Member

WebmasterWorld Senior Member tedster is a WebmasterWorld Top Contributor of All Time 10+ Year Member

joined:May 26, 2000
posts:37301
votes: 0


Here's something I asked Matt Cutts about at Pubcon in Austin - URLs that Google now discovers by spidering forms. Matt confirmed that such "links" send PageRank. Google establlishes a virtual link on their back end when they find something through form navigation and they add that virtual link to the webgraph.
8:27 pm on Apr 16, 2009 (gmt 0)

Senior Member

joined:Jan 27, 2003
posts:2534
votes: 0


Seconded, Ted - perhaps the wider category is 'URL manipulation' - i.e. if site.com/?one-word exists, then perhaps so does site.com/?two-words.

Pretty much all of the form spidering I've seen is essentially manipulating GET parameters.

8:51 pm on Apr 16, 2009 (gmt 0)

Senior Member

WebmasterWorld Senior Member bwnbwn is a WebmasterWorld Top Contributor of All Time 10+ Year Member Top Contributors Of The Month

joined:Oct 25, 2005
posts:3492
votes: 3


Google gadgets is a mighty good place for a link discovery
9:30 pm on Apr 16, 2009 (gmt 0)

Moderator from CA 

WebmasterWorld Administrator httpwebwitch is a WebmasterWorld Top Contributor of All Time 10+ Year Member

joined:Aug 29, 2003
posts:4059
votes: 0


This discussion is straying away from inter-site linking, and all over URL discovery... but here are a few more

URLs discovered in any other HTML attributes, like
<img onerror="http://www.example.com" lowsrc="http://www.example.com"/>

for a full list of all possible ones, just take a look at the HTML DTD
<table background="http://www.example.com">
<iframe src="http://www.example.com">
<base href="http://www.example.com"/>
... and many more
(incidentally, all of these can also be targets for XSS, but that's another topic)

URLs inferred by parentage:
http://www.example.com/dir/image.gif => http://www.example.com/dir/

URLs used in CSS:
body{background:url('http://www.example.com/image.gif');}

URLs used in conversation via Instant Messaging and Chatrooms (assuming suspiciously that they are not very private)

Tweets and statuses and other user-entered content all over the www

brute-force scraping of tinyUrl et al

the URL set as your "home page" in your browser

*your* Browser History (yes, it is possible to scrape it using some nifty and unobtrusive JS+CSS techniques)

10:52 pm on Apr 16, 2009 (gmt 0)

Senior Member

WebmasterWorld Senior Member swa66 is a WebmasterWorld Top Contributor of All Time 10+ Year Member

joined:Aug 7, 2003
posts:4783
votes: 0


Referrer info given on an outgoing link.

Logs (either directly available or via some sort of statistical package.

1:02 am on Apr 17, 2009 (gmt 0)

Full Member

10+ Year Member

joined:July 12, 2002
posts:207
votes: 1


Do you think they are running OCR routines over the Google Street View maps?

They have technology and resources ($$$$$); so if they aren't doing it now, then they will start doing it anytime.
2:57 am on Apr 17, 2009 (gmt 0)

Preferred Member

joined:Feb 27, 2009
posts:384
votes: 0


I'm a bit confused here....

Is this just a "let's list the various link types" for fun thread, or are you implying that there is some value to some of the link acquisition methods described above?

Could you put a "*" next to the ones that you feel are most worthy of attempting to get (of course only the "moral" ones).

These types of links obviously cannot make up for good solid link building skills....but on the rainy days when no webmasters are wanting to reply to my link requests.....would one or two of these suffice to build up my link profile?

Kind of like how a weight lifter dude goes on vacation and there are no weights for him to work out with in the hotel he is staying at for 2 days....so he borrows his girlfriends "thigh master" for a day or two?

[edited by: BaseballGuy at 2:59 am (utc) on April 17, 2009]

7:06 am on Apr 17, 2009 (gmt 0)

Full Member

5+ Year Member

joined:Dec 2, 2008
posts:222
votes: 0


Your GMail Account
Your Google Toolbar
Your Google Chat.
8:25 am on Apr 17, 2009 (gmt 0)

Preferred Member

10+ Year Member

joined:Feb 10, 2006
posts:627
votes: 0


Bookmarks (even "private" ones) from services such is Delicious etc. (or that doesn't count in this context?)
10:13 am on Apr 17, 2009 (gmt 0)

Full Member

5+ Year Member

joined:Dec 2, 2008
posts:222
votes: 0


Forgot...Google Chrome
1:00 pm on Apr 17, 2009 (gmt 0)

Administrator from US 

WebmasterWorld Administrator brett_tabke is a WebmasterWorld Top Contributor of All Time 10+ Year Member Top Contributors Of The Month

joined:Sept 21, 1999
posts:38057
votes: 12


Thanks 2clean, httpwebwitch both of yours are covered in one form or another.

> OT baseball guy

This is a list of nontraditioal ways to get a url in front of GoogleBot or into the system. The value of doing so is a whole other discussion.

off topic - I cleaned 5-6 side topics in here. Feel free to start another thread with related issues.

5:59 pm on Apr 17, 2009 (gmt 0)

Senior Member

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month

joined:Sept 25, 2005
posts:1039
votes: 84


Google Adwords?
9:53 am on Apr 19, 2009 (gmt 0)

Senior Member

WebmasterWorld Senior Member 10+ Year Member

joined:Nov 5, 2002
posts:872
votes: 0


What about detailed location/company-description data in google-maps?

And then google earth's .kml- and .kmz-files, particularly with respect to community-applications like the google-earth-war-project.

(Both might be subsumed under 19) or 23), but shouldn't we better treat each "format other than webpages" on it's own?)

Image-Captchas!

Related: URLs may also be encoded as the "result" of those magical stereoscope images (if you definetly want to keppe SEs out;)

3:48 pm on Apr 19, 2009 (gmt 0)

Junior Member

10+ Year Member

joined:Apr 14, 2005
posts: 62
votes: 0


Gbot can read text inside of graphics (Google Catalog proved that in spades).

How about URL written on a graphic?

Do you think they are running OCR routines over the Google Street View maps? Then they could feed any hits, back into Google maps in order to increase the quality of maps somehow.

Honestly, it would not be surprised if G is doing this. However, if G is doing this and putting this as a factor in ranking a site, I seriously worry that Internet will become another world that is like our real world (By buying more ads/billboard ads offline, which is easily bought with $$$, a site ranks). There gone the long tail markets and left only manipulation by big shark in the real world.

3:15 am on Apr 21, 2009 (gmt 0)

Junior Member

5+ Year Member

joined:Sept 26, 2006
posts: 145
votes: 0


[webmasterworld.com...]
In the above thread Brett said:
I am pretty much convinced that links in Gmail pass some juice.

Is there anybody who has noticed this?
How did you find it?

This 37 message thread spans 2 pages: 37