tedster

msg:4382255 | 7:41 pm on Nov 1, 2011 (gmt 0) |
In particular, Google can now index Facebook comments as well as those on some other social sites such as Disqus or Intense Debate. See http://www.techmeme.com/111031/p54#a111031p54 for more detail. (Note: the fragment identifier gets broken if we post a live link - so please copy/paste.)
|
Uctqnde

msg:4382267 | 8:09 pm on Nov 1, 2011 (gmt 0) |
This is really interesting. Are they going to base serps on what is trending in comments and of course all of the other variables (location, etc)?
|
wheel

msg:4382268 | 8:15 pm on Nov 1, 2011 (gmt 0) |
In particular, Google can now index Facebook comments |
| OK, so you're going to talk about this next week at pubcon then? Or someone's got to!
|
santapaws

msg:4382277 | 8:42 pm on Nov 1, 2011 (gmt 0) |
wouldn't they need to be a friend to read your fb comments? :) [edited by: santapaws at 9:06 pm (utc) on Nov 1, 2011]
|
tedster

msg:4382279 | 8:48 pm on Nov 1, 2011 (gmt 0) |
As I understand it, this is about the Facebook Comments Plug-in [news.softpedia.com] that's in use on many websites.
|
dududoseo

msg:4382412 | 3:04 am on Nov 2, 2011 (gmt 0) |
The ex-Googler @pedrodias tweeted about it a couple days ago. [twitter.com ] It's realy a good news.
|
Nuttakorn

msg:4382423 | 3:31 am on Nov 2, 2011 (gmt 0) |
Big move - Google will crawl more contents and links. But we need to keep moderate the links on comment (this will create spam opportunity to spammers). Can we set up no-follow links on Facebook comment box?
|
danwhitehouse

msg:4382500 | 10:32 am on Nov 2, 2011 (gmt 0) |
I think this is really interesting too. I wonder if Facebook will put a stop to it asap. Loophole or intentional? Facebook does not want Google on their side at all, just look at their robots.txt
|
Rosalind

msg:4382536 | 12:55 pm on Nov 2, 2011 (gmt 0) |
| Facebook does not want Google on their side at all, just look at their robots.txt |
| Those are pretty sensible exclusions, preventing the indexing of search pages, duplicate content, and so on. Google reports over 5.6 billion results from FB in its index. This is good news for FB if if encourages wider adoption of its comment system.
|
goodroi

msg:4382551 | 1:55 pm on Nov 2, 2011 (gmt 0) |
Be careful to keep this in perspective. Matt said Google can | "...index some dynamic comments..." |
| Matt did not say Google can index ALL dynamic comments. Webmasters should still be careful when adding AJAX to their site.
|
brotherhood of LAN

msg:4382552 | 1:56 pm on Nov 2, 2011 (gmt 0) |
Likely a case of 'it can parse well known frameworks' to varying extent rather than "ajax/js".
|
netmeg

msg:4382554 | 2:03 pm on Nov 2, 2011 (gmt 0) |
I'm glad to see it. Fits in quite well with my evil empire.
|
incrediBILL

msg:4382560 | 2:10 pm on Nov 2, 2011 (gmt 0) |
Skynet could also execute AJAX, hmmmm.... OMG! It's indexing Facebook to find Sarah Connor! [edited by: incrediBILL at 2:23 pm (utc) on Nov 2, 2011]
|
dududoseo

msg:4382571 | 2:18 pm on Nov 2, 2011 (gmt 0) |
| Big move - Google will crawl more contents and links. But we need to keep moderate the links on comment (this will create spam opportunity to spammers). Can we set up no-follow links on Facebook comment box? |
| I think that all links on facebook comments are no-followed and I think you can't change this!
|
flatfile

msg:4382582 | 2:33 pm on Nov 2, 2011 (gmt 0) |
| In particular, Google can now index Facebook comments as well as those on some other social sites such as Disqus or Intense Debate. See [techmeme.com...] for more detail. |
| I use Wikipedia text in some of my image heavy pages, but I load the text through AJAX so bots can't see it. Could this now possibly affect my rankings, if G is able to load that text?.
|
oddsod

msg:4382610 | 3:00 pm on Nov 2, 2011 (gmt 0) |
The big danger here is that Google is now going to index some of the data we thought we had blocked it from crawling. From the GorgBlog [googlewebmastercentral.blogspot.com] Google crawls the URL, yummy-sundae.html. Google begins indexing yummy-sundae.html and...decides to attempt to render the page. During the render, yummy-sundae.html automatically sends an XMLHttpRequest for a resource, hot-fudge-info.html, using the POST method. |
|
|
netmeg

msg:4382742 | 6:53 pm on Nov 2, 2011 (gmt 0) |
| I use Wikipedia text in some of my image heavy pages, but I load the text through AJAX so bots can't see it. Could this now possibly affect my rankings, if G is able to load that text?. |
| If I understood this video from Matt Cutts, something like that would probably be considered cloaking, which could possibly get you into trouble. [youtube.com...]
|
flatfile

msg:4382778 | 8:13 pm on Nov 2, 2011 (gmt 0) |
| If I understood this video from Matt Cutts, something like that would probably be considered cloaking, which could possibly get you into trouble. |
| Well that sounds scary. The way I understood the video is that if you target Googlebot and present it with different data, then that would be considered cloaking. I'm basically showing my pages this way to all users with javascript enabled. Edit Just saw the last part of the video. Guess I'll be changing my pages.
|
netmeg

msg:4382796 | 8:53 pm on Nov 2, 2011 (gmt 0) |
Probably the safest way to go.
|
Sgt_Kickaxe

msg:4382863 | 12:02 am on Nov 3, 2011 (gmt 0) |
This is old news, Google's been indexing words that do not appear on page if you view source for over a year now. I suppose they had to make it official though.
|
Swanson

msg:4382877 | 12:35 am on Nov 3, 2011 (gmt 0) |
I noticed Google indexing my Ajax content more than a year ago. So, if anyone was thinking ajax was a way of hiding content from google then it has been a no-go for ages. I totally hide anything I don't want indexed by Google - that is stuff like ads, dupe content etc. - and yes it is cloaking, but I simply don't care if there is an issue - I need to pay the bills for my family.
|
Swanson

msg:4382879 | 12:39 am on Nov 3, 2011 (gmt 0) |
In fact, I have started doing anything that works - forget Google guidelines etc. or if it is "ethical" and you know what, it pays better than a strategy that looks at ranking well in Google. Over the past year I have shifted to simply thinking about users and then manipulating Google as I can to get traffic to my content.
|
flatfile

msg:4382884 | 12:53 am on Nov 3, 2011 (gmt 0) |
I only added the ajax stuff yesterday, that's why it's easy for me to undo. But, yeah one of my competitors has been using that since ever and is ranking well.
|
flatfile

msg:4383086 | 4:18 pm on Nov 3, 2011 (gmt 0) |
Well, I've decided to put the Wikipedia snipped back. I'm not loading the text through JavaScript this time.
|
1script

msg:4383094 | 4:36 pm on Nov 3, 2011 (gmt 0) |
They've been executing JS for quite some time now (3-4 years I think) to fish for hidden links outside HTML code. I understand that could be done during indexing, i.e. not in real time. But are they now executing it during crawl or still during indexing/processing stage? I mean, those Facebook comments and the like are all time-sensitive info. It would make no sense to execute THAT JS later - so are they going to run all JS on all pages they download right that instant? I did not think there's enough computing power in the world that would let anyone do that, not at their scale anyhow. How about JS with errors? Endless loops and otherwise bad code? Wow, this looks like a can of worm I would prefer to keep sealed ...
|
almighty monkey

msg:4383477 | 12:41 pm on Nov 4, 2011 (gmt 0) |
This is, as others have said, hardly new news. So long as the link contains hard URL data, Google will be able to find it, and has been for some time. I've seen sites, for example, who's JS links contain just a string to add to the end of the URL, for example "parameter=filter1", and Google has tried sticking it on the end of that pages URL and spidering the result. Rather handily, when its a navigation element, Googlebot seemed to get stuck in a loop for a bit, resulting in it spidering pages that had the query "Parameter1 + Paremeter1 + Parameter1 + Parameter1".
|
Hissingsid

msg:4383489 | 1:22 pm on Nov 4, 2011 (gmt 0) |
@dududoseo | I think that all links on facebook comments are no-followed |
| So you think that Googlebot doesn't follow no-follow links ;)
|
lucy24

msg:4383672 | 9:21 pm on Nov 4, 2011 (gmt 0) |
In practical terms... I noticed that the major search engines have started picking up javascript pages. Fine, if it's intrinsic to the page content. But how the ### do you keep them from picking up things like piwik code that they have no business snooping around in? :: not happy about the prospect of manually changing a bunch of links to go along with relocating code snippet to roboted-out directory where it doesn't really belong ::
|
|