Forum Moderators: Robert Charlton & goodroi
40. A method, comprising: aggregating information regarding documents that have been removed by a group of users; and assigning scores to a set of documents based on the aggregated information.
41. The method of claim 40, wherein aggregating information regarding documents that have been removed by a group of users includes: identifying a set of legitimate users and a set of illegitimate users; and collecting information regarding documents that have been removed by the set of legitimate users.
42. The method of claim 40, wherein aggregating information regarding documents that have been removed by a group of users includes: identifying a set of users with a defined relationship; and collecting information regarding documents that have been removed by the set of users.
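A rough sketch of what claims 40 and 41 describe, in Python (the function names, data shapes, and the penalty weight are my own invention; the application doesn't give a formula):

```python
def aggregate_removals(removal_events, legitimate_users):
    """Count how many distinct legitimate users removed each document.

    removal_events: iterable of (user_id, doc_id) pairs.
    legitimate_users: set of user ids judged trustworthy (claim 41).
    """
    removed_by = {}
    for user, doc in removal_events:
        if user in legitimate_users:
            removed_by.setdefault(doc, set()).add(user)
    return {doc: len(users) for doc, users in removed_by.items()}

def remove_list_score(doc, removal_counts, penalty=0.1):
    # More removals by legitimate users -> lower score (floored at 0).
    # The linear penalty is purely illustrative.
    return max(0.0, 1.0 - penalty * removal_counts.get(doc, 0))
```

Note how removals by users outside the legitimate set are simply ignored, which is the whole point of claim 41.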
Removing documents [appft1.uspto.gov]
Abstract...The system may aggregate information regarding documents that have been removed by a group of users and assign scores to a set of documents based on the aggregated information.
[0103] The IR score, link-based score, and remove list score may be combined in some manner to generate a total score that is assigned to a document. The assigned scores may be used to rank the documents (block 1830).
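A minimal sketch of the combination step in [0103], assuming a simple weighted sum (the weights and function names are invented; the application only says the scores "may be combined in some manner"):

```python
def total_score(ir, link, remove_list, w=(0.5, 0.3, 0.2)):
    """Weighted combination of the three component scores.
    The weights are illustrative, not from the application."""
    return w[0] * ir + w[1] * link + w[2] * remove_list

def rank(docs):
    """docs: dict of doc_id -> (ir, link, remove_list) scores.
    Returns doc ids ordered best-first, as in block 1830."""
    return sorted(docs, key=lambda d: total_score(*docs[d]), reverse=True)
```

With identical IR and link scores, a low remove-list score alone is enough to push a document down the ranking.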
The section (Improving Search Results) where you pulled out that 2nd quote is a good read.
Interesting. Suppose I hire a group from another country, get them a proxy server or servers so the IP is random, and begin hitting my competition with delete actions.
I wonder how many it would take to hurt or diminish their standings...
The most legitimate, authenticated group of users I can think of are Adsense publishers who put fraudulent, replicated and garbage sites on their filtered list.
Why not AdWords "content network" advertisers who put fraudulent, replicated, and garbage sites on their filtered lists?
if they can nail down legitimate users
I imagine Google would have little trouble in filtering out illegitimate users. Think about all the information they'll have about a Google account – search history, click history, browsing history (through analytics), just to name a few variables in their arsenal. They'll be able to tell exactly how legitimate a user is and weight their effect on the search results accordingly. If this were a simple “one user, one vote” system, they wouldn't be applying for a patent.
Faking a natural-looking Google account using raw CPU power would probably be about as difficult as getting your computer to generate a creative writing essay for you. The way I see it, it would require man-hours to create a set of legitimate-looking accounts and use them to influence the search results. Man-hours that would probably be better spent doing real SEO.
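The weighting described above could look something like this (a toy heuristic; the signal names and the linear weighting are my assumptions, not anything from the patent):

```python
def legitimacy_weight(account):
    """Toy heuristic: the more independent signals of normal activity an
    account shows, the more its removals count. Signal names are invented."""
    signals = ("search_history", "click_history", "browsing_history")
    present = sum(1 for s in signals if account.get(s))
    return present / len(signals)  # 0.0 (no history) .. 1.0 (full history)

def weighted_removal_votes(removals, accounts):
    """removals: list of (user_id, doc_id) pairs.
    Returns doc_id -> sum of legitimacy-weighted removal votes."""
    votes = {}
    for user, doc in removals:
        votes[doc] = votes.get(doc, 0.0) + legitimacy_weight(accounts.get(user, {}))
    return votes
```

A freshly minted account with no history contributes nothing, which is exactly why "one user, one vote" gaming wouldn't work here.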
True, as verification of actual users, not based on what filters they use for their own business. Site filters applied for monetary reasons have nothing to do with search results. Sure, they can help weed out crap sites, but many sites play the AdSense ad-space-cramming game yet have good content.
Interesting. Suppose I hire a group from another country, get them a proxy server or servers so the IP is random, and begin hitting my competition with delete actions.
First, "a" proxy server obviously won't make the IP addresses look random at all. Second, even if you got 100 proxy servers (how much money ya got to spend on this?), you then have to make the activity look "normal". How many deletes per week does a "normal" user perform? What are the statistical norms for other activities Google can detect (searches, Google account activities, Google toolbar activity, etc.)?
When there's no penalty for collateral damage, Google can afford to do auto-detection that's pretty good at eliminating the bad guys. Just ask anybody who got auto-banned from AdSense because one of their students went to the computer lab and clicked on their ads for an hour every day.
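The kind of auto-detection described here could be as simple as flagging statistical outliers – for instance, a z-score test on weekly delete counts (the cutoff and data shapes are invented for illustration):

```python
import statistics

def flag_outliers(deletes_per_week, z_cutoff=3.0):
    """Flag users whose weekly delete count is far above the population norm.
    deletes_per_week: dict of user_id -> count. Threshold is illustrative."""
    counts = list(deletes_per_week.values())
    mean = statistics.mean(counts)
    sd = statistics.pstdev(counts)
    if sd == 0:
        return set()  # everyone behaves identically; nothing to flag
    return {u for u, c in deletes_per_week.items()
            if (c - mean) / sd > z_cutoff}
```

One account hammering the delete button stands out immediately against a population of users who each remove a result or two per week – and with no penalty for collateral damage, a crude test like this is good enough.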
As someone who used to work at a search engine, I see these patents as advancements. Yes, this will make our lives as SEOs a little more difficult, but adaptation is the key that has driven this business for years.
Most of the old SEOs who realized they couldn't cheat as easily have dropped out. The one thing I have learned is that if you have to use "special tactics" to drive rankings, it will most likely backfire at some point or another.
As marketers in general, we know that if you can build a solid, compelling offering, your business will thrive. SEO seems to be moving in this direction every year. This theme actually excites me rather than scares me.
I think personal search will be a great function and will help move us forward.
I am sure people will try to game this system to take a short cut, but as we know Google and other engines are very good at detecting and eliminating this over time.
I am going to think about this more and blog about it. Best wishes.
I think that this will be based upon login information as well. It is very interesting that Google was requiring a referral to join Gmail. Now they are asking for an active mobile number. I am sure that their central user ID system will have a direct impact on votes. I would also imagine that it may take hundreds of extremely unique searches and votes to validate a site or not.
doing real SEO.
I think that SEO in general could be on its way out in Google, to a degree. For example, I'd guess they will be looking at the way users interact with the website they are on: how long they spend on a site, whether they buy, and so on. The point I am getting at is to make your website as good as possible; if users like it, based on its history, then you'll rank better or at least get a good "quality score" ;)
RJ
Wow this is going to be interesting to see in action.
We know they've been collecting the data to do this for at least two years. We've discussed it here before. The "removal" is users clicking on the "remove this from the search results" link in google searches.
The patent was filed in August '05.
Google will not have waited until the patent got granted (or not) before actually using it in the field.
I wouldn't expect any wholesale changes that we haven't already seen.
So the actions of individual users of personal search - taken in aggregate - may affect all search users? Is that how we can read this?
I believe that's about the size of it.
I'll see if I can dig up the old threads.
Just for absolute clarity, this is not a new patent application. This is a patent application from August 2005. The only thing that has changed is that it just got granted by the patent office.
TJ
"New! Google finds the search results most relevant to you, based on your search history. Learn more."
It links to this page: [google.com...]
It's probably been around for ages and I just haven't noticed it. However, it's interesting that they've decided to promote it now.
I remember people discussing this issue of excluding results when they promoted the custom search as "build your own niche search engine". People knew that this was about collecting data ever since last year, and Google knew that we knew so they are probably very cautious using anything they gathered.
doing real SEO.
I think that SEO in general could be on its way out in Google, to a degree.
I disagree. I think it will just be that the definition of SEO will change.
It will include (well, it already does)
1. helping your clients create link bait to develop "natural" links (whatever they are),
2. reviewing content not only for keyword placement, density and semantics, but also stickiness i.e. how well a site responds to a search query,
3. helping your clients get the balance right between monetising your site traffic immediately and developing a longer term relationship with the user,
4. writing titles and manipulating the Google snippet so that you not only get higher rankings but also a higher CTR (which in turn will get you higher rankings).
I could go on, but I think that makes my point. For some time, good SEO has not been simply a question of tagging, link building and doing metrics on page content. Increasingly it incorporates more creative and subjective elements. SEOs are becoming real professionals.
Doing it properly means having a certain security that, whatever algo changes are afoot, your clients are going to come off better.
no more open to gaming than link voting
Which is INCREDIBLY open to gaming :)
High volume crappy links still work in Google in competitive markets (debt, hosting).
Google is not as clever (yet) as lots of people give it credit for. Google isn't intuitive - it's not AI. It has to use simple 'yes/no' rules (however many, however inter-related) to make its decisions. If they are going to try to look for pointers of human / natural activity these have to be broken down into simple patterns that can be identified. These could be re-created. Google have to know that.
In an internet full of keyword-dense pages, their algorithm based on links was perfect - until people figured out how it worked. IF they implement a system like this, it will be cracked and the information put out there in time. Then it will be open to abuse.
they will be looking at the way the user interact with the website they are on, how long they spend on a site and if they buy and so on
Say I want train time info; I do a search, get my info in 5 seconds and I'm gone. I want to order a pizza and do a local search; I get the phone number and I'm done. I've just spent 1 hour reading user reviews of the best, cheapest MP3 players and been recommended to use 'mp3s-r-us.com' and buy a particular model. I go to the site and buy the model I want straight away. Are these useful sites? Yes.
Or how about I click away in less than 30 seconds because the page loads too slow, or it's MFA or a sex site, or just not quite what I wanted? Are these useful sites? No.
Suppose I spend 30 minutes looking for information all over a site and click deep into it before giving up? Is that site useful?
Should I even be trusted? Suppose I'm an idiot and can't find the useful information right in front of me?
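The ambiguity in these examples is easy to make concrete. A toy dwell-time rule (the threshold is invented for illustration) gets both cases wrong:

```python
def naive_usefulness(dwell_seconds, threshold=30):
    """Naive rule: longer dwell time = more useful page.
    The threshold is invented; the examples above show why the rule fails."""
    return dwell_seconds >= threshold

# Train-times lookup: answered in 5 seconds, yet labelled "not useful".
quick_answer = naive_usefulness(5)       # False, but the page did its job
# 30 minutes lost clicking around a confusing site, labelled "useful".
long_struggle = naive_usefulness(1800)   # True, but the visit was a failure
```

Any signal built on dwell time alone inverts the truth for exactly the two kinds of visit described above.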
To try to understand human interaction with a website as a gauge of 'usefulness' is just so far beyond Google (or anyone) at the moment as to be laughable.
To try to understand human interaction with a website as a gauge of 'usefulness' is just so far beyond Google (or anyone) at the moment as to be laughable.
Don't forget that their original way of classifying popularity by comparing the "votes" of other content providers was very successful, although a bit naive to some. I don't think that broadening the model by including users/consumers in the voting process will be a wrong move.
Actually, I am pretty sure we have been witnessing it already for some time.