Forum Moderators: phranque

Message Too Old, No Replies

Analyzing Yahoo/Inktomi's & Google Ranking Algorithms

Will The algorithms change and if how for Yahoo/Inktomi

         

aaaaa

6:55 pm on Jan 2, 2003 (gmt 0)



;)
Google is now reported to be responsible for 70% of the search engine results - because of it's databases being "powered" for:
--> YAHOO - web search
--> AOL
--> NETSCAPE
--> IWON
--> EARTHLINK
etc...
If Yahoo uses Inktomi's search as it's default search - as it recently now uses Google - it may be vital to accummulate as much information on the new or changed ranking algorithms that will be used.
As much as Google is now analyzed, will "link popularity"
and "popularity of links" and Title - keyword order be as much of a prority.
Will Meta Keywords and Meta Description Tags play the same role in Inktomi (Google is now acknowledging Meta Description Tags).
Any updates or analysis anyone could bring would be helpful to all SEO's if there are any changes to Yahoo search.
Even if Yahoo continues it's relationship with Google, the fact the MSN and HotBot uses Inktomi will still put the information to good use!

2_much

6:59 pm on Jan 2, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Good points AAAAA. Even though we have no idea how Yahoo will use Inktmomi, it seems most people are preparing for this by starting to optimize for Ink again.

From my epxerience, Ink is using link popularity extensively. I think they also give a page points based on how long its been in the database. Plus the normal stuff - keyword density, keyword in all the important places, etc.

Mohamed_E

8:53 pm on Jan 2, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



When I started my web site Ink was a pure PFI engine, so I completely ignored it. Since last August it has started spidering the web, and my site got included. I have made no attempt to "optimize for Ink", but my pages show up in roughly the same positions on the Ink and Google SERPs.

So I suspect that a site optimized for Google should do well in Ink with no further work.

aaaaa

4:31 pm on Jan 3, 2003 (gmt 0)



Google prioritizes EXACT keywords in Title (no stemming)

and Exact KeyPhrase or Keyword Order in the Title.

Google EXCLUDES words like: the for etc. in the ranking Algorithmic results.

(Pictures Beatles will yeild a much different result than Beatles Pictures or Beatles Picture)(Beatles will yeild the same as 'The Beatles')

Also, Google demands that EVERY.
word in the query be on the Resulting WebPages, with very very few exceptions. .

Additionally, looking at Google's Adwords Keyword Suggestion
Tool, there are substantial difference in the Overture's Results (which focus on Yahoo Msn etc)

Also, will a Dmoz Listing have the significant effect that it currently has on Google.

Lastly, if Looksmart declines in popularity - Inktomi's results will graduallly become more important on MSN (the third most used search service) - the combined result could make an Inktomi listing more or equally valuable as a Google Listing.

willybfriendly

6:47 am on Jan 4, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Hey aaaaa,

"Also, Google demands that EVERY. word in the query be on the Resulting WebPages, with very very few exceptions."

Check the cached pages on Google. It is common to get this message -

"These search terms have been highlighted:

These terms only appear in links pointing to this page: "

In fact, it is not that uncommon to find pages without ANY of the words in the query on the page. The secret is anchor text. Very important point to remember when it comes to Google.

Regards

aaaaa

6:21 pm on Jan 4, 2003 (gmt 0)



Google now uses Meta description and alt tags in it's ranking prioritization, I have noticed that in many instances (view source) I have found the words there.

Google also has an interesting "result":

if there is more than ONE space between two keywords in a Google Search - the results are sometimes slightly different...

Also, when doing an "Advanced Search" on Google you will often get more site (usually extra webpages belonging to a particular site - than you get doing the default 10 results-per-page. So a given webpage may be several places further down in the results.

Additionally, Yahoo, Aol, Iwon etc. are usually limited to offering "WebSite" results (compact) while a Google search will sometimes "pull" webpages with lower pagerank up right under the main page for a given search result if the title is similar.

Lastly, Geocities sites that have nothing to do with one another will often benefit from this - but - be absent from Yahoo, Aol etc for the same search - because they are programmed as being "WebPages"

aaaaa

6:00 pm on Jan 28, 2003 (gmt 0)



:) adwords.google.com/select/main?cmd=KeywordSandbox

compare Google's keyword suggestion tools that searches Aol and Google among others to...

:) inventory.overture.com/

which searches Yahoo Lycos HotBot Msn among others

The results are extremely valuable for complete search engine optimization and also interesting to see the differences in the keyword search frequencies of their searchers

aaaaa

6:31 pm on Feb 3, 2003 (gmt 0)



www-cw.google.com

GOOGLE recently has added a new database center (number eight)

aaaaa

5:00 pm on Feb 12, 2003 (gmt 0)



Inktomi Corp. scheduled a special meeting Yesterday for its stockholders regarding its merger agreement with Yahoo! Inc.
Yahoo! plans to buy Inktomi for $235 million, or $1.65 per share,

running scared

5:54 pm on Feb 12, 2003 (gmt 0)

10+ Year Member



aaaa - you trying to rack up those post numbers quickly ;)

Interesting point about different key phrases used by users of the different search engines. Multiple multiple word phrases are very common on Google. Could optimisation of Ink mean having to concentrate on shorter phrases more? Will Yahoo users already be learning to go for longer phrases as a result of having Google SERPS?

aaaaa

8:36 pm on Feb 20, 2003 (gmt 0)




A new flaw in Google Search


Google ignores common words and characters such as "where" and "how", as well as certain single digits and single letters, because they tend to slow down your search without improving the results. Google will indicate if a common word has been excluded by displaying details on the results page below the search box

placing the word "and" between two search terms will give you significantly different results.

Googles' ranking algorithms then change dramatically.

aaaaa

8:47 pm on Feb 27, 2003 (gmt 0)



Japan's Largest Portal Joins Google's Growing Sponsored Links Distribution Network

MOUNTAIN VIEW, Calif. -- November 18, 2002 -- Google, developer of the award-winning Google search engine, today announced that Yahoo! JAPAN will join Google's network of Japanese advertising partners. Google will also continue to provide Yahoo! JAPAN users with web-wide search results, including more than 100 million Japanese language web pages, for search queries conducted on www.yahoo.co.jp.

Yahoo! JAPAN users will soon see sponsored links provided through the Google AdWordsTM advertising program.

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

with Overture purchasing AltaVista , and AllTheWeb

if Overture uses the same format that it has used with their Go.com, Excite.com, WebCrawler.com

(using their listings FIRST , then Inktomi second)

Inktomi may be POSSIBLY powering:

Lycos
AllTheWeb
Yahoo
AltaVista

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Looksmart may be undergoing it's own changes - check out it's New Cleaner Look!

aaaaa

7:36 pm on Feb 28, 2003 (gmt 0)



Google was granted a patent by the United States Patent Office for a method of determining the relevance of Web pages in relation to search queries.
The patent, filed on Jan. 30, 2001, and was granted Tuesday.

Google turns up an initial set of documents related to the keyword and then ranks each page with a "relevance score." Next, it calculates a

"local score value" that quantifies "an amount that the documents are referenced by other documents in the generated set of documents
. Finally, the local score values influence the relevance ranking of a page.


"a search engine modifies the relevance rankings for a set of documents based on the interconnectivity of the documents in the set. A document with a high interconnectivity with other documents in the initial set of relevant documents indicates that the document has 'support' in the set, and the document's new ranking will increase. In this manner, the search engine re-ranks the initial set of ranked documents to thereby refine the initial rankings."

this topic was spidered and is now #17 on Google, for the term "google ranking"
[google.com...]

wensing

11:57 pm on Apr 10, 2003 (gmt 0)

10+ Year Member



Does that mean that this methodology is patented, or simply the underlying (and hidden) algorithms they employ to perform searches using that methodology?

--A curious search engine designer

aaaaa

5:54 pm on May 10, 2003 (gmt 0)



[www-fi.google.com...]

Google newest(9th) DataCenter (out of Finland)

a new office and DataCenter is now openning in NYC (times square)

possibly: www-ny.google.com?

aaaaa

8:00 pm on May 15, 2003 (gmt 0)



[dbpubs.stanford.edu:8090...]

Extrapolation Methods for Accelerating PageRank Computations

2003

We present a novel algorithm for the fast computation of PageRank, a hyperlink-based estimate of the ``importance'' of Web pages. The original PageRank algorithm uses the Power Method to compute successive iterates that converge to the principal eigenvector of the Markov matrix representing the Web link graph. The algorithm presented here, called Quadratic Extrapolation, accelerates the convergence of the Power Method by periodically subtracting off estimates of the nonprincipal eigenvectors from the current iterate of the Power Method. In Quadratic Extrapolation, we take advantage of the fact that the first eigenvalue of a Markov matrix is known to be 1 to compute the nonprincipal eigenvectors using successive iterates of the Power Method. Empirically, we show that using Quadratic Extrapolation speeds up PageRank computation by 25--300\% on a Web graph of 80 million nodes, with minimal overhead. Our contribution is useful to the PageRank community and the numerical linear algebra community in general, as it is a fast method for determining the dominant eigenvector of a matrix that is too large for standard fast methods to be practical.

[dbpubs.stanford.edu:8090...]

Topic-Sensitive PageRank

2002


In the original PageRank algorithm for improving the ranking of search-query results, a single PageRank vector is computed, using the link structure of the Web, to capture the relative "importance" of Web pages, independent of any particular search query. To yield more accurate search results, we propose computing a set of PageRank vectors, biased using a set of representative topics, to capture more accurately the notion of importance with respect to a particular topic. By using these (precomputed) biased PageRank vectors to generate query-specific importance scores for pages at query time, we show that we can generate more accurate rankings than with a single, generic PageRank vector. For ordinary keyword search queries, we compute the topic-sensitive PageRank scores for pages satisfying the query using the topic of the query keywords. For searches done in context (e.g., when the search query is performed by highlighting words in a Web page), we compute the topic-sensitive PageRank scores using the topic of the context in which the query appeared.

aaaaa

7:41 pm on May 17, 2003 (gmt 0)



The new overture keyword suggestion tool for April is now online

compare the results with the new Yahoo! keyword suggestion tool that comes up on General Yahoo Searches

-- Related: --> More... --> Show All...

compare the results for the phrase

"search engine"

Yahoo!

lyrics search engine, google search engine, free search engine,
search engine submission, search engine optimization, yahoo search engine,
mp3 search engine, search engine ranking, search engine listing, site search engine,
search engine submit, best search engine, search engine placement,
search engine registration, image search engine, search engine positioning,
search engine software, search engine marketing, xdcc search engine,
malaysia search engine, music search engine, altavista search engine,
web search engine, internet search engine, midi search engine, uk search engine,
song lyrics search engine, what is a search engine, msn search engine,
register search engine, medical search engine, movie search engine,
picture search engine, video search engine, chinese search engine,
australian search engine, business search engine, college search engine,
russian search engine, indonesia search engine, file search engine,
aol search engine, meta search engine, photo search engine, asp search engine,
canada search engine, top search engine, search engine code, search engine directory,
japanese search engine, christian search engine, search engine promotion,
science search engine, arabic search engine, price search engine, php search engine,
philippine search engine, dogpile search engine, search engine script,
book search engine, search engine strategies, irc search engine, car search engine,
search engine source code, news search engine, china search engine,
search engine watch, javascript search engine, goggles search engine,
article search engine, ppc search engine, japan search engine,
underground search engine, spanish search engine, history search engine,
search engine keywords, french search engine, definition of search engine,
search engine results, bible search engine, search engine advertising,
journal search engine, indian search engine, travel search engine,
mamma search engine, real estate search engine, mpeg search engine,
german search engine

=================================================
OVERTURE


Searches done in April 2003
Count Search Term
891632 search engine
182055 best search engine
100591 google search engine
63150 lyric search engine
59341 search engine list
53580 search engine secret
49244 register at search engine
42711 search engine optimization
42141 search engine submission
40441 submit site search engine
34898 job search engine
32113 yahoo search engine
29702 search engine ranking
28438 search engine listing
22793 internet search engine
20487 adult search engine
18437 search engine placement
17589 people search engine
15301 search engine positioning
14671 search engine advertising
13656 free search engine
13555 search engine marketing
12557 free search engine submission
12047 meta search engine
11440 image search engine
9961 search engine registration
9372 mp3 search engine
8525 web search engine
8289 search engine rank
8044 search engine promotion
7792 search engine submit
6984 search engine position
6672 search engine directory
6483 sex search engine
6440 search engine optimisation
6305 pay per click search engine
5509 ftp search engine
5418 search engine optimization company
5370 goggle search engine
5229 search engine services
4790 porn search engine
4769 top search engine ranking
4725 uk search engine
4712 top search engine
4598 major search engine
4028 gay search engine
4023 canadian search engine
3997 search engine statistics
3874 altavista search engine
3790 search engine optimization services
3788 medical search engine
3761 alta vista search engine
3737 picture search engine
3564 engine hardrive phrase search search specific that there will words
3555 music search engine
3467 free people search engine
3377 song lyric search engine
3169 high search engine ranking
3118 search engine traffic
3036 search engine software
2966 australian search engine
2825 video search engine
2803 dog pile search engine
2757 free adult search engine
2748 crack search engine
2709 search engine positioning service
2634 search engine submission service
2623 msn search engine
2601 search engine positioning services
2499 email search engine
2485 college search engine
2479 search engine optimization service
2388 search engine submission software
2321 christian search engine
2296 search engine placement services
2288 top search engine placement
2226 russian search engine
2204 perl search engine
2125 search engine optimization firm
1981 mamma search engine
1871 kid search engine
1825 spanish search engine
1795 midi search engine
1793 genealogy search engine
1776 multimedia search engine
1775 free search engine submit
1711 uk search engine optimisation
1694 search engine web site
1669 engine marketing optimization search site
1657 search engine submission services
1627 employment search engine
1594 excite search engine
1579 search engine marketing company
1568 engine googles search
1554 canada search engine
1544 xxx search engine
1542 european search engine
1473 other search engine
1468 search engine ranking optimization
1428 underground search engine

{Yahoo comprises the majority of OVERTURES RESULTS}

aaaaa

3:14 pm on Jun 9, 2003 (gmt 0)



Google is possibly no longer displaying the recent date
that a given Web Page had spidered - when it has been spidered in the last two days...

You can however, use the "daterange" query to find out when a site was last spidered and which links to the site was last spidered...