Forum Moderators: open

Message Too Old, No Replies

inclusion in regional Google indices

possible improvements

         

danny

5:25 am on Aug 31, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



From all accounts, web sites are included in Google's regional sub-indices (e.g. "search pages from the UK") based solely on domain name and IP address (ie, on where they are hosted). This is an understandable way to do the selection, since it's easy to automate, but it is obviously going to exclude sites which have opted for com/net/org domains and/or hosting overseas.

I can think of several possible improvements. One that several people have suggested is using e.g. DMOZ categories to select pages. This would require manual selection however, and ongoing maintainance as DMOZ updates and extends.

Another idea would be to allow webmasters to add META tags requesting pages be included in a particular country index. There would obviously be room for abuse here, most obviously with United States sites including a non-US tag just for some extra traffic, but as more regional indices (ie a "United States" or "North America" search!) are added that should decline.

Another option would be to start with a core set of pages selected on domain/IP address, but in addition to those to index pages which have a majority of incoming links from that core. So if a page in blah.com has more than 25% of its links from pages in (say) the core www.google.ie index, it would also be included in Google's "pages from Ireland". This would add some computation to index building, but not I think that much.

Anyone got any other good ideas?

shelleycat

5:37 am on Aug 31, 2002 (gmt 0)

10+ Year Member



So if a page in blah.com has more than 25% of its links from pages in (say) the core www.google.ie index, it would also be included in Google's "pages from Ireland".

I started off thinking this would be a bad thing as my page has (as far as I know) only one incoming link from another New Zealander. But then I realised that this probably means my page isn't important to other kiwis and therefore it becomes a good thing. If I had really cool local information that kiwis all want then I would probably get lots of NZ links and deserve a place in the local index.

The only other thing I can think of is use of whois information for domain names to pinpoint the location of their owner. But I'm sure this would bring all kinds of privacy issues with it and doesn't take into account shared domains and the like.

tigger

6:12 am on Aug 31, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



"it's not broken so why fix it"

IMPO I feel it works fine whenever I look for UK only results that's what it offers me :)

dcheney

6:55 am on Aug 31, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



In the case of sites like my own (non-commerical information site) which happens to cover the entire world (but the contents are only in English, at least for now) - what countries would I indicate? (I get traffic from all over the world - 70+ different countries this month).

ciml

2:42 pm on Aug 31, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



danny:
start with a core set of pages selected on domain/IP address, but in addition to those to index pages which have a majority of incoming links from that core

This is part way to a geographical 'personalised PageRank'.

Each update needs several iterations to converge on a steady PR graph of the Web. At each iteration of PageRank, the rank source could be the 'core' set of matching IP addresses or TLDs. You could then see the 'importance' of a page from a UK, French or German perspective (and Australia, Danny).

That will be very powerful (if they ever get round to it).