

Webmaster General Forum

    
Simulating a bot
Possible to do?
trillianjedi




msg:385016
 4:43 pm on Nov 7, 2003 (gmt 0)

I want to see how Googlebot sees a particular page.

I still have this strange problem of Google indexing pages but caching a completely different page.

I've tried IE with all cookies etc. disabled.

Anything else I can try to work out what is happening?

Thanks,

TJ

 

Strange




msg:385017
 6:02 pm on Nov 7, 2003 (gmt 0)

The quickest way I have found to see what the spider sees is to use a good text browser such as Lynx. It is available for most OS platforms. You can also use the Sim Spider tool [webdevstore.com]. It shows your text, links, and meta tags.
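For anyone who would rather script this than install a tool, here is a minimal sketch of the same idea: fetch the page with a bot-style User-Agent and no cookies, then list the text, links, and meta tags. The names `BOT_USER_AGENT`, `SpiderView`, `spider_view`, and `fetch_as_bot` are made up for illustration, and the User-Agent string is an assumption about what the crawler sends, not a guarantee.

```python
from html.parser import HTMLParser
from urllib.request import Request, urlopen

# Assumption: a Googlebot-style User-Agent string; the real crawler's may differ.
BOT_USER_AGENT = "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"


class SpiderView(HTMLParser):
    """Collects the plain text, link targets, and named meta tags of a page."""

    def __init__(self):
        super().__init__()
        self.text = []
        self.links = []
        self.meta = {}
        self._skip_depth = 0  # greater than 0 while inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag in ("script", "style"):
            self._skip_depth += 1
        elif tag == "a" and attrs.get("href"):
            self.links.append(attrs["href"])
        elif tag == "meta" and attrs.get("name"):
            self.meta[attrs["name"].lower()] = attrs.get("content") or ""

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text that is not script or stylesheet source.
        if not self._skip_depth and data.strip():
            self.text.append(data.strip())


def spider_view(html):
    """Return the text, links, and meta tags found in an HTML string."""
    parser = SpiderView()
    parser.feed(html)
    parser.close()
    return {"text": " ".join(parser.text), "links": parser.links, "meta": parser.meta}


def fetch_as_bot(url, timeout=10):
    """Fetch a URL with the bot User-Agent; urlopen sends no cookies by default."""
    request = Request(url, headers={"User-Agent": BOT_USER_AGENT})
    with urlopen(request, timeout=timeout) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")
```

Calling `spider_view(fetch_as_bot(url))` on the problem page gives a rough Lynx-like view. It ignores JavaScript and CSS entirely, so treat it as an approximation of what a crawler reads.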

Mohamed_E




msg:385018
 7:17 pm on Nov 7, 2003 (gmt 0)

The old wisdom was that Googlebot saw what Lynx sees. We know that Googlebot is beginning to understand some JavaScript (enough to find links in some cases), and some suspect that she is learning enough CSS to detect some forms of invisible text.

So while I do not have any suggestions to answer the immediate question, I am a bit leery of the old assumption that what Lynx sees is what Googlebot sees. The same, obviously, applies to bot simulators.

Gus_R




msg:385019
 4:45 pm on Nov 8, 2003 (gmt 0)

but caching a completely different page

Any kind of cloaking process? (Not specifically for SEs.)
Are any session variables involved?
What about visitors: do they have the same problem?

Also, a good sim spider: [ranks.nl]
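One way to start answering those questions is to request the same URL under two different User-Agent strings, without following redirects, and compare the results. The sketch below is a rough illustration only: `probe`, `summarize`, and `differences` are hypothetical helper names, and it assumes the server varies its response on the User-Agent header rather than on IP address.

```python
import hashlib
import http.client
from urllib.parse import urlsplit


def summarize(status, location, body):
    """Reduce a response to the fields worth comparing."""
    return {
        "status": status,
        "location": location,
        "body_sha1": hashlib.sha1(body).hexdigest(),
    }


def probe(url, user_agent, timeout=10):
    """GET a URL once, without following redirects, and summarize the reply."""
    parts = urlsplit(url)
    if parts.scheme == "https":
        conn = http.client.HTTPSConnection(parts.netloc, timeout=timeout)
    else:
        conn = http.client.HTTPConnection(parts.netloc, timeout=timeout)
    try:
        path = parts.path or "/"
        if parts.query:
            path += "?" + parts.query
        conn.request("GET", path, headers={"User-Agent": user_agent})
        response = conn.getresponse()
        body = response.read()
        return summarize(response.status, response.getheader("Location"), body)
    finally:
        conn.close()


def differences(view_a, view_b):
    """Return the sorted names of the fields on which two summaries disagree."""
    return sorted(key for key in view_a if view_a[key] != view_b.get(key))
```

Running `probe` once with a browser User-Agent and once with a bot one, then passing both results to `differences`, shows whether the status code, redirect target, or body changes. Dynamic pages can differ between any two requests, so a body mismatch alone does not prove cloaking.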

trillianjedi




msg:385020
 5:12 pm on Nov 8, 2003 (gmt 0)

Any kind of cloaking process?

Yes, and one specifically for SEs: a 301 redirect in case any SEs come in via a session ID.

We haven't used SIDs for some months now, but Google still seems to have a few in its cache.

Any thoughts?

Gus_R, I couldn't find a sim spider at that link.

TJ
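For reference, the session-ID redirect described above could look roughly like the sketch below. It is not TJ's actual setup: the parameter names in `SESSION_PARAMS` and the function name `session_redirect` are assumptions chosen for illustration.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Hypothetical session parameter names; substitute whatever the site really used.
SESSION_PARAMS = {"sid", "sessionid", "phpsessid"}


def session_redirect(url):
    """Return (301, clean_url) if the URL carries a session ID, else None."""
    parts = urlsplit(url)
    pairs = parse_qsl(parts.query, keep_blank_values=True)
    kept = [(key, value) for key, value in pairs if key.lower() not in SESSION_PARAMS]
    if len(kept) == len(pairs):
        return None  # no session parameter present, so serve the page normally
    clean_url = urlunsplit(parts._replace(query=urlencode(kept)))
    return 301, clean_url
```

For example, `session_redirect("http://example.com/page.html?sid=abc123&cat=2")` returns `(301, "http://example.com/page.html?cat=2")`. Applying the redirect to every visitor, not only to search engines, avoids serving bots anything different from what users get.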

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved