Forum Moderators: mack

Message Too Old, No Replies

Is Google able to view my site?

Is Google able to view my site?

         

PumpkinHead

10:04 pm on Apr 6, 2005 (gmt 0)

10+ Year Member



Hi,

Google is now looking at my site, but I'm not sure if it's able to crawl through my full site. How do I tell? I have awstats which is provided by my webhost and a bunch of others but they don't seem to tell me how many/which pages have been crawled. (Just says "1 Hit" 15.97 KB bandwidth..its a small site).

Also, although Google is now looking at my site, I still don't have a description for my site under my google entry. The strange thing is, for 1 day I was on page 7 (with a description) and now I'm nowhere to be found!

Thanks in advance :)

specter

7:32 am on Apr 7, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Hi,

It sounds like a log file problem.Maybe your host cannot provide you an adequate log file.
As In a good log file is reported much more than "1 Hit" 15.97 KB bandwidth..
I too have a small web site hosted without access possibilities to a log file.

As regard of your Google entries,probably in a first crawling the spider has recognized succesfully some pages and regularly indexed with their titles/descriptions,while in a second one it wasn't able.
This could mean a bad quality of your internal links.
Review them making sure that almost the pages you want indexed are linked to the home page by a textual link,as spiders recognize it more easily than other "complex" linkings.

PumpkinHead

12:22 pm on Apr 7, 2005 (gmt 0)

10+ Year Member



Hi,

Thanks for the reply, how would you define "complex" linkings?

PumpkinHead

12:53 pm on Apr 7, 2005 (gmt 0)

10+ Year Member



Just had a look using another stats program. It appears google looks at the following 2 locations...

Firstly..

/robots.txt

(Result http code 404 as it doesn't exist)

Secondly...

/

(Result http code 200)

These are the only 2 locations it looks at. Should I create a robots.txt file?

specter

4:14 pm on Apr 7, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Thanks for the reply, how would you define "complex" linkings?

I would mean linking pages by images,animations and so on.

Should I create a robots.txt file?

It would be a good thing,yes.

topr8

4:25 pm on Apr 7, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



/

(Result http code 200)

this means that google has found your home page and all is well

Should I create a robots.txt file?

robots.txt is about EXCLUDING robots not allowing them - if there is no robots.txt then any robot including googlebot will assume it can spider your entire site - most sites on the web don't have one, it is not needed, should you decide to create one there is a whole forum here at WebmasterWorld dedicated to it.

PumpkinHead

6:15 pm on Apr 7, 2005 (gmt 0)

10+ Year Member



Yeah thought that was the case, might make one just so I limit the number of 404s that appear in my webstats. Looks like all is fine then, will just wait for Google to look through some more pages (when the site gets more popular I presume).

Thanks all :)

cmendla

12:54 am on May 7, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



You can always tell what Google has currently indexed by going to google and searching

site:http://www.yoursite.com -asdasdasdasd

The site: tells it to search only your site. the -asdasdasdasd says to only show those pages that don't have asdasdasdasd on them.

You should have a pretty good idea of how many pages you have in your site. This will tell you how many pages google is 'seeing'

There will be some fallout for duplication on weak pages. For example, I have a page that links to about 50 different pages of widgets. Only about 10% of those pages are in the index.

What happens is that the page with all the links is a little stronger so google indexes that instead of the individual pages.

Chris

MichaelBluejay

5:47 am on May 8, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



You no longer have to include the -asdfasdf or whatever. Simple site:domain.com works fine. Google improved that months ago.

JKMitchell

9:08 pm on May 12, 2005 (gmt 0)

10+ Year Member



From experience Googlebot often seems to read the home page several times over a period of (maybe) 4 or 5 weeks before it gets round to reading the other pages.

This can be very exasperating, especially when at other times the bot comes round in a couple of days and spiders the complete site.

A couple of things you might want to do :-

1. Build a site map that has a link to every page of your site and link to this from the home page.

2. Try and get links from other sites to lower level pages.

Hope this helps