Forum Moderators: open
The MediaBot crawls deeper into the site without issue. The site runs AdSense.
Could there be anything in the server config that is causing this? It isn't robots.txt. The index page is lo-fi and xenu crawls it fine, as does the searchengineworld sim spider.
Any ideas?
New site launched middle of February with 2-3 good incoming links to it - Google found the site within a few days and indexed the home page. Almost every day since then Googlebot has been back but only grabbed the robots.txt page and home page.
That changed as of this morning (almost exactly a month after Google first discovered the site) and Google is currently slowly working through the pages of my site. Big relief ;)
Good luck everyone and be patient, I'm sure Googlebot will get to your sites - it's just taking a bit longer that's all ;)
Yesterday it also lost this data but today it reappeared but in the same old version. I cannot find any bot activity in my logs for this site. What is going on?
WRT
For the people who are having this problem and the people who are not, can we post details in the hope a common link will be found.
November - 5000 Google hits/month
January - 2000 Google hits/month
February - negligible Google hits/month
March - Zero Google hits/month
The only traffic I now get from google is my site:www,mydomain.com check :o(
Of the sites that are being ignored, how are you linking your sites? Or, how are you getting them found by Google?
I for one created directories that allow a user to visit any of our sites from any of our sites. But now that I look at it they smack of "LINK FARMS" because we have hundreds of sites? Hmmmmmmmmmm
Anyone else?
-s-
keyword1-keyword2.myoldersite.co.uk
I launched another site last Friday, 12th March, and am already seeing some gglbot crawling to deep pages (ie not just homepage) today, after just these few days. Quite why or how is anyones guess, this is also under a subdomain of the same related subject existing site
keyword1-keyword2-keyword3.myoldersite.co.uk
(myoldersite.co.uk has been in googles index for over a year now)
Both have similar inbounds setup to them, the former site has about 1300 pages with affiliate related links on it, the latter approx 120 pages and showing adsense links down the RHS.
Both sites have a selection of non commercial links out, to content sites on related topics.
The bot I am seeing on the new 4 day old site is Googlebot/2.1, 64.68.82.178, so am I right in thinking this is a proper crawl I'm beginning to see? (I also see the Adsense Mediapartners bot as expected)
Last Wednesday (10th) I launched another different site, also a subdomain, and that too is seeing some limited deepcrawls today from 64.68.82.136.
Are both these 64xx ips the pukka deepcrawling bots?
thanks
DoU
A couple new sites that had been waiting for many weeks are finally in the index.
Another site that has been waiting for an update (new URL's, titles etc.) had 100+ new pages included as of this morning.
I don't see a major shake-up in the search results yet though, and the brand new pages aren't ranking well (whereas the updated ones are doing fine). The new pages only show up in searches for very specific long phrases copied from the page text.
One strange thing: When I do a site:www.mydomain.com search, only a handful of the new pages are shown, and the others are lumped together under the "repeat search with omitted results included", even though they have very different content, titles, etc., with static URL's. I've never seen this happen to this extent.
Does anybody else have the impression, that brand new pages are not ranking to their full potential yet?
I also started 3 new projects at the same time and finished all of them mid of feb. And this time i have all of them indexed in google (Not full site) but more than 70% pages are there in SERPS. People have to wait for some more time to get the site crawl by GB.
Best of luck to All
Thans
Exp...
I not sure I'm ready to start exploring any conspiracy theories with regards to whois, dmoz.
It would be nice to hear from GG with some explanation. Are they retooling the crawlers so there has been less crawl activity lately? Is there some sort of a tweak in the crawl algo?
My whois info is public.I not sure I'm ready to start exploring any conspiracy theories with regards to whois, dmoz.
Thank you for answering my question, which was in no way intended to start a conspiracy theory. I am however intrigued by your mention of dmoz. Am I missing something?
added ...woops, checked the previous page and I see what you are getting at about dmoz. Sorry about that.
I'm experiencing the same things with a site we can consider new: in fact the site has more than two years but till this january was not SE friendly (it was not user friendly also!) -it had very long variables in the URL and used frames. I rewrite the entire code the first days of this year, took out the frames, and set urls with only two variables max.
The site is indexed in several directories as yahoo, dmoz, etc, but added fresh external links in a daily crawled site, and googlebot came very soon, only to visit the homepage.
During January, googlebot never returned, but the new homepage was indexed very soon. Then i realized (thanks to webmasterworld!) i couldn't get success passing variables named "ID" trough the URL, and fixed this the first days of february. Googlebot returned, again, only to visit the index, but came daily six or seven times. And thanks to other external link, also indexed an internal page. The last two weeks of february, i had no news of googlebot. (The homepage is changed everyday).
So, starting march, i decided to use the apache rewrite mod to show urls as if they were static. Since i did so, googlebot is coming everyday but only to reach the homepage. This is for the last two weeks. This daily visit was very regular up to march 16th. From that day, googlebot didn't return.
After reading this topic i'm expecting a deep crawl soon. If it happens, i'll tell you. If you think there's any way of helping that, please tell me.
[edited by: Patricio at 2:43 pm (utc) on Mar. 18, 2004]
[edited by: Powdork at 4:06 pm (utc) on Mar. 18, 2004]