Forum Moderators: Robert Charlton & goodroi
Now this is interesting.
A site with 40 000 "real" pages and some 80 000 duplicate content pages excluded using robots.txt (it's a forum - see my prior posts about vbulletin) and still some 80 000 duplicate pages that are not yet so excluded.
Additionally some 500 000 non-thread pages also excluded in robots.txt and most of those already delisted. The whole site is listed as www; nothing is listed as non-www at all.
Looking purely at indexed threads:
site:domain.com shows 90 000 www pages all as normal results; including some duplicate content that will eventually be excluded.
site:domain.com -inurl:www shows 24 000 www pages all of which are marked as Supplemental Results and all of which also have an old cache date. This search should show zero results. It certainly should not be showing www pages at all, the search was for "-inurl:www". What is going on?
[edited by: tedster at 8:44 pm (utc) on June 13, 2006]
[edit reason] split into new thread [/edit]
According to his blog: travelling overseas (work related) from May 27 to June 11 or thereabouts.
I chatted with him at SES on June 1st and 2nd, and I expect that his first week back at work will have a major backlog of email and work so no forum time to be had at all.
As for Matt, he blogged that he was off for 6 weeks, and that was currently less than 4 weeks ago.
not ranking for their most obvious terms, but come up just fine with quotes
I have this problem with one of my sites, a strange thing is though, if I check on mcdar I'm gone for my main term, but there with quotes, at the same time if I don't use the mcdar tool and go straight to the datacentre ip address (64.233.161.107 ) in my browser, I'm back to where I was, top for my term.
My guess is that it is a bug, unless geo targetting is coming into play, as in where you are connecting from. I see this alot when I use country specific proxies in my browser, different results depending on where they think your connecting from.
Has anyone else noticed this with the mcdar tool and then tried connecting directly to the datacentre and your there again?
So does anyone know how ODP titles fit into this mix, I was under the impression that if the ODP titles were used for a result, then every listing would have their ODP title showing, mine was not, but the others were.
I did recently put the MSN
<META NAME="ROBOTS" CONTENT="NOODP" />
tag in place, but thought this was only MSN specific, google would'nt be using this against us would they?
Good hear that you made face to face contact g1smd - I'm restricted to the forum boards :)
-Is anyone seeing their page count re-instated over the last week?
We have one site that's caching and showing on the site:tool
A new site that's fully indexed inside 2 months has me perplexed versus our other established sites which just seem to sit "in suspense" with only a small proportion of pages cached.
I wonder if there is something massive going on and we are only seeing the tip of the iceberg.
[technology.guardian.co.uk...]
Sid
There's something wierd going on with the allin filters.
On 72.14.203.104, a turd DC, for all of these searches
allinanchor:blue widgets
allintitle:blue widgets
allintext:blue widgets
The top 9 remain the same and are the same as the Copra results for blue widgets in each of these searches.
On google.co.uk (216.239.59.104) a Copra DC
allinanchor:blue widgets is the same as the standard search and the same as the above.
allintitle:blue widgets The site at #3 loses its indented listing.
allintext:blue widgets The pages at #1 and at #7 drop out of the top 100 (I stopped looking at 100) and some extra indented listings appear.
When I go directly to 216.239.59.104 or google.com at the same DC IP allintext:blue widgets is the same as the standard search for blue widgets. ie the #1 and #7 still appear.
On 64.233.161.147 a Copra DC
allinanchor:blue widgets #1 drops out, indents disappear
allintitle:blue widgets same as standard results
allintext:blue widgets same as standard results
There is something clearly wrong here. How can one DC have a page #1 for allinanchor and another not list the page at all?
Also the .co.uk filter on allintext is a bit odd don't you think?
The worrying thing for me is the fact that my page is the one at #1 that erroneously drops out.
Can anyone here confirm or deny these findings?
Any comments?
Sid
Have you also noticed that keywords on an allinanchor are not bolded in the Copra results.
This only normally happens when it is a supplemental result that is being returned (although these arent supplemental results - well at least not marked or cached as such)
First the site got hit by the redirecting googlebug 302, then hijacked, thne the non www issue and now the index page is gone, the site is as clean as it gets, not even exchanging links.
Hmm that is the first time I have seen that, now my domain name(index page) gone and the rest is supplemental, I think now I have seen it all.
First the site got hit by the redirecting googlebug 302, then hijacked, thne the non www issue and now the index page is gone, the site is as clean as it gets, not even exchanging links.
Ditto for my site too, index page is not showing when searched using site: command and this is since yesterday.
66.102.7.147
64.233.161.99
64.233.161.104
64.233.161.107
64.233.161.147
64.233.167.99
64.233.167.104
64.233.167.147
64.233.183.99
64.233.183.104
64.233.183.107
64.233.187.99
64.233.187.104
216.239.39.99
216.239.39.104
216.239.39.107
216.239.53.99
216.239.53.104
216.239.53.107
216.239.57.99
216.239.57.104
216.239.57.107
216.239.59.99
216.239.59.107
216.239.59.104
216.239.59.147
216.239.63.104