I have a particular product page on a 200-page ecommerce site that's missing from the Google index. It has a video on it, and is represented in my video sitemap. A long tail search will produce the video sitemap version of the page (the SERP page title matches the one submitted in the video sitemap - not the HTML title tag).
I want to understand why the page isn't in the primary index - but I'm not sure how to identify the problem. Potential problems in order of likelihood:
1. Dupe content - the main copy for this page has been copied onto 5-10 other sites.
2. Google-inspired confusion in the supplemental listings - as I understand it, video search will create a supplemental page in the index for the video component of universal search. My experience says this is still a little beta-flakey.
3. Possible (but unlikely) that an A/B split test 301ed googlebot to a no-indexed version of the page under a different URL.
Can anyone explain how I can determine which of these potential issues is the culprit?
Will the page self-correct if I fix the issue, or do I need to build a new page? If I build a new page, can I 301 the old one to the new, or will that torpedo the new?
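For what it's worth, possibility 3 is the easiest one to test directly. Here's a rough Python sketch, not a definitive check: it requests the page with a Googlebot-style user agent, reports where any redirects end up, and looks for a noindex directive in the meta robots tag or the X-Robots-Tag header. The function and class names are made up for illustration. It assumes the split test keys off the user agent string; if it keys off IP address or runs in JavaScript, this won't reproduce what googlebot actually saw, and server logs would be the better evidence.

```python
from html.parser import HTMLParser
import urllib.request

# User agent string Googlebot identifies itself with.
GOOGLEBOT_UA = "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"


class RobotsMetaParser(HTMLParser):
    """Collects directives from <meta name="robots"> / <meta name="googlebot"> tags."""

    def __init__(self):
        super().__init__()
        self.directives = []

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = {name: (value or "") for name, value in attrs}
        if attr.get("name", "").lower() in ("robots", "googlebot"):
            content = attr.get("content", "")
            self.directives += [d.strip().lower() for d in content.split(",")]


def has_noindex(html, x_robots_tag=""):
    """True if the HTML or the X-Robots-Tag header value carries 'noindex'.

    Simplification: header values scoped to one bot ("googlebot: noindex")
    are not handled here.
    """
    parser = RobotsMetaParser()
    parser.feed(html)
    header = [d.strip().lower() for d in x_robots_tag.split(",")]
    return "noindex" in parser.directives or "noindex" in header


def fetch_as_googlebot(url):
    """Return (final_url, noindex_flag) after following any redirects."""
    req = urllib.request.Request(url, headers={"User-Agent": GOOGLEBOT_UA})
    with urllib.request.urlopen(req, timeout=10) as resp:
        html = resp.read().decode("utf-8", errors="replace")
        header = resp.headers.get("X-Robots-Tag", "") or ""
        return resp.geturl(), has_noindex(html, header)
```

If `fetch_as_googlebot` returns a final URL that differs from the one requested, or the noindex flag comes back true, possibility 3 moves up the list. A clean result only describes the page as it is served today, so it can't rule out a test that has already ended.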
The SERP shows the HTML page, but with the video file's title? Very strange, since the url in the video sitemap would be for some form of video format (flv, mp4, swf, or whatever) and not for an HTML page (htm, asp, php, etc.). You'd think they would be kept distinct.
But it could be some back end tangle at Google, I suppose, since the video sitemap does have a <loc> element for the landing page. Can you experiment with making the two titles the same? I'm thinking that Google may work better if all the ducks are in a row.
the url in the video sitemap would be for some form of video format
To clarify - the title shown in the SERP is the title specified in the title element of the video sitemap. The URL is common to the regular sitemap and the video sitemap (the loc element in the latter).
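To make that concrete, here is a small Python sketch that pulls the landing page URL, the video title and the video file URL out of a video sitemap entry, so the sitemap title can be compared against the page's HTML title. The sample entry is invented (example.com URLs, made-up title) and trimmed to the tags under discussion; a real entry also carries tags such as a thumbnail and description. The 1.1 video namespace is an assumption, so match it to whatever your own sitemap declares.

```python
import xml.etree.ElementTree as ET

# Namespaces: the standard sitemap schema plus Google's video extension.
NS = {
    "sm": "http://www.sitemaps.org/schemas/sitemap/0.9",
    "video": "http://www.google.com/schemas/sitemap-video/1.1",
}

# Hypothetical, trimmed entry for illustration only.
SAMPLE = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9"
        xmlns:video="http://www.google.com/schemas/sitemap-video/1.1">
  <url>
    <loc>http://www.example.com/widgets/blue-widget.html</loc>
    <video:video>
      <video:content_loc>http://www.example.com/media/blue-widget.flv</video:content_loc>
      <video:title>Blue Widget Demo Video</video:title>
    </video:video>
  </url>
</urlset>"""


def video_entries(xml_text):
    """Return a list of (landing_page_url, video_title, video_file_url) tuples."""
    root = ET.fromstring(xml_text)
    entries = []
    for url in root.findall("sm:url", NS):
        page = (url.findtext("sm:loc", default="", namespaces=NS) or "").strip()
        for video in url.findall("video:video", NS):
            title = (video.findtext("video:title", default="", namespaces=NS) or "").strip()
            media = (video.findtext("video:content_loc", default="", namespaces=NS) or "").strip()
            entries.append((page, title, media))
    return entries


def titles_match(sitemap_title, html_title):
    """Case-insensitive, whitespace-normalised comparison of the two titles."""
    def norm(text):
        return " ".join(text.split()).lower()
    return norm(sitemap_title) == norm(html_title)
```

Note that loc is the HTML landing page and content_loc is the video file, which is why the same page URL can show up under both the regular sitemap and the video sitemap.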
There appear to be two versions of the page recognized, the one associated with the video sitemap, plus the normal one -- even though they have the same URL they appear to be two entities in the index. In this case, the normal one is gone, gone, gone.
Is that typically seen with dupe content? Or is the duped page left at the bottom of the bit bucket in the index?